Zum Hauptinhalt springen

Species discrimination and total polyphenol prediction of porcini mushrooms by fourier transform mid‐infrared (FT‐MIR) spectrometry combined with multivariate statistical analysis

Wang, Yuanzhong ; Li, Xiu-Ping ; et al.
In: Food Science & Nutrition Food Science & Nutrition, 8(2):754-766, Jg. 8 (2020), Heft 2, S. 754-766
Online unknown

Species discrimination and total polyphenol prediction of porcini mushrooms by fourier transform mid‐infrared (FT‐MIR) spectrometry combined with multivariate statistical analysis 

The plateau specialty agricultural products, wild porcini mushrooms, have great value both as a superb cuisine and as a potential medication. Due to quality different between species added with the fraud behavior in sales process, make poor quality or poisonous sample inflow into the market, which pose a health risk for consumers, but also disrupted the mushroom market. Traditional analysis way is time‐consuming and laborious. Therefore, the aim of this study is to develop a way using fourier transform mid‐infrared (FT‐MIR) spectrometry and data fusion strategies for the fast and accurate species discrimination and predict amount of total polyphenol in four porcini mushrooms. The t‐distributed stochastic neighbor embedding based on mid‐level data fusion showed two species of Boletus edulis and B. umbriniporus have been identified. The order of correct rate of PLS‐DA models was mid‐level data fusionq (100%) > mid‐level data fusione (97.06%) = mid‐level data fusionv (97.06%) = stipes (97.06%) > low‐level data fusion (94.12%) > caps (91.18%). The order of correct rate of grid‐search support vector machine models was low‐level data fusion (100%) > caps (94.12%) > stipes (91.18%), and the order of particle swarm optimization support vector machine was low‐level data fusion (100%) > caps (97.06%) > stipes (88.24%). The mid‐level data fusionq and low‐level data fusion had best discrimination accuracy (100%) allowing each mushroom classed into its real species, which could be used for accurate discrimination of samples. B. edulis mushrooms had highest total polyphenol, with 14.76 mg/g dw and 17.33 in caps and stipes mg/g dw, respectively. The phenols were easier to accumulate in the caps in Leccinum rugosiceps (1.03) and B. tomentipes (1.19), and the opposite phenomenon is observed in B. edulis (0.85) and B. umbriniporus (0.95). The correlation coefficient and residual predictive deviation of best prediction model were 86.76% and 2.40%, respectively, indicating that that there is good relevance between FT‐MIR and total polyphenol content, which could be used to predict roughly polyphenols content in mushrooms.

Keywords: data fusion; FT‐MIR spectroscopy; porcini mushroom; species discrimination; total polyphenol prediction

The current study developed a way with fast, cheap and reliable for discriminating mushroom species and a way to roughly predict polyphenol content in mushrooms. Besides, the study applied three ways of feature extraction and provided data of content and enrich ability of total polyphenol in different morphological part of mushroom for the first time.

fsn31313-toc-0001.jpg

INTRODUCTION

Mushrooms existed on earth before human appeared (Bilal, Ahmad, & Wani, [2]) and have been consumed over a long period of time (Manzi, Gambelli, Marconi, Vivanti, & Pizzoferrato, [25]). The edible porcini mushrooms as a kind of macro‐fungus belong to Boletus and Leccinum genus is large enough to be picked and ate valued in gourmet. These mushrooms are known as reputation of "vegetable meat" with characteristics of hypertrophic fruiting body, pleasant aroma, and unique taste. Besides, they have other features of high in proteins while low in fat and calories (Manzi et al., [25]). The porcini mushrooms contain natural chemical compositions such as proteins, phenols, flavones carbohydrates, alkaloids, vitamins, minerals, which are useful in nutrition and medicine purposes like antioxidant, antiviral, anti‐tumor, antifungal, and enhance immunity (Bilal et al., [2]; Kaewnarin, Suwannarach, Kumla, & Lumyong, [17]). They have also many mineral compounds including valued selenium with high antioxidant properties, but it can be dependent on species and its place of origin (Bekyarov et al., [1]). Polyphenols are one of the compound which have positive function for human health (Wong & Chye, [38]), wildly finding in vegetables, fruits, and grain products (Velioglu, Mazza, Gao, & Oomah, [36]). Studies have revealed that polyphenols available from mushrooms have higher antioxidant activities compared with many fruits and vegetables ([15], & Foong, 2[15]; Jia, Tang, & Wu, [16]; Velioglu et al., [36]).

The total porcini mushrooms in Yunnan Province are approximately to be 224 species in which 114 species are edible. Porcini mushrooms are one of the "four kings of mushroom" and are one of the traditional bulks exported agricultural products. According to statistical data, export quantity of wild porcini mushrooms from Yunnan Province accounts for about 80% of total export quantity of China (http://ynsyj.yncoop.com). The average price per kilogram of hot‐selling porcini mushrooms can reach about 90 CNY according to market investigation at August 2019. Due to high economic value, mushroom can be used by people of noncompliance with the morals to seek profits. Some fraud actions have been reported. Several mushrooms without formally scientific names have been sold with the mushrooms from species of porcini (Dentinger & Suz, [10]). The fraudulent action is also reported by Casale et al. ([4]), which revealed that commercial dried mushrooms which are composed of Boletus edulis and related species have been adulterated with B. violaceofuscus. These actions have seriously influencing mushrooms quality and interfering quality supervision, resource evaluation, food safety, and even man health. In addition, there is just one specie namely Phlebopus portentosus can be cultivated successfully, and the over‐picking by local peoples of lower income makes the seasonal wild porcini mushroom increasingly rarer. These reasons stated above confirm the urgent necessity regarding the species discrimination for porcini mushrooms.

Morphology is a common method based on surface features from fruiting body such as color, shape, size, or reticulate pattern (Tsujikawa et al., [35]). However, this method did neglect the phenotypic variability of mushroom. Besides, fruiting body will lose its characteristics when it suffered from drying, dehydration, salting, and bleaching, and usually processed into slice, can, or extractum to adapt long‐distance transportation for exporting. Molecular tools have been developed to elude these flaws. Internal transcribed spacer (ITS) primer of 28 B. edulis samples combining phylogenetic tree analysis has provided an effective identification and revealed that B. edulis mushrooms are often sold by intermixed with mushrooms species of B. violaceofuscus (Mello et al., [26]). Another study for species clarification used ITS and fungal immunomodulatory protein (FIP) sequences (Zhou, Liu, Guo, Su, & Zhang, [43]), shown that mushrooms from Ganoderma actually belong to different species. Now, nevertheless, infrared spectroscopy is preferred by researchers because this analytical technique has advantages of wide range of applications (liquid, solid, and gaseous samples), less sample consumption, without destroying samples and rapid, not only for compound identification and molecular structure characterization, but also for quantitative analysis (Borràs et al., [3]; Dutta, [11]). Just as there are no two identical leaves in the world, there are also differences between samples, which can be displayed in the infrared spectrum. The mushroom specie of Catathelasma ventricosum had higher amounts both of total phenols (9.24 ± 0.42 mg/g dw) and total tocopherols (2.76 ± 0.22μg/g dw) than species of Clitocybe maxima (6.96 ± 0.32 mg/g dw and 0.62 ± 0.02 μg/g dw), Stropharia rugoso‐annulata (5.52 ± 0.45 mg/g dw and 1.34 ± 0.17 μg/g dw), Craterellus cornucopioides (5.39 ± 0.28 mg/g dw and 1.94 ± 0.43 μg/g dw), and Laccaria Amethystea (9.08 ± 0.54 mg/g dw and 2.72 ± 0.27 μg/g dw) (Liu et al., [22]). Therefore, the C. ventricosum species may have higher power to decrease the oxygen radicals (Kim et al., [20]). Influenced by latitude, the sea buckthorn berries collected from Québec, Canada, may have better pleasant taste than those grown at Sammalmäki, Finland, which contributed by higher levels of total sugar and sugar/acid ratio and a lower level of total acid (Zheng, Yang, Trépanier, & Kallio, [42]). It is this variability between samples that is key for building spectroscopic fingerprint spectrum and make it possible to track the sample source. Fingerprint spectrum available from Fourier transform mid‐infrared (FT‐MIR) spectra has been united with chemometrics for food quality like tannin characterization (Tondi & Petutschnigg, [33]), honey adulteration (Das et al., [9]), camel milk metamorphism (Nagy et al., [27]), content determination of caffeine and trigonelline (Hagos, Redi‐Abshiro, & Chandravanshi, [12]), variant screen of short‐chain fructooligosaccharides (scFOS) (Trollope, Nieuwoudt, Görgens, & Volschenk, [34]), and storage effects on saffron (Ordoudi, de los Mozos Pascual, & Tsimidou, [29]). Despite the excellent practicability of FT‐MIR spectroscopy, potential flaw of the single data matrix is that it is not enough to represent the entire sample (Borràs et al., [3]).

Data fusion is a rising technology intersected by multiple disciplines for resulting in a satisfactory inference. Thanks to the rapid development of computers, improvement of experimental instruments, and gradual maturation of data fusion; data fusion has been broadly used in many areas including environment supervision, food quality, medical diagnosis, military, mapping, and robot (Hall & Llinas, [13]). Low‐level data fusion is an effective strategy to distinguish official rhubarb (recorded in the Chinese Pharmacopoeia) and unofficial rhubarb (Sun, Zhang, Zhang, & Zhu, [32]), and also to trace geographical origin of herbal medicine of Panax notoginseng (Li, Zhang, & Wang, [21]). Another commonly used strategy is mid‐level data fusion. Unlike direct data splices in low‐level data fusion, mid‐level data fusion digs effective information available from singe data matrices in order to enhance run speed of algorithm and output a better outcome. This strategy of mid‐level data fusion has been used to trace specie and geographical origin of Porcini mushrooms (Yao, Li, Liu, Li, & Wang, [41]) and also to classify organic and nonorganic orange juices (Cuevas, Pereira‐Caro, Moreno‐Rojas, Muñoz‐Redondo, & Ruiz‐Moreno, [8]), and the manufacturer of beer with same brand and product could be discriminated (Vera et al., [37]). According to literatures, the several common ways of feature selection are: as follows (1) latent variables (LVs) which selected according to R2Y(cum) and Q2(cum) based on partial least squares‐discriminant analysis (PLS‐DA) model (Yao et al., [41]), (2) principal components (PCs) obtained from principal component analysis (PCA) (Vera et al., [37]), (3) variable importance in the projections (VIPs) picked by the values of VIP >1 ([30], & &. Wang, 2[30]). R2Y(cum) and Q2(cum) were used to assess the ability of the model to fit data of training set and to predict new sample (test set). The R2Y(cum) represents the degree of fitting the data; a large value (close to 1) is usually a necessary condition for a good model. However, the Q2(cum) represents the predictivity of model, a large value (>0.5) indicates good potential to predict sample origin. VIP values larger than 1 indicate "important" X‐variables, while the values lower than 0.5 indicate "unimportant" X‐variables.

Although there are many studies on the species discrimination using data fusion strategy, there is no enough information regarding the data fusion using different morphological parts of porcini mushrooms of Yunnan province. The current study aims to find a fast and simple way to discriminate species and predict content of polyphenol using Fourier transform mid‐infrared (FT‐MIR) spectra available from different morphological parts of mushroom.

METHODS

Sampling and sample preparation

A total of 100 fruiting bodies of the four different species (Boletus edulis Bull. Fr, Leccinum rugosicepes (Peck) Sing, B. tomentipes Earle and B. umbriniporus Hongo) were collected from Yuxi, Yunnan province, China, during 2011–2012. The detailed information both of samples and collected place is shown in Table. Picking behaviors were carried out in the forests. In other words, the collection locations were in mountain area, which is far away from villages and human activities. Therefore, these mushrooms could be considered as clean and undamaged. Fruiting bodies of similar‐sized and robust were selected for analysis.

The information of mushrooms samples

CodeSpeciesLocationQuantity
1Boletus edulis Bull. FrPubei, Yimen, Yuxi11
2B. edulis Bull. FrPubei, Yimen, Yuxi7
3B. edulis Bull. FrPubei, Yimen, Yuxi10
4B. edulis Bull. FrPubei, Yimen, Yuxi10
5Leccinum rugosicepes (Peck) SingPubei, Yimen, Yuxi9
6L. rugosicepes (Peck) SingPubei, Yimen, Yuxi7
7B. tomentipes EarleFuliangpeng, Eshan, Yuxi6
8B. tomentipes EarleXiaojie, Eshan, Yuxi9
9B. tomentipes EarleTongchang, Yimen, Yuxi10
10B. tomentipes EarleChah, Eshan, Yuxi7
11B. umbriniporus HongoHuangcaoba, Yuxi8
12B. umbriniporus HongoTongchang, Yimen, Yuxi6

The fresh fruiting bodies were treated using soft brush to cast off soils and other litters (leaves or branches) and then washed with running tap water until no impurity is visible by naked eyes. These watery mushrooms were dried in laboratory oven at 50°C for constant mass and crushed immediately. It is underlined that each dried fruiting body was crushed after dividing into two morphological parts, namely cap and stipe. Finally, the crushed sample was passed through an 80‐mesh sieve and then stored in zip lock bag wait for next analysis.

Determining FT‐MIR information

Spectra information was determined using common method of KBr tableting. The 1.0 ± 0.2 mg sample was homogenized using 100 ± 0.2 mg KBr powder in an agate mortar in first step. Because KBr is easy to absorb moisture, this step was completed under infrared light. After thoroughly homogenization, mixed sample was adjusted to a thin slice under pressure available from tablet press. In last step, the FT‐MIR spectra were immediately determined with a Fourier transform infrared spectroscopy spectrometer equipped with a DTGS detector. The determination conditions of wavenumber range, the number of successive scans, and resolution were set as 4,000 to 400 cm−1, 64 and 4 cm−1, respectively.

Determining amount of total polyphenol

The unerring 0.2500 g sample was extracted by an ultrasonic cleaning machine with 5 ml (solid‐liquid ratio of 1:20) of 40% ethanol at room temperature for 30 min. Then, the extracted liquid was filtered through qualitative filter paper, and the filtrate was adjusted to 25 ml with deionized water.

The determination method of vis absorbance was similar to the procedures reported by Kaewnarin et al. ([17]). The 0.1 ml filtrate was transferred in a test tube using transfer liquid gun and then added into 7.9 ml distilled water to dilute 80 folds. After that, the sample was added with 0.5 ml of 1 mol/L Foline–Ciocalteu reagent. Eight minutes later, 1.5 ml of 20% saturated sodium carbonate solution was added to the diluted sample. After seal by preservative film, the sample was incubated for 1 hr at room temperature and then determined using visible spectrophotometer at 760 nm. Calibration curve for quantification was constructed by gallic acid. All the reagents belong to analytical reagent (AR) grade.

Statistical analyses

Spectra information was converted to data matrix using software Omnic 8.0 (Thermo Fisher Scientific Inc.). In order to fuse low‐level data, data matrices from the two parts were directly spliced. Three types of feature variables were used, which correspond to PCs, LVs, and VIPs. And the selection standards were eigenvalue >1, maximum Q2(cum), and VIP >1, respectively. These feature matrices were spliced according to same extraction way to get three new data matrices for mid‐level data fusion. The t‐distributed stochastic neighbor embedding (t‐SNE) was applied to preliminarily visualize discrimination of mushrooms. For building discrimination model, fix data matrices (two single, one low‐level data fusion and three mid‐level data fusion) were divided into training set and test set according to Kennard–Stone (KS) algorithm. Training set with 2/3 data was used to build model, while the rest of data namely test set were used to test discriminatory capacity of the model. Supervised classification methods of PLS‐DA, grid‐search support vector machine (GS‐SVM) and particle swarm optimization support vector machine (PSO‐SVM) were developed with help of software of SIMCA‐P+ 13.0 (Umetrics AB) and MATLAB R2014a (Math works).

In essay of total polyphenol prediction, the amount and extraction rate of total polyphenol of per sample were calculated based on vis absorbance. Then, the data of amount and FT‐MIR both from same morphological part were spliced to develop two new data matrices for total polyphenol prediction. Data treatments of first‐order derivative (FD), second‐order derivative (SD), multiplicative scattering correction (MSC), standard normal variate (SNV), Savitzky–Golay (SG) smoothing, and combination of these treatments were performed to optimize spectral data. The derivative order, points in each, and distance between each were set as 1, 15, and 1 and 2, 15, and 1 in FD and FD, respectively. The order derivative was powerful to enhance resolution, whereas it also enhanced the noise (Roy, [31]). SG smoothing may be a helpful method to solve this problem (Xu et al., [39]). The MSC and SNV were usually applied to reduce the light scatter effect caused by sizes and shapes of granular sample (Helland, Naes, & Isaksson, [14]). Before KS, data matrices were normalized to a range of 1–2 to avoid reduced accuracy brought by different dimensions. The method of KS was as described above. The prediction of total polyphenol was performed using supervised regression methods of GS‐SVM. Root mean square error of cross validation (RMSECV) and root mean square error of prediction (RMSEP) were applied to estimate total error for samples and estimate of total prediction error, respectively. The calculation approaches were as follows:

RMSECV=i=1ny^i-yi2nc

RMSEP=i=1nP(y^i-yi)2nP

where the nc represents the quantity of samples of training set while np represents the quantity of samples of test set, and the y^i and yi represent the predicted and measured value, respectively.

Coefficient of determination (R2) was used to reflect the correlation relationship between predicted and measured value. Another useful assessment parameter of residual predictive deviation (RPD) was calculated via standard deviation (SD) divide by RMSEP. The calculation methods were as follows:

R2=i=1ny^i-yi2i=1ny^i-y2

RPD=SDRMSEP

The RPD value between 1.5 and 2 indicated that the model can discriminate low values from high values of the contents. For predicting roughly, the RPD between 2 and 2.5 is a necessary condition, while the RPD greater than 2.5 or 3 corresponds excellent power for content prediction (Nicolaï et al., [28]). It should be noted that the data of amounts and spectra were one‐to‐one correspondence in total polyphenol prediction. For better understanding this study, the workflow is shown in Figure.

fsn31313-fig-0001.jpg

RESULTS

FT‐MIR absorption peaks interpretation

The raw average spectra of caps and stipes from four species of porcini mushrooms are shown in Figure. The major absorption peaks of gaps and stipes have been interpreted. It can be seen from Figure that the absorption bands of 2,928, 1,313, 1,082, and 1,025 cm−1 were shared in caps and stipes. The single perk at 2,928 cm−1 could be assigned to methylene C‐H asymmetric stretching; the weak peak around 1,313 cm−1 was the vinylidene C‐H in‐plane bend (Coates, [7]). The peaks around 1025–1082 cm−1 corresponded to ‐C‐O stretching (Yang & Irudayaraj, [40]). Two broad peaks of 3409–3287 and 3396–3281 cm−1 were assigned to the N‐H stretching vibration (Kasprzyk, Depciuch, Grabek‐Lejko, & Parlinska‐Wojtan, [19]). The around peaks at 1,640, 1,634, 1,543, and 1,550 cm−1 belonged to the C=O stretching, C‐N stretching, and N‐H bending vibration in amide‐I and in amide‐II bands (Choong et al., [6]), which mean these mushrooms contain protein. The peaks of 1,402 and 1,377 cm−1 were O‐C‐H bending and C‐H bending, respectively (Chen, Guo, Yan, Sun, & Zhou, [5]). Although values of absorption peaks were varied in caps and stipes, the shapes and positions of the absorption peaks were similar among all samples, which revealed that the substance components are similar within mushrooms of four species.

fsn31313-fig-0002.jpg

Visualizing discrimination by t‐SNE

PCA and t‐SNE are the most common approaches to provide more vivid and easy‐to‐understand classification result. However, both methods can be used to draw two‐ or three‐dimensional graphics to visualize sample classification. The difference is that the t‐SNE can capture more information to draw the graphics using random walks on neighborhood graphs. Studies have reported that t‐SNE overmatches existing most advanced techniques such as sammon mapping, lsomap, or locally linear embedding (Maaten & Hinton, [24]).

Here, specie discrimination by t‐SNE is shown in Figure where dotted circle represents 95% confidence level of per species. Mushroom classifications were disordered using single data matrices in which any one species could not be separated (Figure a,b). Low‐level data fusion was superior to single data matrix (Figure c). Samples show clustering tendency, and the samples that belong to species of B. edulis show an obvious clustering, although one sample is beyond confidence interval and far away from other samples within the species. Mid‐level data fusione shows a better result, with all samples that belong to B. edulis clustered together and landed within confidence interval but still be mixed with samples belonging to other species (Figure d1). Compared with mid‐level data fusione, mid‐level data fusionv displayed a better discrimination in which one species namely B. edulis was separated completely (Figure d2). The mid‐level data fusionq had best cluster classification among all scatterplots, which allowed two species of B. edulis and B. umbriniporus be discriminated (Figure d3). Besides, clustering results reveal that the within a species is smaller than that between species since the clustering effect was better in same species than in different species. For higher discrimination result which per sample was classified into actual species, supervised model combination with data fusion was performed.

fsn31313-fig-0003.jpg

Discriminating samples by PLS‐DA model

The PLS‐DA is confirmed as a universal strategy for reducing variables, regression prediction, source discrimination, etc. (Loong, Liong, & Jemain, [23]). Table shows the results both the parameters of models and discrimination accuracy of test set. The parameters of RMSEE and RMSECV were used to assess the reliability of training set and test set, respectively; a small value is usually necessary for model quality. Compared with caps, the stipes model has better performance, with the values of parameters of R2Y(cum), Q2(cum), RMSEE, and RMSECV as 0.9, 0.7, 0.15, and 0.26, respectively, and the total correctness rate of test set, with a rate of 97.06%, was higher than that of caps, with a rate of 91.18%. In data fusion strategies, the same total accuracy of 97.06% has been noticed in mid‐levele and mid‐levelv data fusion. Two samples from species of L. rugosiceps were misclassed into B. tomentipes, both in mid‐levele and mid‐levelv data fusion. The lower accuracy (94.12%) at low‐level data fusion may due to redundant information brought by data direct splicing (Borràs et al., [3]). Importantly, model of mid‐level data fusionq has highest value of R2Y(cum) (0.91), Q2(cum) (0.86) and lowest value of RMSEE (0.13), RMSECV (0.19), which meant that the model could fit the new data very well and had a good potential to predict sample origin, and the fact is that this model did allow per sample classed into real species corresponding to total accuracy of 100%. The order of accuracies in all classification models is as follows: mid‐level data fusionq > mid‐levele data fusion = mid‐levelv data fusion = stipes > low‐level data fusion > caps, which indicated that mid‐level data fusionq combined with PLS‐DA model can be seen as a reliable way to discriminate mushroom species.

Mushroom discrimination by PLS‐DA

Caps
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.840.630.180.2612
  1234Accuracy (%)
 114000100
 2032150
 300100100
 40004100
Total3414312591.18

Mushroom discrimination by PLS‐DA

Stipes
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.900.700.150.2614
  1234Accuracy (%)
 114000100
 2051083.33
 300100100
 40004100
Total3414511497.06

Mushroom discrimination by PLS‐DA

Low‐level data fusion
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.870.690.150.248
  1234Accuracy (%)
 114000100
 2041166.67
 300100100
 40004100
Total3414411594.12

Mushroom discrimination by PLS‐DA

Mid‐level data fusione
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.870.790.150.214
  1234Accuracy (%)
 114000100
 2051083.33
 300100100
 40004100
Total3414511497.06

Mushroom discrimination by PLS‐DA

Mid‐level data fusionv
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.840.560.170.287
  1234Accuracy (%)
 114000100
 20501100
 300100100
 4000483.33
Total3414610597.06

Mushroom discrimination by PLS‐DA

Mid‐level data fusionq
  R2Y(cum)Q2(cum)RMSEE (avg)RMSECV (avg)LVS
  0.910.860.130.1927613
  1234Accuracy (%)
 114000100
 20600100
 300100100
 40004100
Total34146104100

  • 2 Note
  • 3 1, Bedulis; 2, L. rugosiceps; 3, Btomentipes; 4, Bumbriniporus; FD, first‐order derivative, LVs: the number of potential variables.
  • 4 Predicted category.
  • 5 Genuine category.
Discriminating samples by SVM model

Supervised models of GS‐SVM and PSO‐SVM were developed to compare with the models of PLS‐DA for fast and simple way for discriminating mushroom species. Table summarizes the results of both the parameters of model and total correctness rate. As shown in Table , the model PSO‐SVM had higher accuracy of test set than GS‐SVM based on caps data, with 97.06% higher than 94.12%, and the best parameters of penalty parameter (c) and kernel parameter (g) were 30.44 and 20.93, respectively. Correspondingly, the misclassified samples of PSO‐SVM were lower than GS‐SVM, with one lower than two. In the models of stipes, however, discrimination accuracy was lower in PSO‐SVM model (88.24%) compared with GS‐SVM model (91.18%). The highest accuracy was noticed using low‐level data fusion strategy, which means the 100% correctness rate and without misclassified sample in both GS‐SVM and PSO‐SVM. Regarding the best discrimination model, low‐level data fusion combined with model of GS‐SVM or PSO‐SVM could be used as a fast and simple approach to discriminate mushroom species.

Mushroom discrimination by GS‐SVM and PSO‐SVM

Origin of dataStrategyBest cBest gAccuracy of training set (%)Accuracy of test set (%)
CapsGS‐SVM1.05e + 061.91e−0687.8894.12 (32/34)
PSO‐SVM30.4420.9389.3997.06 (33/34)
StipesGS‐SVM2,896.316.91e−0493.9491.18 (31/34)
PSO‐SVM2.3854.7392.4288.24 (30/34)
Low‐level data fusionGS‐SVM5.6645.2592.42100 (34/34)
PSO‐SVM7.927.8592.42100 (34/34)

Total polyphenol content

In this study, 40% ethanol was used for total polyphenol extraction from mushroom. The arithmetic means, standard deviation, median value, content range, extraction percentage, and bioconcentration factor of phenolic compounds in two morphological parts of mushrooms are presented in Table.

The arithmetic means, standard deviation, median value, amount range, extraction percentage, and bioconcentration factor of total phenolic content in two morphological parts of mushrooms

Morphological partSpecies

B. edulis

n = 39C

n = 38s

L. rugosiceps

n = 17

n = 16

Btomentipes

n = 43

n = 42

Bumbriniporus

n = 35

n = 37

Caps15.10 ± 1.3714.33 ± 2.0212.27 ± 1.8510.22 ± 1.97
14.7614.0612.0310.13
12.28–17.559.49–17.048.98–16.347.59–16.91
1.511.41.231.02
Stipes15.81 ± 3.4814.00 ± 2.1011.30 ± 3.8310.74 ± 1.40
17.3313.6810.0910.70
8.16–19.7110.63–19.136.32–18.507.59–13.42
1.581.41.131.07
BCFc‐s0.851.031.190.95

6 Abbreviations: dw, dry weight; C, the number of caps; s, the number of stipes; BCFc‐s, quotient calculated by the median value of cap divided by the median value of stipe.

The median values of total phenolic content for four species analyzed varied between 10.13 and 14.76 mg/g dw in caps, and between 10.70 and 17.33 mg/g dw in stipes. Compared with B. tomentipes, L. rugosiceps have greater total polyphenol content, with the median values of the caps and stipes as 14.6 mg/g dw and 13.68 mg/g dw, respectively, then 12.03 mg/g dw and 10.09 mg/g dw, respectively. The smallest contents were found in species of B. umbriniporus, with the median values of 10.13 mg/g dw in caps and 10.70 mg/g dw in stipes. Among these results, mushrooms belong to B. edulis showed greatest total phenolic content both among the caps (14.76 mg/g dw) and stipes (17.33 mg/g dw). The content range of caps was from 12.28 to 17.55 mg/g dw, while the range of stipes was from 8.16 to 19.71 mg/g dw. Higher total phenolic amount may be the major contributes to the better flavor of mushrooms of B. edulis which show higher popularity in customers.

Table also shows the value of bioconcentration factor (BCF) in order to assess enrichment ability of total polyphenol by different morphological parts of mushroom for the first time. The BCF was calculated by the median value of cap divided by the median value of stipe. The enrichment ability of total polyphenol varied from 0.85 to 1.19, with the values of 0.85 for B. edulis, 0.95 for B. umbriniporus, 1.03 for L. rugosiceps, and 1.19 for B. tomentipes. Apparently, caps were considered as a morphological part to better accumulate total polyphenol compared with stipes in species of L. rugosiceps and B. tomentipes, while contrary results were noticed in B. edulis and B. umbriniporus. In a previous study, the mushroom caps of species of Agrocybe aegerita and Lentinula edodes have higher content of glutathione, ergothioneine than in stipes (Kalaras, Richie, Calcagnotto, & Beelman, [18]). One possible explanation is that the mushroom cap consisted of pileipellis, flesh, and hymenium, which are important for its metabolism and reproduction (Casale et al., [4]). Therefore, cap of mushroom may be a major part to enrich active constituents and can be the primary choice to meet purpose of diet nutrition and natural antioxidants.

Predicting content of total polyphenol

The regression model of GS‐SVM was developed using raw spectral information from two morphological parts of mushrooms combined with various data treatments to predict amount of total polyphenol in four porcini mushrooms. Table shows the results, and the best prediction results by the two parts are shown in Figure. The RPD value of models based on four data treatments of FD, FD + SG (7), FD + SG (9), and SNV + FD+SG (9), respectively, was 1.52, 1.51, 1.50, and 1.86 when using caps data, which indicated these models could be used to distinguish sample with high concentration of polyphenol. The best prediction model was achieved by SD‐treated, the RMSEP, RPD and R2 were 0.82, 2.4 and 86.76%, which indicated the good relationship between polyphenol content and FT‐MIR data, and this model could be applied to predict total polyphenol content roughly. In models by stipes data, the best R2 (84.66%) was obtained by FD‐treated, which indicated the good relationship between polyphenol content and FT‐MIR data. However, the best prediction was achieved by model with data treatment of MSC + FD+SG (9), with the values of 0.83, 2.32, and 1.78 corresponded to R2, RMSEP, and RPD, respectively, which indicated that the model could discriminate high values from low values among these amounts. In summary, the prediction result by caps data was superior to result by stipes data. Although numerous data treatments have been used to improve data quality, this study did not obtain an excellent model for accurately predicting the content of total polyphenol. This result may be caused by insufficient sample size, and better model may be obtained by increasing sample size and applying data fusion strategies in the future.

Prediction results of total polyphenol by GS‐SVM regression model

SubstrateTreatentBest cBest gCross validationExternal prediction
R2RMSECVR2RMSEPRPD
CapsNo724.083.45e−040.920.730.772.241.20
SNV5.665.52e−0310.120.531.101.13
MSC370,7289.54e−070.960.550.672.340.92
FD161.95e−030.970.410.681.311.52
SD42.76e−030.970.430.870.822.40
SG (7)5124.88e−040.920.690.772.151.24
SG (15)724.083.45e−040.910.750.772.291.19
SNV + SD0.355.52e−030.491.910.111.840.78
FD + SG (7)161.95e−030.970.410.671.321.51
FD + SG (9)161.95e−030.970.410.671.331.50
SNV + FD+SG (9)642.44e−040.950.800.581.111.86
StipesNo92,681.91.35e−060.881.270.803.281.04
SNV524,2881.35e−060.970.660.803.800.95
MSC8,1922.70e−060.891.160.852.021.28
FD32,7681.08e−0510.130.854.800.93
SD32,7689.540.980.570.792.091.33
SG (7)2,896.314.32e−050.91.170.813.700.95
SNV + FD1,0241.53e−050.940.820.852.651.20
MSC + FD11,585.21.90e−060.950.750.832.271.38
FD + SG (9)16,3842.15e−0510.130.864.610.96
FD + SG (11)1,448.152.16e−050.920.960.884.090.91
FD + SG (13)4,0961.53e−050.960.720.894.250.94
MSC + FD + SG (9)5,792.623.81e−060.940.780.832.321.78

7 Abbreviations: FD, first‐order derivative; SD, second‐order derivative; MSC, multiplicative scattering correction; SNV, standard normal variate; SG, Savitzky––Golay smoothing; SG (7), Savitzky–Golay smoothing with seven points; SG (9), Savitzky–Golay smoothing with nine points; SG (11), SavitzkyGolay smoothing with eleven points; SG (13), Savitzky–Golay smoothing with thirteen points; SG (15), SavitzkyGolay smoothing with thirteen points.

fsn31313-fig-0004.jpg

CONCLUSION

The test results showed that this study did provide a fast and accurate method for species discrimination. In addition to the use of low‐level data fusion, three types of feature variables were selected simultaneously to complete mid‐level data fusion, which indicated that the PLS‐DA model based on feature variable selected by maximum Q2(cum), GS‐SVM, and PSO‐SVM models based on low‐level data fusion had 100% discrimination accuracy allowing each mushroom classed into its real species. Moreover, this study also measured alcohol‐soluble polyphenols content in four porcini mushrooms, offered accumulation tendency of polyphenols in different parts of mushroom for the first time and predicted the polyphenol content for four porcini mushrooms. The result suggested that the way of caps data combination with second‐order derivative can be used to predict roughly polyphenols content in four porcini mushrooms. These outcomes from this work can provide academic references for origin traceability, market supervision, quality evaluation, and edible security.

ACKNOWLEDGEMENTS

The authors would like to acknowledge the assistance of Scientific Research Foundation Project of Education Department of Yunnan (2018JS275), Construction Project of Key Laboratory of Edible Fungal Resources Development and Utilization in Universities in Yunnan and Scientific Research Fund Project of Yunnan Provincial Department of Education (2019Y0095).

CONFLICTS OF INTEREST

The authors declare that there are no conflicts of interest regarding the publication of this paper.

ETHICAL APPROVAL

This study does not involve any human or animal testing.

REFERENCES 1 Bekyarov, G., Kakalova, M., Iliev, A., Bekyarova, T., Donchenko, G. V., Palyvoda, O. M., ... Kuzmenko, O. I. (2011). The content of selenium and other ten chemical elements in Boletus Edulis from Bulgaria and inhibition of lipid peroxidation. Journal of Environmental Science and Engineering, 5, 708 – 715. 2 Bilal, A. W., Bodha, R. H., & Wani, A. H. (2010). Nutritional and medicinal importance of mushrooms. Journal of Medicinal Plants Research, 4, https://doi.org/10.5897/JMPR09.565 3 Borràs, E., Ferré, J., Boqué, R., Mestres, M., Aceña, L., & Busto, O. (2015). Data fusion methodologies for food and beverage authentication and quality assessment‐A review. Analytica Chimica Acta, 891, 1 – 14. https://doi.org/10.1016/j.aca.2015.04.042 4 Casale, M., Oliveri, P., Casolino, C., Sinelli, N., Zunin, P., Armanino, C. ... Lanteri, S. (2012). Characterisation of PDO olive oil Chianti Classico by non‐selective (UV‐visible, NIR and MIR spectroscopy) and selective (fatty acid composition) analytical techniques. Analytica Chimica Acta, 712, 56 – 63. https://doi.org/10.1016/j.aca.2011.11.015 5 Chen, J., Guo, B., Yan, R., Sun, S., & Zhou, Q. (2017). Rapid and automatic chemical identification of the medicinal flower buds of Lonicera plants by the benchtop and hand‐held Fourier transform infrared spectroscopy. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 182, 81 – 86. https://doi.org/10.1016/j.saa.2017.03.070 6 Choong, Y. K., Sun, S. Q., Zhou, Q., Ismail, Z., Rashid, B. A. A., & Tao, J. X. (2011). Determination of storage stability of the crude extracts of Ganoderma lucidum using FTIR and 2D‐IR spectroscopy. Vibrational Spectroscopy, 57 (1), 87 – 96. https://doi.org/10.1016/j.vibspec.2011.05.008 7 Coates, J. (2000). Interpretation of Infrared Spectra, A Practical Approach. Encyclopedia of Analytical Chemistry, 12, 10815 – 10837. 8 Cuevas, F. J., Pereira‐Caro, J., Moreno‐Rojas, J. M., Muñoz‐Redondo, J. M., & Ruiz‐Moreno, M. J. (2017). Assessment of premium organic orange juices authenticity using HPLC‐HR‐MS and HS‐SPME‐GC‐MS combining data FUSION and chemometrics. Food Control, 82, 203 – 211. https://doi.org/10.1016/j.foodcont.2017.06.031 9 Das, C., Chakraborty, S., Acharya, K., Bera, N. K., Chattopadhyay, D., Karmakar, A., & Chattopadhyay, S. (2017). FT‐MIR supported Electrical Impedance Spectroscopy based study of sugar adulterated honeys from different floral origin. Talanta, 171, 327 – 334. https://doi.org/10.1016/j.talanta.2017.05.016 Dentinger, B. T. M., & Suz, L. M. (2014). What's for dinner? Undescribed species of porcini in a commercial packet. PeerJ, 2, e570. https://doi.org/10.7717/peerj.570 Dutta, A. (2017). Fourier Transform Infrared Spectroscopy. Hagos, Y., Redi‐Abshiro, M., & Chandravanshi, B. S. (2018). Selective determination of caffeine and trigonelline in aqueous extract of green coffee beans by FT‐MIR ATR spectroscopy. Spectroscopy, 97, 33 – 38. http://doi.org/https://doi.org/10.1016/j.vibspec.2018.05.003 Hall, D. L., & Llinas, J. (2001). Handbook of multisensory data fusion. CRC Press. Helland, I. S., Naes, T., & Isaksson, T. (1995). Related versions of the multiplicative scatter correction method for preprocessing spectroscopic data. Chemometrics and Intelligent Laboratory Systems, 29, 233 – 241. https://doi.org/10.1016/0169-7439(95)80098-T Ismail, A., Marjan, Z. M., & Foong, C. W. (2004). Total antioxidant activity and phenolic content in selected vegetables. Food Chemistry, 87, 581 – 586. https://doi.org/10.1016/j.foodchem.2004.01.010 Jia, Z. S., Tang, M. C., & Wu, J. M. (1999). The determination of flavonoid contents in mulberry and their scavenging effects on superoxide radicals. Food Chemistry, 64, 555 – 559. https://doi.org/10.1016/S0308-8146(98)00102-2 Kaewnarin, K., Suwannarach, N., Kumla, J., & Lumyong, S. (2016). Phenolic profile of various wild edible mushroom extracts from Thailand and their antioxidant properties, anti‐tyrosinase and hyperglycaemic inhibitory activities. Journal of Functional Foods, 27, 352 – 364. https://doi.org/10.1016/j.jff.2016.09.008 Kalaras, M. D., Richie, J. P., Calcagnotto, A., & Beelman, R. B. (2017). Mushrooms: A rich source of the antioxidants ergothioneine and glutathione. Food Chemistry, 233, 429 – 433. https://doi.org/10.1016/j.foodchem.2017.04.109 Kasprzyk, I., Depciuch, J., Grabek‐Lejko, D., & Parlinska‐Wojtan, M. (2018). FTIR‐ATR spectroscopy of pollen and honey as a tool for unifloral honey authentication. The case study of rape honey. Food Control, 84, 33 – 40. https://doi.org/10.1016/j.foodcont.2017.07.015 Kim, M.‐Y., Seguin, P., Ahn, J.‐K., Kim, J.‐J., Chun, S.‐C., Kim, E.‐H., ... Chung, I.‐M. (2008). Phenolic Compound Concentration and Antioxidant Activities of Edible and Medicinal Mushrooms from Korea. Journal of Agricultural and Food Chemistry, 56, 7265 – 7270. https://doi.org/10.1021/jf8008553 Li, Y., Zhang, J. Y., & Wang, Y. Z. (2018). FT‐MIR and NIR spectral data fusion: A synergetic strategy for the geographical traceability of Panax notoginseng. Analytical and Bioanalytical Chemistry, 410, 91 – 103. https://doi.org/10.1007/s00216-017-0692-0 Liu, Y.‐T., Sun, J., Luo, Z.‐Y., Rao, S.‐Q., Su, Y.‐J., Xu, R.‐R., & Yang, Y.‐J. (2012). Chemical composition of five wild edible mushrooms collected from Southwest China and their antihyperglycemic and antioxidant activity. Food and Chemical Toxicology, 50, 1238 – 1244. https://doi.org/10.1016/j.fct.2012.01.023 Loong, L. C., Liong, C. Y., & Jemain, A. A. (2018). Partial Least Squares‐Discriminant Analysis (PLS‐DA) for classification of high‐dimensional (HD) data: A review of contemporary practice strategies and knowledge gaps. Analyst, https://doi.org/10.1039/x0xx00000x Maaten, L., & Hinton, G. (2008). Visualizing data using t‐SNE. Journal of Machine Learning Research, 9, 2579 – 2605. Manzi, P., Gambelli, L., Marconi, S., Vivanti, V., & Pizzoferrato, L. (1999). Nutrients in edible mushrooms: An inter‐species comparative study. Food Chemistry, 65, 477 – 482. https://doi.org/10.1016/S0308-8146(98)00212-X Mello, A., Ghignone, S., Vizzini, A., Sechi, C., Ruiu, P., & Bonfante, P. (2006). ITS primers for the identification of marketable boletes. Journal of Biotechnology, 121, 318 – 329. https://doi.org/10.1016/j.jbiotec.2005.08.022 Nagy, P., Juhász, J., Reiczigel, J., Császár, G., Kocsis, R., & Varga, L. (2019). Circannual changes in major chemical composition of bulk dromedary camel milk as determined by FT‐MIR spectroscopy, and factors of variation. Food Chemistry, 278, 248 – 253. https://doi.org/10.1016/j.foodchem.2018.11.059 Nicolaï, B. M., Beullens, K., Bobelyn, E., Peirs, A., Saeys, W., Theron, K. I., & Lammertyn, J. (2007). Nondestructive measurement of fruit and vegetable quality by means of NIR spectroscopy: A review. Postharvest Biology and Technology, 46, 99 – 118. https://doi.org/10.1016/j.postharvbio.2007.06.024 Ordoudi, S. A., de los Mozos Pascual, M., & Tsimidou, M. Z. (2014). On the quality control of traded saffron by means of transmission Fourier‐transform mid‐infrared (FT‐MIR) spectroscopy and chemometrics. Food Chemistry, 150, 414 – 421. https://doi.org/10.1016/j.foodchem.2013.11.014 Qi, L. M., Liu, H. G., Li, J. J., Li, T. & Wang, Y. Z. (2018). Feature fusion of ICP‐AES, UV‐Vis and FT‐MIR for origin traceability of Boletus edulis mushrooms in combination with chemometrics. Sensors, 18, 241. https://doi.org/10.3390/s18010241 Roy, I. G. (2015). On computing first and second order derivative spectra. Journal of Computational Physics, 295, 307 – 321. https://doi.org/10.1016/j.jcp.2015.04.015 Sun, W. J., Zhang, X., Zhang, Z. Y., & Zhu, R. H. (2017). Data fusion of near‐infrared and mid‐infrared spectra for identification of rhubarb. Spectrochim Acta A Mol Biomol Spectrosc, 171, 72 – 79. https://doi.org/10.1016/j.saa.2016.07.039 Tondi, G., & Petutschnigg, A. (2015). Middle infrared (ATR FT‐MIR) characterization of industrial tannin extracts. Industrial Crops and Products, 65, 422 – 428. https://doi.org/10.1016/j.indcrop.2014.11.005 Trollope, K. M., Nieuwoudt, H. H., Görgens, J. F., & Volschenk, H. (2014). Screening a random mutagenesis library of a fungal β‐fructofuranosidase using FT‐MIR ATR spectroscopy and multivariate analysis. Applied Microbiology and Biotechnology, 98, 4063 – 4073. https://doi.org/10.1007/s00253-013-5419-3 Tsujikawa, K., Kanamori, T., Iwata, Y., Ohmae, Y., Sugita, R., Inoue, H., & Kishi, T. (2003). Morphological and chemical analysis of magic mushrooms in Japan. Forensic Science International, 138, 85 – 90. https://doi.org/10.1016/j.forsciint.2003.08.009 Velioglu, Y. S., Mazza, G., Gao, L., & Oomah, B. D. (1998). Antioxidant activity and total phenolics in selected fruits, vegetables, and grain products. Journal of Agricultural and Food Chemistry, 46, 4113 – 4117. https://doi.org/10.1021/jf9801973 Vera, L., Aceña, L., Guasch, J., Boqué, R., Mestres, M., & Busto, O. (2011). Discrimination and sensory description of beers through data fusion. Talanta, 87, 136 – 142. https://doi.org/10.1016/j.talanta.2011.09.052 Wong, J. Y., & Chye, F. Y. (2009). Antioxidant properties of selected tropical wild edible mushrooms. Journal of Food Composition and Analysis, 22, 269 – 277. https://doi.org/10.1016/j.jfca.2008.11.021 Xu, L. U., Zhou, Y.‐P., Tang, L.‐J., Wu, H.‐L., Jiang, J.‐H., Shen, G.‐L., & Yu, R.‐Q. (2008). Ensemble preprocessing of near‐infrared (NIR) spectra for multivariate calibration. Analytica Chimica Acta, 616, 138 – 143. https://doi.org/10.1016/j.aca.2008.04.031 Yang, H., & Irudayaraj, J. (2000). Characterization of semisolid fats and edible oils by Fourier transform infrared photoacoustic spectroscopy. Journal of the American Oil Chemists' Society, 77 (1), 291 – 295. https://doi.org/10.1007/s11746-000-0048-y| Yao, S., Li, T., Liu, H. G., Li, J., & Wang, Y. Z. (2018). Traceability of Boletaceae mushrooms using data fusion of UV‐visible and FTIR combined with chemometrics methods. Journal of the Science of Food and Agriculture, 98, 2215 – 2222. https://doi.org/10.1002/jsfa.8707 Zheng, J., Yang, B., Trépanier, M., & Kallio, H. (2012). Effects of genotype, latitude, and weather conditions on the composition of sugars, sugar alcohols, fruit acids, and ascorbic acid in sea buckthorn (Hippophaë rhamnoides ssp. mongolica) Berry Juice. Journal of Agricultural and Food Chemistry, 60, 3180 – 3189. https://doi.org/10.1021/jf204577g Zhou, X. W., Liu, Y., Guo, M. Y., Su, K. Q., & Zhang, Y. M. (2016). Species clarification of the widely cultivated ganoderma in China based on rDNA and FIP gene sequence analysis. International Journal of Agriculture and Biology, 932 – 938. http://doi.org/10.17957/IJAB/15.0189.

By Xiu‐Ping Li; Jieqing Li; Tao Li; Honggao Liu and Yuanzhong Wang

Reported by Author; Author; Author; Author; Author

Titel:
Species discrimination and total polyphenol prediction of porcini mushrooms by fourier transform mid‐infrared (FT‐MIR) spectrometry combined with multivariate statistical analysis
Autor/in / Beteiligte Person: Wang, Yuanzhong ; Li, Xiu-Ping ; Li, Jieqing ; Li, Tao ; Liu, Honggao
Link:
Zeitschrift: Food Science & Nutrition Food Science & Nutrition, 8(2):754-766, Jg. 8 (2020), Heft 2, S. 754-766
Veröffentlichung: Wiley, 2020
Medientyp: unknown
ISSN: 2048-7177 (print)
DOI: 10.1002/fsn3.1313
Schlagwort:
  • total polyphenol prediction
  • FT‐MIR spectroscopy
  • porcini mushroom
  • Correlation coefficient
  • Mid infrared
  • lcsh:TX341-641
  • species discrimination
  • Mass spectrometry
  • Leccinum rugosiceps
  • 01 natural sciences
  • symbols.namesake
  • data fusion
  • 0404 agricultural biotechnology
  • Food science
  • Original Research
  • Mathematics
  • Mushroom
  • biology
  • 010401 analytical chemistry
  • 04 agricultural and veterinary sciences
  • biology.organism_classification
  • 040401 food science
  • 0104 chemical sciences
  • Fourier transform
  • Polyphenol
  • Boletus edulis
  • symbols
  • lcsh:Nutrition. Foods and food supply
  • Food Science
Sonstiges:
  • Nachgewiesen in: OpenAIRE
  • Rights: OPEN

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -