308
Views
6
CrossRef citations to date
0
Altmetric
Original Articles

A New Approach to Predict Acidity of Bayberry Juice by Using Vis/Near Infrared Spectroscopy

, &
Pages 631-638 | Received 18 Jan 2006, Accepted 11 Oct 2006, Published online: 10 Aug 2007

Abstract

A total of 76 bayberry juices were collected and their spectra features were got by using a vis/NIR spectroscopy. One mixed algorithm was used to predict the acidity (pH) of bayberry juice with partial least squares (PLS) and artificial neural network (ANN). PLS was used to find some sensitive spectra actives to acidity in juice, before doing this, the influence of various spectral pretreatments (standard normal variate, multiplicative scatter correction, S.Golay first derivative, wavelet package transform) were compared. The PLS approach with WPT preprocessing spectra was found to provide the best results, and the spectral reflectivity corresponding to them were regarded as the input neurons of ANN. Remnant values by subtracting standard values and validation values, were regarded as the output neurons of ANN. The calibration equation developed from them was used to predict the constituent values for the independent spectra of 30 samples. The results indicated that the observed results using PLS-ANN (rp = 0.943) were better than those obtained by PLS (rp =  0.932). At the same time, the sensitive wavelengths corresponding to the acidity of bayberry juices or some element at a certain band were proposed on the basis of regression coefficients by PLS. It indicates that using vis/NIRS technique to fast and nondestructive detection the acidity of bayberry juices was feasible.

INTRODUCTION

Vis/NIR spectroscopy is the electro-magnetic wave whose frequency is between visual light and near infrared. Its wavelength is defined as from 380 to 2526 nm. Vis/NIR spectroscopy technology has been widely used in quantitative and qualitative analysis. In the all, it has become an important detecting technique in every field, especially in food science and agricultural science. Kawano et al.[Citation1] studied the sugar content in peaches with an optical fibre in interactance mode. Slaughter[Citation2] determined that visible and NIR-spectroscopy could be used to measure non-destructively the internal quality of peaches and nectarines as characterized by their SSC, sugar content, sorbitol content, and chlorophyll content. Wang et al.[Citation3– Citation4] classified pecky rice kernels and fungal-damaged soybean seeds through using visible and near-infrared spectroscopy.

Bayberry (Myrica rubra Siebold and Zuccarini), or Yangmei in Chinese Mandarin, is indigenous to China. It has been cultivated more than 2000 years and the annual output is about 300,000 tons. Its flesh can be processed into sweets, jam, juice and wine, or canned in syrup. It is a very important fruit-product as juice or wine for export, and it has a high economical value. The acidity (pH) and sugar content are the important internal quality attribute to bayberry. Most the methods to measure these qualities are based on complex processing of samples, and involve a considerable amount of manual work. At present, research has been focused on the development of non-destructive measurements techniques for fruit and juice internal quality using vis/NIR spectroscopy techniques as told above. But the most important techniques for NIR are how to extract quantitative information from them. The objectives of the study were to evaluate the use of vis/NIR spectroscopy in measuring the acidity (pH) of bayberry juice, and try to find a better mathematic model to predict it by using vis/NIR.

MATERIALS AND METHODOLOGY

System Set-up and Experimental Material

Spectra were collected using a NIR scanning spectroradiometer ASD Handheld FieldSpec in reflectance mode. Measurements were made at ambient temperature (25°C) over the wavelength range 325–1075 nm at 1.5 nm intervals. The spectroradiometer plumb placed at a height of approximately 100 mm above the vessel center, a 150 W halogen lamp was used, and the angle between them was 30°. shows the collection system of bayberry juice. Reflectance data were stored as log (1/R) (R = reflectance) at 1nm intervals (751 spectra data points). Software of ASD View Spec Pro, Unscramble V9.2 (CAMO, PROCESS, AS, OSLO, Norway), and DPS (data procession system for practical statistics) were used in this article.

Figure 1 Collection system of bayberry juice.

Figure 1 Collection system of bayberry juice.

Seventy six of bayberry juices were gained from the local market. There are 4 varieties of bayberry juice with 19 of each variety (They are Cixi bayberry juice, Xianju bayberry juice, Lishui bayberry juice, and linhai bayberry juice). The samples were placed in airtight glass vessel, stored in an ice filled cooler and transported to the laboratory to keep at cold temperature (4 ± 1°C). All samples were first allowed to equilibrate to room temperature (25°C) before NIR spectroscopy analysis. Each sample was individually numbered and placed in each vessel. In order to reduce the error of operation during the whole experiment, each vessel was glass service chose as the same diameter of 6.5 cm and height of 1.4 cm. For each bayberry juice the scan number were 10 at exactly the same position. The measurement of acidity was measured by a pH meter (SJ-4A, Exact instrument Co., Ltd., Shanghai, China) under the China standard.

Processing of the Optical Data

SNV is a mathematical transformation of log (1/R) spectra. It removes the multiplicative interferences of scatter, particle size, and the change of light distance.[Citation5] MSC is used to correct for additive and multiplicative effects in the spectra. This method is useful when applying linear regression methods as it eliminated scattering effects.[Citation6] S. Golay 1st-Der,[Citation7] which is a moving window averaging method, consisting of a preliminary averaging over a fixed number of points (segment option), and the gap was set as three. All of the above three preprocessing methods were disposed in Unscramble V9.2. WPT[Citation8] is a novel signal processing technique, which has dual localization in both frequency and time. It is an important extension of wavelet transform. In this paper, the wavelet function symlets 6 was adopted to decompose wavelet packet into five layers. It was carried out through using Matlab 7.0. To avoid low signal-noise ratio, only the wavelengths ranging from 400 to 1000 nm were used in this investigation. is the four original representative absorbance spectral data from 325 nm to 1075 nm (one of each variety).

Figure 2 Four original representative absorbance spectral data from 325 nm–1075 nm.

Figure 2 Four original representative absorbance spectral data from 325 nm–1075 nm.

Partial Least Square (PLS)

Partial least square analysis, a method of NIR spectrum analysis, is the most widely used choices among many technologies.[Citation9–11] Because it takes the advantage of the correlation relationship that already exists between the spectral data and the constituent concentrations. Besides, the model built by PLS is steadier, especially to the study on small samples and large variables. However, PLS is based on linear models and unsatisfactory results may be obtained when non-linearity behaviors are present. To model these non-linearities, additional factors have to be introduced into the linear PLS model. Nowadays these non-linearities are commonly modeled by using neural network.[Citation12]

Artificial Neural Network (ANN)

The most popular type of neural network for use in analytical applications is artificial neural network (ANN)[Citation13–15] with the back-propagation learning algorithm. It is a one-way multilayer feed-forward network. Back–propagation uses a learning process to minimize the global error of the system by modifying node weights. The weight increment or decrement is achieved by using the gradient descent rule. The network is trained by initially selecting the weights at random and then presenting all training data repeatedly. The weights are adjusted after every trial using external information specifying the correct result until the weights converge, and the errors are reduced to acceptable values. The delta rule was used as the learning rule, which specifies how connection weights are changed during the learning process. The sigmoid transfer function was chosen for the non–linear function that transfers the internally generated sum for each node to a potential output node.

RESULTS AND DISCUSSION

Near-infrared spectra are difficult to interpret directly because of the overlap of weak overtones and combinations of fundamental vibration bands. As a result, multivariate calibration is required for quantitative analysis of sample constituents by NIRS. Various calibration methods have been used to relate near-infrared spectra with measured properties of materials. Principal components regression (PCR), partial least squares (PLS), multiple linear regression (MLR), Fourier regression and artificial neural networks are the most used multivariate calibration techniques for NIRS.[Citation16– Citation17] PLS was usually considered for a large number of applications in fruit and juice analysis and was widely used in multivariate calibration. Because it takes the advantage of the correlation relationship that already exists between the spectral data and the constituent concentrations. However, PLS is based on linear models and unsatisfactory results may be obtained when non-linearities are present. Artificial neural network (ANN) as a new technique has been widely used for its prediction and forecasting abilities especially in complex settings.

Multivariate Calibration by PLS-ANN for Acidity

In this article, a mixed algorithm was employed for building the nonlinear model of NIR and the acidity of bayberry juice. The mixed algorithm was combined with PLS method and ANN. The model based on the mixed algorithm was divided into two parts: linear part and nonlinear part, and the corresponding model of each part were built, respectively. In the linear part a total of 46 juice samples were used to build a calibration model by PLS. The calibration models for acidity with pre-processed spectra were developed using PLS regression with cross validation by the predicted error sum of squares (PRESS) function in order to avoid overfitting of the models. To evaluate the results, the standard error of calibration (SEC) and correlation coefficient (r) for the calibration set, and standard error of cross-validation (SECV) and r for validation set were considered. Good models should have lower SEC and SECV, higher r but smaller differences between SEC and SECV.

All the parameters of different calibration and validation sets were showed in . The results indicated that the pre-processing method affected the performance of models. Compared the four different preprocessing method in wavelength range of 400–1000 nm, WPT was the best, with higher correlation coefficient and lowest SEC and SECV values also smallest differences between SEC and SECV.

Table 1 Calibration and validation results of partial least square for analyzing acidity of bayberry juice

At the same time, PLS method was used to find some sensitive spectra actives to acidity in juice where 25 wavebands were obtained for its higher regression coefficients, and then regarded the spectral reflectivity corresponding to them as the input neurons of ANN. Calculate the remnant values by subtracting standard values and validation values, and regarded them as the output neurons of ANN. So in the nonlinear part, a remnant ANN was built which regard sensitive spectral reflectivity as the input neurons and remnant values as the output neurons. A three-layer ANN was built. The transfer function of each layer was sigmoid function. The node of input layer was 25. The node of hidden neuron layer was 6. The node of output neuron layer was 1. The goal error was set as 0.001. The speed of learning was set as 0.2. The time of training was set as 1000.

Prediction Results of PLS-ANN Regression Analysis for Acidity

In the prediction, using PLS models with four different preprocessing methods were used to predict 30 unknown juice samples, and got the best prediction values of them. It showed that PLS model with WPT preprocessing is an optimal method to preprocess data of the spectra, and then regard spectral reflectivity corresponding to the 25 sensitive wavebands of these 30 samples as the input neurons of ANN, and predict the remnant values. The final predicted acidity values of 30 juice samples were got through adding predicted values (by calibration model of PLS) and remnant values (by remnant ANN model). The results indicated that the observed results using PLS-ANN were better than those obtained by PLS. The rp for PLS-ANN prediction is 0.943, RMSEP = 0.145, bias = −0.101; However, the rp for PLS prediction is 0.932, RMSEP  =  0.168, bias = −0.117. The model based on PLS-ANN shows superior performance for determination the acidity of bayberry juice.

Sensitive Wavelengths Analysis Based on Regression Coefficients

The sensitive wavelengths reflecting the characteristics of spectra for acidity was obtained based on regression coefficients. From , we can find that wavelengths of 685∼695 nm, 910∼925 nm might be of particular importance for the acidity calibration. While the wavelengths between 700 to 950 nm are possible that it results from a 3rd overtone stretch of CH and 2nd and 3rd overtone of OH in bayberry juice which was referred by Slobodan and Yukihiro in their article about detailed band assignment for the short-wave NIR region useful for various biological fluids.[Citation18] However, it is likely that the wavelengths 685∼695 nm for acidity mainly attributed to the color of bayberry juice because there is non-existent of organic acids in this region of the spectrum. So in our research, to acidity, 910∼925 nm were better. This found was similar to the earlier literature, such as He found wavelengths near 900 nm were sensitive wavelengths corresponding to organic acid of oranges.[Citation19]

Figure 3 Prediction results for the unknown 30 samples from the PLS-ANN models.

Figure 3 Prediction results for the unknown 30 samples from the PLS-ANN models.

Figure 4 Regression coefficients with corresponding wavelengths for acidity.

Figure 4 Regression coefficients with corresponding wavelengths for acidity.

CONCLUSIONS

The results indicate that it is possible to use this non-destructive technique for measuring the acidity of bayberry juice. Through multivariate calibration technique of PLS a 46 juice samples calibration model was established. And at the same time, 25 sensitive spectra actives to acidity in juice were found, the spectral reflectivity corresponding to them were regarded as the input of ANN. Remnant values which were got by subtracting standard values and validation values were regarded as the output neurons of ANN. The predicted values of the unknown 30 juices were got through adding predicted values (by calibration model of PLS) and remnant values (by remnant ANN model). The model based on PLS-ANN show superior performance for determination the acidity of bayberry juice, that is, the correlation coefficient r = 0.943, RMSEP  =  0.145, bias  =  −0.101. At the same time, the sensitive wavelengths corresponding to the acidity of bayberry juices or some element at a certain band were proposed on the basis of regression coefficients by PLS. For acidity, wavelengths 910∼925 nm might be of particular importance. It indicates that using vis/NIRS technique to fast and nondestructive detection the acidity of bayberry juices was feasible.

ACKNOWLEDGMENTS

This study was supported by the Teaching and Research Award Program for Outstanding Young Teachers in Higher Education Institutions of MOE, PRC, Natural Science Foundation of China (Project No: 30270773), Specialized Research Fund for the Doctoral Program of Higher Education (Project No: 20040335034) and Natural Science Foundation of Zhejiang Province, China (Project No: RC02067).

REFERENCES

  • Kawano , S. , Watanabe , H. and Iwamoto , M. 1992 . Determination of Sugar Content in Intact Peaches by Near Infrared Spectroscopy with Fibre Optics in Interactance Mode . J. Japanese Soc. Hort. Sci , 61 ( 2 ) : 445 – 451 .
  • Slaughter , D.C. 1995 . Non-destructive Determination of Internal Quality in Peaches and Nectarines . Trans. ASAE , 38 ( 2 ) : 617 – 623 .
  • Wang , D. , Dowell , F.E. , Lan , Y. , Pasikatan , M. and Maghirang , E. 2002 . Determining Pecky Rice Kernels Using Visible and Near-infrared Spectroscopy . Int. J. Food Prop. , 5 ( 3 ) : 629 – 639 .
  • Wang , D. , Dowell , F.E. , Ram , M.S. and Schapaugh , W.T. 2004 . Classification of Fungal-damaged Soybean Seeds Using Near-infrared Spectroscopy . Int. J. Food Prop. , 7 ( 1 ) : 75 – 82 .
  • Chu , X.L. , Yuan , H.F. and Lu , W.Z. 2004 . Progress and Application of Spectral Data Pretreatment and Wavelength Selection Methods in NIR Analytical Technique . Prog. Chem. , 16 : 528 – 542 .
  • Chalus , P. , Roggo , Y. , Walter , S. and Ulmschneider , M. 2005 . Near-infrared Determination of Active Substance Content in Intact Low-dosage Tablets . Talanta , 66 ( 5 ) : 1294 – 1302 .
  • Liang , Y.C. and Yi , Z.S. 2001 . “ The Handbook of Analytical Chemistry: Chemistry Metrology ” . Beijing : Chemical Industry Press .
  • Chen , J. , Shi , Y. and Shi , S. 1999 . Noise Analysis of Digital Ultrasonic Nondestructive Evaluation System . Int. J. Pressure Vessels Pip. , 76 ( 9 ) : 619 – 631 .
  • Haalan , D.M. and Thomas , E.V. 1988 . Partial Least Squares Method for Spectral Analysis . Anal. Chem. , 60 : 1193 – 1202 .
  • He , Y. , Feng , S.J. , Deng , X.F. and Li , X.L. 2006 . Study on Lossless Discrimination of Varieties of Yogurt Using the Visible/NIR-spectroscopy . Food Res. Int. , 39 ( 6 ) : 645 – 650 .
  • Antihus , H.G. , He , Y. and Annia , G.P. 2006 . Non-destructive Measurement of Acidity, Soluble Solids and Firmness of Satsuma Mandarin Using Vis/NIR-spectroscopy Techniques . J. Food Eng. , 77 ( 2 ) : 313 – 319 .
  • Paulo , H.F. , Ronei , J.P. and Joao Carlos , A. 2002 . Determination of Organic Matter in Soils Using Radial Basis Function Networks and Near Infrared Spectroscopy . Anal. Chim. Acta , 453 : 125 – 134 .
  • Wu , W. , Walczak , B. and Massart , D.L. 1996 . Artificial Neural Networks in Classification of NIR Spectral Data: Design of the Training Set . Chemometrics Intell. Lab. Syst. , 33 : 35 – 46 .
  • Zhang , Y.D. , Dong , K. and Ren , L.F. 2004 . Pattern Recognition of Laser-induced Autofluorescence Spectrum from Colorectal Cancer Tissues Using Partial-least-square and Neural Network . China Medical Engineering , 12 ( 4 ) : 52 – 59 .
  • Dou , Y. , Sun , Y. , Ren , Y.Q. and Ren , Y.L. 2005 . Artificial Neural Network for Simultaneous Determination of Two Components of Compound Paracetamol and Diphenhydramine Hydrochloride Powder on NIR Spectroscopy . Anal. Chim. Acta , 528 : 55 – 61 .
  • Naes , T. and Isaksson , T. 1990 . Locally Weighted Regression and Scatter Correction for Near-infrared Reflectance Data . Anal. Chem , 62 : 664 – 673 .
  • Holst , H. 1992 . Comparison of Different Calibration Methods Suited for Calibration Problems with many Variables . Appl. Spec , 46 : 1780 – 1784 .
  • Slobodan , S. and Yukihiro , O. 2001 . Short-wave Near-infrared Spectroscopy of Biological Fluids. 1. Quantitative Analysis of Fat, Protein, and Lactose in Raw Milk by Partial Least-squares Regression and Band Assignment . Anal. Chem , 73 ( 1 ) : 64 – 71 .
  • He , Y.D.F. 1998 . “ The Method for Near Infrared Spectral Anlysis ” . In The Analysis Basic and Application of Near Infrared Spectroscopy , Edited by: Yan , Y.L. , Zhao , L.L. , Han , D.H. and Yang , S.M. 354 Beijing : Light Industry of China .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.