775
Views
10
CrossRef citations to date
0
Altmetric
ORIGINAL ARTICLES

Prediction of total nitrogen in cropland soil at different levels of soil moisture with Vis/NIR spectroscopy

, , , , , & show all
Pages 267-281 | Received 06 Jan 2014, Accepted 17 Mar 2014, Published online: 16 Apr 2014

Abstract

Visible/near-infrared (Vis/NIR) spectroscopy has been proven to be an effective technique for soil total nitrogen (TN) content estimation in the laboratory conditions. However, the transferability of this technique from laboratory study to field application is complicated by soil moisture effects. This study aims to compare the performance of four spectral transformation strategies, namely, Savitzky–Golay (SG) smoothing, SG smoothing followed by first derivative (FD), orthogonal signal correction (OSC), and generalized least squares weighting (GLSW), in the removal of soil moisture effects on TN estimation. The spectral transformations were applied on 8 sets of spectral reflectance measured from 62 soil samples at 8 moisture levels. The air-dried set was used for partial least squares regression (PLSR) calibration, whereas the other seven sets with moisture gradients were used for external validations. Results show that the SG-PLSR model cannot be transferred from the air-dried samples to the samples with moisture gradients. The FD-PLSR model showed fair TN prediction performance, with five out of seven residual prediction deviations (RPD) that are greater than 1.4. Both OSC-PLSR and GLSW-PLSR had good transferability to the moist samples. More specifically, the GLSW-PLSR model (mean of , root mean square error for prediction [RMSEP] = 0.262, and RPD = 1.885) outperformed the OSC-PLSR model (mean of , RMSEP = 0.277, and RPD = 1.780). The results demonstrate the value of OSC and GLSW in eliminating the effects of moisture on TN estimation, and the GLSW-PLSR is recommended for a better Vis/NIR estimation of TN content under different soil moisture conditions.

Introduction

Nitrogen, which is one of the primary nutrients critical for the survival of all living organisms, and its cycle is significantly affected by human activities (e.g., agricultural practice) in the local and global ecosystems (Bernhard Citation2010). The rational use of nitrogenous fertilizers could not only improve soil fertility and promote the sustainable development of agriculture but also ensure food and nutrition safety (Yakovchenko et al. Citation1996; Olfs et al. Citation2005). Conversely, unreasonable management of nitrogenous fertilizers may lead to the decline of soil fertility, resulting in the decrease of crop yield and protein content. Moreover, this situation will cause a series of environmental problems, such as soil erosion and water eutrophication (Elmi et al. Citation2003; Erhart et al. Citation2007), which will threaten human health and survival. Soil total nitrogen (TN) is an important part of soil, and the level of its content is one of the important indicators for soil quality evaluation (Shi et al. Citation2013). The importance of nitrogen in terrestrial ecosystems underlies the significance of enhancing the efficiency of obtaining soil TN content to guide the precise fertilizer and the assessment of soil quality.

In recent years, visible/near-infrared (Vis/NIR) spectroscopy has been increasingly used to measure various soil properties, including the TN content (Chang & Laird Citation2002; Viscarra Rossel et al. Citation2006; Michel et al. Citation2009; Soriano-Disla et al. Citation2013). Compared with traditional biological and chemical methods for determining TN content (Keeney Citation1982; Felipó & Garau Citation1987), Vis/NIR spectroscopy has the advantages of fast, efficient, and nondestructive measurements (Demattê et al. Citation2004; Terhoeven‐Urselmans et al. Citation2006). In the laboratory, soil reflectance can be acquired under controlled conditions. However, in the field, spatial variation of “external parameters,” such as moisture and particle size, may influence soil reflectance and further degrade its ability for TN estimation. The effects of soil moisture on the Vis/NIR reflectance spectra have been extensively studied (Galvão & Vitorello Citation1998; Reeves III Citation2010). The relationship between the normalized soil reflectance and moisture is nonlinear (Idso et al. Citation1975). Such nonlinear relationship could explain why homogeneous moisture in soils is preferable for the Vis/NIR estimation of soil properties. The influence of soil moisture on the Vis/NIR estimation of soil properties, such as TN, has been explored in various studies. For example, Chang and Laird (Citation2002) minimized the influence of moisture content on the spectra by using dried samples. Nevertheless, one of the most challenging issues is how to improve the transferability of Vis/NIR model among different moisture conditions. Moreover, with the development of a global reflectance spectral library from legacy soil samples (Viscarra Rossel et al. Citation2008), which are usually air-dried, transferring the Vis/NIR model calibrated from dry samples to the samples with moisture gradients would be desirable.

Chemo-metric strategies were used in line monitoring of petrochemical, food, and pharmaceutical field for preventing influence factors, which can occur in industrial conditions (Preys et al. Citation2008), such as multiplicative scatter correction (Isaksson & Næs Citation1988), standard normal variate transformations and de-trending methods (Barnes et al. Citation1989), smoothing (Savitzky & Golay Citation1964), orthogonal signal correction (OSC; Wold et al. Citation1998), external parameter orthogonalization (EPO; Roger et al. Citation2003), and generalized least squares weighting (GLSW, Zorzetti et al. Citation2011). In soil science, current efforts have been devoted to the removal of the influence of moisture on the Vis/NIR estimation of soil organic carbon (SOC) in view of some chemo-metric strategies. For example, Wu et al. (Citation2009) noted that the first derivative (FD) of specific bands (e.g., 800–1400, 1600–1700, and 2100–2200 nm) is moisture-independent and proposed to use these spectra to estimate SOC content under field condition. Marco et al. (Citation2013) classified the spectra according to the normalized soil moisture index proposed by Haubrock et al. (Citation2008) and established a series of models to predict SOC content at different moisture levels. Minasny et al. (Citation2011) used EPO to remove the effects of soil moisture from Vis/NIR spectra in the prediction of SOC content. Despite these efforts, further examination should be conducted on chemo-metric strategies that hold the potential to remove moisture effects on the Vis/NIR estimation of TN.

We investigated the influence of soil moisture on the Vis/NIR spectra and the prediction of TN. This study mainly aims to remove the soil moisture effects on the Vis/NIR estimation of soil TN. More specifically, we aim to (1) explore the transferability of partial least squares regression (PLSR) model calibrated among eight moisture gradients and to (2) compare the performance of four spectral transformation strategies, namely, Savitzky–Golay (SG) smoothing, SG smoothing followed by FD, OSC, and GLSW, in the removal of soil moisture effects on the Vis/NIR estimation of TN content.

Materials and methods

Study area

The study area is located in Honghu City (Hubei Province), Jianghan Plain, which is known for its riparian landscape as well as its dramatic land use and land cover changes since the 1950s (Fang et al. Citation2005). The landform of the study area is flat with an elevation not exceeding 50 m above mean sea level. Inceptisols dominate the study area according to the US Department of Agriculture Soil Taxonomy. Water body, lacustrine vegetation, floodplain, cropland, and open land are the major land cover types (Zhao et al. Citation2003). The field survey in December 2011 reported that artificial forest and meadow along the dams are now essential parts of land cover types in this region. Paddy field and irrigated cropland are the major land-use types comprising the cropland. The humid climate and diverse land-use types result in the spatial–temporal dynamics of soil moisture. Therefore, the Honghu region was selected as study area.

In December 2011, a total of 62 topsoil samples were collected in study area. According to the US Department of Agriculture classification system, soil texture of all samples was classified into four classifications. Descriptive statistics showing the texture, TN, and SOC content of the samples have been provided in We can see that with the order of clay, loam, and sandy, the SOC and TN content gradually reduced. The geographical coordinates of all sampling sites were recorded using a handheld global positioning system with a positional error of less than 5 m. shows the spatial distribution of the sampling sites, with an Enhanced Thematic Mapper Plus (ETM+) image (panchromatic band) showing the landscape of the study area. Spatial resolution of the panchromatic band is 15 m and the spectral range is 0.52–0.90 µm. Thus, the cropland shows bright while the water body (e.g., breeding ponds) appears to be dark. It is noted that the total set of soil samples were from cropland and some were much closer to the breeding pond area. At each sampling site, about 1.0–1.5 kg of surface soil from a depth of 0–10 cm was collected in five sampling campaigns after wiping off plant material, plant residues, roots, and stones (Shi et al. Citation2013). That is to say, five samples were collected on the center position and four corners of a 5 × 5 m grid for each sampling site, and composed them as one soil sample. This method can reduce random error and ensure the representativeness of the sample. The collected soil samples were then kept in a sealed package for chemical analyses and spectral measurement in the laboratory.

Table 1. Basic properties and statistical descriptions of the soil samples on Honghu region, China.

Figure 1. Maps showing the geographical location of the study area, distribution of sampling sites, and landscape of the study area, indicated by a LANDSAT-7 ETM+ image with band 8 (panchromatic band).
Figure 1. Maps showing the geographical location of the study area, distribution of sampling sites, and landscape of the study area, indicated by a LANDSAT-7 ETM+ image with band 8 (panchromatic band).

Laboratory analyses and measurements

In the laboratory, the soil samples were air-dried, ground, and sieved (less than 2 mm). Each sample was thereafter placed on a petri dish with diameter and thickness of 12 and 1 cm, respectively. All samples were oven-dried at 105°C for six hours and weighted on an analytical scale (with accuracy of 0.01 g) to calculate the weight of oven-dried soil samples. A small part of each sample was analyzed for soil TN content by using the semimicro Kjeldahl method (Nelson & Sommers Citation1980). Subsequently, the oven-dried soil samples were rewetted to oversaturation, with the moisture content falling within the range of 350–400 g kg−1. For the rewetting process, predetermined volume of distilled water was measured using a glass cylinder, and gently added to the petri dish. Each of the petri dish was covered for about two hours until all the samples were saturated. The samples were thereafter air-dried for seven days at room temperature ranging from 18 to 22 °C (Ciarlo et al. Citation2007). The lid of the petri dish was used to control soil evaporation so that the moisture of each sample could fall within the ranges of 300–350 g kg−1, 250–300 g kg−1, 200–250 g kg−1, 150–200 g kg−1, 100–150 g kg−1, 50–100 g kg−1, and 0–50 g kg−1 (air-dried) for each of the seven days, respectively. The soil samples at these eight different moisture levels (including the oversaturated samples) were weighted shortly before spectral measurement. Subtracting the weights of the oven-dried soil samples facilitates the calculation of the moisture of each sample at each moisture level. With box plots, shows the statistical characteristics of the moisture content of the eight data-sets. It can be seen that for each of the seven moist data-sets, the moisture content of the 62 samples takes an approximately normal distribution. We assumed that such statistical distribution of moisture content could be a good representative of operational situation.

Figure 2. Box plots of soil moisture content (SMC) which are divided into eight levels: 0–50 g kg−1 (air-dried), 50–100 g kg−1, 100–150 g kg−1, 150–200 g kg−1, 200–250 g kg−1, 250–300 g kg−1, 300–350 g kg−1, and 350–400 g kg−1.
Figure 2. Box plots of soil moisture content (SMC) which are divided into eight levels: 0–50 g kg−1 (air-dried), 50–100 g kg−1, 100–150 g kg−1, 150–200 g kg−1, 200–250 g kg−1, 250–300 g kg−1, 300–350 g kg−1, and 350–400 g kg−1.

An ASD FieldSpec3 portable spectral radiometer with wavelength of 350–2500 nm was used to measure the soil spectra for each set. The sampling interval and spectral resolution were 1.4 nm and 3 nm for the 350–1000 nm range, respectively, and 2 nm and 10 nm for the 1000–2500 nm range, respectively (http://www.asdi.com). A standardized white Spectralon® panel was used for reflectance calibration. A white light source matched with the spectroradiometer was used with a 45° incident angle. The spectra of the soil samples were measured by the spectroradiometer with a distance of 15 cm from the probe to the sample surface and a zenith angle of 90°. The entire scanning procedure was performed in a dark room at night to minimize the influence of external light (Choe et al. Citation2008).

Spectral preprocessing

Only the reflectance of the air-dried samples (0–50 g kg−1) in the region of 400–2350 nm was used for model calibration because of the presence of noise at both edges. Four spectral preprocessing strategies were implemented and compared in terms of their performance in removing soil moisture effects on TN prediction. The reflectance was smoothed by using the SG algorithm with a window size of 15 and polynomial order of 2 (Savitzky & Golay Citation1964). The smoothed reflectance was thereafter preprocessed with FD (Rinnan et al. Citation2009), OSC, and GLSW. Thus, for each of the eight data-sets, four sets of spectra were involved: (1) the reflectance with SG smoothing, (2) the SG smoothed reflectance followed by FD, (3) the SG smoothed reflectance coupled with OSC, and (4) the SG smoothed reflectance followed by GLSW.

The OSC effectively removes all orthogonal variations in independent variables (X-block) that are unrelated or orthogonal to dependent variables (Y-block), thus simplifying the calibration model (Lin & Kuo Citation2013). The algorithm starts by identifying the first principal component of the X-block. Afterward, the loading is rotated to make the scores orthogonal to the Y-block. Once the rotation is complete, a partial least squares (PLS) model is estimated, which can predict these orthogonal scores from the X-block. The number of components (c) in the PLS model is adjusted to achieve a specified level of captured variance for the orthogonal scores. Finally, the weights, loadings, and predicted scores are used to remove the specified orthogonal component. This entire process can be repeated on the X-block, from which the previously identified orthogonal component was removed, for any given number of components. Each cycle results in additional PLS weights and loadings being added to the total and will be used when applied to new data (Wise et al. Citation2007).

The GLSW was used to down weigh the differences between replicate measurements (Zorzetti et al. Citation2011). The underlying principle of GLSW is a filter calculated from the differences (usually caused by external factors) between samples, which should otherwise be similar. These differences are considered interferences, which the filter attempts to down weigh. In this study, GLSW was applied to correct soil moisture effects on the Vis/NIR estimation of TN. GLSW helps down weigh the differences and make the spectra of the same sample at different soil moisture gradients appear more similar. Ten samples were selected from the air-dried set according to the spectral Euclidean distance (DISTSLCT function in PLS toolbox version 7.3.1, Eigenvector Research, Manson, WA). Then, the same samples were selected from the other seven data-sets with different moisture levels. Thereafter, a GLSW was estimated by using the SG smoothed reflectance of selected samples from each of the eight moisture levels. Finally, the filtered air-dried set was used for the PLSR calibration (Fu et al. Citation2012). The most commonly used algorithm for GLSW has a single adjustable parameter, α, which defines how strongly the GLSW down weight the interferences. Adjusting α toward larger values (typically above 0.001) decreases the effect of the filter. Smaller α values (typically 0.001 and below) apply more filtering.

Building the data-sets

The reflectance spectra and the TN content of all 62 soil samples were denoted as (Xi, Y), where Xi denotes the spectral matrix and Y represents the content of TN. The lower case character i represents the soil moisture level, where i = 1 means soil moisture content (SMC) = 0–50 g kg−1 (the air-dried data-set), i = 2 means SMC = 50–100 g kg−1, i = 3 means SMC = 100–150 g kg−1, i = 4 means SMC = 150–200 g kg−1, i = 5 means SMC = 200–250 g kg−1, i = 6 means SMC = 250–300 g kg−1, i = 7 means SMC = 300–350 g kg−1, and i = 8 means SMC = 350–400 g kg−1. With the four spectral transformations mentioned above, 32 data-sets, namely, (, Y), (, Y), (, Y), and (, Y), i = 1, …, 8, were built to study the influence of soil moisture on the prediction of TN content.

Model calibration and validation

PLSR modeling

PLSR coupled with leave-one-out cross validation are used for model calibration by relating the spectral data-sets Xi with TN content data-sets Y. This method was developed by Wold et al. (Citation2001) and has become a routine modeling technique for the quantitative analysis of Vis/NIR spectra (Viscarra Rossel & Behrens Citation2010). The advantages of PLSR are that it handles multicollinearity. It is robust in terms of data noise and missing values (Viscarra Rossel et al. Citation2006). Particularly, it is useful when dealing with highly correlated predictor variables, whose number is much greater than that of samples (Wold et al. Citation2001; Viscarra Rossel et al. Citation2006). According to the four spectral pretreatment strategies, the SG-PLSR, FD-PLSR, OSC-PLSR, and GLSW-PLSR models were calibrated by using the (, Y), (, Y), (, Y), and (, Y) air-dried data-sets, respectively. To find the better c and α for the OSC-PLSR and GLSW-PLSR models, respectively, different values of c (1, 2, 3, and 4) and α (0.01, 0.001, and 0.0001) were used in the model calibrations. Leave-one-out cross validation (Gomez et al. Citation2008) was used to determine the optimal number of factors in the PLSR models. PLS toolbox version 7.3.1 (Eigenvector Technologies, Manson, Washington, USA) was applied for the model calibration and validation.

Evaluation of the calibration models

The performances of the calibration models were assessed by the determination coefficient for cross validation and validation ( and ), the root mean square error for prediction (RMSEP), and the residual prediction deviation (RPD; Saeys et al. Citation2005; Mouazen et al. Citation2010; Wang et al. Citation2014). R2 indicates the variance percentage in the Y variable that can be captured by the X variables. The RPD, defined as the ratio of the standard deviation of the Y variable in the validation set to the RMSEP, demonstrates the predictive performance of the calibration models. We adopted the five level interpretations of R2 given by Williams (Citation2004) and the RPD mentioned by Viscarra Rossel et al. (Citation2006): excellent predictions (R2 ≥ 0.90, RPD ≥ 2.5), good predictions (0.81 ≤ R2 < 0.90, 2.0 ≤ RPD < 2.5), approximate quantitative predictions (0.66 ≤ R2 < 0.81, 1.8 ≤ RPD < 2.0), possibility to distinguish between high and low values (0.50 ≤ R2 < 0.66, 1.4 ≤ RPD < 1.8), and unsuccessful predictions (R2 < 0.50, RPD < 1.40).

Transferability verification

The transferability verification is two-fold. First, eight PLSR models were calibrated by using one of the data-sets (, Y) with a specific i (i = 1, …, 8). The other seven data-sets were used for validations. Such processes were designed to examine whether the PLSR models calibrated from the soil samples at a specific moisture level could be successfully applied to estimate the TN content when the moisture of the samples varied. No other spectral transformation strategies were used in this verification process. The second verification procedure involved all four spectral transformation strategies for the purpose of comparison. The (, Y), (, Y), (, Y), and (, Y) data-sets were used for model calibration. Thereafter, coupled with the other data-sets at different moisture gradients, four calibration models, namely, SG-PLSR, FD-PLSR, OSC-PLSR, and GLSW-PLSR, were used for TN prediction and transferability verification. In both verification processes, the transferability of the calibration models was assessed by using and RPD.

Flowchart of the study

shows a flowchart that describes the processes of data collection and preparation, model calibration, validation, and evaluation. For the comparisons of spectral transformations in the removal of soil moisture effects, only the air-dried data-sets were used in the model calibration, and the model transferability was examined with the moisture data-sets as test sets. This measure is performed not only because most previous studies have used the spectra of air-dried samples (e.g., Shi et al. Citation2013) but also for future use of soil spectral library in the estimation of soil properties in the field.

Figure 3. Flowchart of the processes of data collection and preparation, model calibration, validation, and evaluation.
Figure 3. Flowchart of the processes of data collection and preparation, model calibration, validation, and evaluation.

Results and analysis

Effect of moisture content on soil spectra

shows the SG smoothed spectra. The differences in soil moisture result in the variations of spectra in the entire spectral range (400–2350 nm). The reflectance decreased with increased soil moisture. The variation is more pronounced for longer wavelengths, particularly in the water absorption bands (1450 and 1940 nm; Weidong et al. Citation2002). For low moisture levels, a small increase in soil moisture induces a typical decrease in soil reflectance. However, in the cases of higher soil moisture levels (i.e., SMC > 250 g kg−1), the reflectance decreases slightly with increased moisture. These conclusions are consistent with those of Neema et al. (Citation1987) and Zhu et al. (Citation1984). Moreover, the wavelengths around 1450, 1850, and 2250 nm were identified as the important wavelengths for PLSR modeling in TN content evaluation (Shi et al. Citation2013). Thus, moisture may mask these features, resulting in poor Vis/NIR calibration for TN.

Figure 4. The average of soil spectra from different data-sets with four spectral transformations: (a) SG smoothing, (b) SG smoothing followed by FD, (c) SG smoothing followed by OSC, and (d) SG smoothing followed by GLSW.
Figure 4. The average of soil spectra from different data-sets with four spectral transformations: (a) SG smoothing, (b) SG smoothing followed by FD, (c) SG smoothing followed by OSC, and (d) SG smoothing followed by GLSW.

shows the eight curves of the FD spectra. The water peak at 1400 and 1900 nm is exaggerated. The eight spectral curves appear more similar to one another compared with those in . In , the OSC applied in this study effectively reduced the influence of moisture and furthered the similarity of the spectral curves from the eight moisture levels. This finding suggests that the OSC may be used to remove the soil moisture effects on the Vis/NIR prediction of soil TN content. reveals that the eight spectral curves with GLSW transformation fluctuate significantly. However, similar trends of these curves can still be observed in the region of 400–2350 nm.

Effect of moisture content on the transferability of SG-PLSR models

The transferability of the eight SG-PLSR models was assessed by using and RPD (). When ≥ 0.5 and RPD ≥ 1.4, models could be transferred among different moisture levels, and the larger, the better. Models calibrated from the moist samples can be transferred to samples with moisture differences of approximately 50 g kg−1 (the adjacent moisture gradient). However, these models failed to predict the TN content when the differences of moisture content were greater than 100 g kg−1. The model calibrated by using the air-dried samples (0–50 g kg−1) exhibited poor transferability in estimating TN with the moist samples. Thus, without additional spectral transformations, the SG-PLSR models need to be calibrated at each of the moisture gradients for accurate TN estimations.

Figure 5. Transferability of SG-PLSR models for total nitrogen (TN) estimation with soil samples of eight moisture gradients as assessed by (a) determination coefficient for validation () and (b) RPD. It should be noted that the diagonals are (a) determination coefficient for calibration () and (b) standard deviation of prediction set divided by root mean square error of calibration.
Figure 5. Transferability of SG-PLSR models for total nitrogen (TN) estimation with soil samples of eight moisture gradients as assessed by (a) determination coefficient for validation () and (b) RPD. It should be noted that the diagonals are (a) determination coefficient for calibration () and (b) standard deviation of prediction set divided by root mean square error of calibration.

Effect of spectral transformations on the correlations between soil spectra and TN

The Pearson correlations between differently pretreated spectra and soil TN at all eight moisture levels were explored. The correlation curves were line-plotted in The spectral wavelengths with significant correlation at the confidence interval of 0.01 were highlighted with black cross symbols. The significant wavelength variables may vary with different spectral pretreatments. No evident trend was found for all eight correlation coefficient curves from the spectra with SG smoothing []. This result indicates that the soil moisture significantly interfered with the correlation between reflectance and TN content. The FD-based correlation curves fluctuated in the entire spectral region. The shapes of the eight curves were similar in the visible region but slightly different in the NIR region of 1400–2350 nm []. sure 6(c) shows that the shapes of the eight correlation curves for the OSC-pretreated spectra and TN content were relatively similar. The same phenomenon was found in the GLSW-pretreated spectra []. These results suggest that the latter two spectral preprocessing techniques could be useful in determining the common wavelengths, which are important for TN estimations at different moisture levels. Thus, the model calibrated from these common wavelengths could be transferred in the soil samples with different moisture contents.

Figure 6. Pearson’s correlation coefficient between total nitrogen (TN) content and differently pretreated spectra: (a) spectra with Savitzky–Golay (SG) smoothing, (b) spectra with SG smoothing and FD, (c) spectra with SG smoothing and OSC, and (d) spectra with SG smoothing and GLSW.
Figure 6. Pearson’s correlation coefficient between total nitrogen (TN) content and differently pretreated spectra: (a) spectra with Savitzky–Golay (SG) smoothing, (b) spectra with SG smoothing and FD, (c) spectra with SG smoothing and OSC, and (d) spectra with SG smoothing and GLSW.

Parameters optimization for the OSC-PLSR and GLSW-PLSR models

The parameters that need to be optimized for OSC-PLSR and GLSW-PLSR are c, the number of OSC filter dimensions and α, the weights of GLSW filter intensity. shows the RMSE values of the TN prediction with different c and α. Results show that the lowest RMSE values were obtained when c = 3 (for the OSC-PLSR model) and α = 0.001 (for the GLSW-PLSR model). Therefore, we chose c = 3 and α = 0.001 for OSC and GLSW filters, respectively.

Figure 7. RMSEP of total nitrogen (TN) prediction models with the seven moist data-sets as validation sets: (a) OSC-PLSR models with different c values (the number of OSC filter dimensions); (b) GLSW-PLSR models with different α values (the weights of GLSW filter intensity).
Figure 7. RMSEP of total nitrogen (TN) prediction models with the seven moist data-sets as validation sets: (a) OSC-PLSR models with different c values (the number of OSC filter dimensions); (b) GLSW-PLSR models with different α values (the weights of GLSW filter intensity).

Performances of the four PLSR models

presents the results of the four PLSR models for estimating TN content. The optimal number of latent variables (NLV) determined by leave-one-out cross validation for the SG-PLSR, FD-PLSR, OSC-PLSR, and GLSW-PLSR models were 4, 5, 4, and 3, respectively. In terms of model calibration by using the (, Y), (, Y), (, Y), and (, Y) data-sets, the determination coefficient for calibration () values were between 0.686 and 0.897, with the between 0.619 and 0.699. The results indicate successful model calibrations. Specifically, the OSC-PLSR model performs better than the SG-PLSR and the FD-PLSR models do, but not as good as the GLSW-PLSR model in terms of NLV and .

Table 2. Prediction of soil total nitrogen (TN) content using Vis/NIR spectroscopy with different preprocessing methods.

The four PLSR models were used to predict the TN content in soil samples with varying moisture contents (data-sets (, Y), (, Y), (, Y), and (, Y), i = 2, …, 8). Thus, for each of the PLSR models, seven validation sets corresponding to the seven moisture gradients were employed. , RMSEP, and RPD were calculated in each of the validation processes. Their mean and standard deviation are shown in details the prediction performance of the four PLSR models for each of the seven soil moisture levels.

Figure 8. Performances of the four PLSR models: SG-PLSR, FD-PLSR, OSC-PLSR, and GLSW-PLSR. The air-dried sets were used for model calibration, whereas the other seven sets with moisture gradients were used for validations.
Figure 8. Performances of the four PLSR models: SG-PLSR, FD-PLSR, OSC-PLSR, and GLSW-PLSR. The air-dried sets were used for model calibration, whereas the other seven sets with moisture gradients were used for validations.

Poor predictions were obtained for the SG-PLSR model when the model calibrated with the air-dried samples was applied to predict the TN content of moist samples. The mean values of , RMSEP, and RPD were 0.468, 0.357, and 1.382, respectively. The (0.331–0.589), RMSEP (0.315–0.402), and RPD (1.226–1.563) for the seven validation sets varied dramatically, indicating unstable prediction ability. For the FD-PLSR model, the predictions were less affected by SMC. The mean values of increased to 0.557 and the mean values of RPD increased to 1.493, whereas those of the RMSEP reduced to 0.330, which were better compared with the predictions of the SG-PLSR model. However, two out of the seven values of and RPD were below 0.5 and 1.4, respectively. This finding denotes the limitations of this model. Besides, the moisture effects on the TN estimation remained because was considerably less than .

Therefore, the OSC-PLSR and GLSW-PLSR models were calibrated to remove the soil moisture effects on TN predictions. The mean values of for these two models significantly increased and approached their . All of the seven RPDs are greater than 1.4, and the RMSEP decreased from above to below 0.29. These results show that the prediction accuracy of the soil with moisture gradients is comparable to the prediction of air-dried soils. Moreover, less fluctuation was observed in , RMSEP, and RPD curves for the OSC-PLSR and GLSW-PLSR models. This finding indicates that the predictions for all seven sets of moist samples were similar in spite of variations in moisture contents. These results proved the efficiency of the OSC-PLSR and GLSW-PLSR in the removal of soil moisture effects on the Vis/NIR estimation of TN. The GLSW-PLSR model (mean of = 0.718, mean of RMSEP = 0.262, and mean of RPD = 1.885) performed better than the OSC-PLSR model did (mean of = 0.695, mean of RMSEP = 0.277, and mean of RPD = 1.780). This finding confirms that the quantitative prediction of soil TN from the soil samples with varied SMCs is possible.

Discussion

Diffuse reflectance spectroscopy has been reported to have potential as a rapid screening method to characterize soil chemical properties (Udelhoven et al. Citation2003). Unfortunately, field spectroscopy performed not as well as laboratory spectroscopy probably due to strong interferences of environmental factors, such as soil moisture variations. Prediction accuracy depends upon sample preparation and spectral pretreatment selected before estimating a statistical model (Chang & Laird Citation2002). In this study, four modeling strategies were used to examine their ability of removing the soil moisture effects on the Vis/NIR estimation of soil TN.

Considering all the indicators used to assess the quality of model predictions, the performance of the four models can be ranked as GLSW-PLSR > OSC-PLSR > FD-PLSR > SG-PLSR. Such a result may be explained by the fact that the SG- or FD-PLSR model is just to deal with spectral data itself, and does not consider the spectral differences between the air-dried and the moist soil sample sets. Such process may miss some useful information and cannot eliminate the soil moisture effect entirely, which will further affect the quality of modeling. In addition, the FD algorithm can remove the baseline drift and background interferences (Rinnan et al. Citation2009), which were partially caused by soil moisture. The interferences caused by soil moisture are nonlinear. Thus, the FD cannot eliminate the influence of moisture completely. By contrast, the OSC- and GLSW-based models introduced in this study showed promising results and could effectively eliminate the effects of soil moisture on the Vis/NIR predictions of TN. However, it should be pointed out that, through too few OSC components, the OSC can only remove parts of the signals that are unrelated (orthogonal) to Y. In contrast, removing too many OSC components may result in erosion of the useful signal (Boulet et al. Citation2007). These defects decrease the prediction accuracy of OSC-PLSR models. Thus, appropriate number of OSC components is important for OSC-PLSR in effectively eliminating the effects of soil moisture on the Vis/NIR prediction of TN. GLSW uses the eigenvectors and eigenvalues of a covariance matrix to down weight signal identified as being from known interferences or differences between samples which should otherwise be the same (Zorzetti et al. Citation2011). Results indicated that GLSW-PLSR model obtained a better performance than the other models.

In addition, we compared the important wavelengths for estimating the soil TN between the models calibrated using the air-dried and moist soil sample sets (with SMC = 150–200 g kg−1 as an example). Thereafter, from the perspective of important wavelengths, we further evaluated the potentials of the four spectral pretreatments in removing the interferences of soil moisture. For a specific spectral pretreatment, if the important wavelengths of the air-dried and the moist samples-based models are similar in their distribution, it may indicate that this pretreatment functions effectively in removing the soil moisture effect on the TN estimation. The variable importance in the projection (VIP) was used to identify the important wavelengths used in the PLSR calibrations (Chong and Jun Citation2005). If the VIP value for a specific wavelength is greater than 1, this spectral wavelength is then considered to be important (Gomez et al. Citation2008). The VIPs for predicting soil TN from the four preprocessed reflectance spectra are presented in We can see that the VIP scores of four PLSR models now display a marked difference. According to the order of , with the increasing number of important wavelengths, the distribution of the bands is becoming more and more wide. This suggests that the prediction abilities of models are gradually improved. From the perspective of moisture content, the similarity of VIP curves of four models are improving step by step. The similar patterns suggest that both data-sets estimate very similar regression models to predict soil TN, which were less affected by water content. This further proved the performance of the four models illustrated as GLSW-PLSR > OSC-PLSR > FD-PLSR > SG-PLSR.

Figure 9. The VIP scores of (a) SG-PLSR, (b) FD-PLSR, (c) OSC-PLSR, and (d) GLSW-PLSR modeling for TN estimation using the air-dried and moist samples (with SMC = 150–200 g kg−1 as an example), respectively.
Figure 9. The VIP scores of (a) SG-PLSR, (b) FD-PLSR, (c) OSC-PLSR, and (d) GLSW-PLSR modeling for TN estimation using the air-dried and moist samples (with SMC = 150–200 g kg−1 as an example), respectively.

Despite these differences, we found some common important bands of the four models. According to the VIP scores, the wavelengths near 1410, 1850, 2210, and 2350 nm were identified as the important wavelengths for TN estimation. Approximate wavelength regions around 1446, 1826, 2176, and 2366 nm were also identified by Morra et al. (Citation1991). Similarly, the wavelengths 1450, 1850, 2250, and 2330 nm were reported by Shi et al. (Citation2013) for TN content estimation. Specially, in our study, we found that the TN content showed high correlations with the wavelengths in the visible region. This could be explained by previous studies which reported visible wavelengths 600–700 nm (Vinogradov Citation1981); and 564.4 and 623.6 nm (Krishnan et al. Citation1980) were important for organic matter prediction. Because of the inherent link between total carbon and TN (Fystro Citation2002), soil TN content might be indirectly related to these wavelengths through soil total carbon (Shi et al. Citation2013).

Soil moisture is one of the major factors affecting soil reflectance in the Vis/NIR region. Some researchers (e.g., Hummel et al. Citation2001) estimated soil chemical properties by developing a calibration data-set with samples covering a wide range of moisture contents. While our study demonstrated the applications of OSC and GLSW in the removal of soil moisture effects on the Vis/NIR prediction of TN content in soil. For operational application in the field situation, the moisture of soil samples may vary spatially. In order to calibrate a TN estimation model which is less influenced by or free of soil moisture effect, the GLSW and OSC may be applied to the Vis/NIR spectra of samples with a wide range of moisture contents. Thereafter, the PLSR approach can be used for model calibrations. It should be noted that the spatial variations of other external parameters besides the moisture might weaken the predictive ability of the TN estimation model. Further investigations are needed to determine whether the effects of other factors, such as soil particle size, can be removed by algorithms, such as OSC and GLSW. In our future studies, the models calibrated in our study need to be validated in other study areas, and the formulation of “moisture-free” or “other-external-factors-free” model for the Vis/NIR estimation of soil chemical properties is encouraged to transfer this technique from laboratory study to field application. Besides, airborne and satellite hyperspectral images are useful data sources in mapping TN contents over large areas (Shi et al. Citation2013). Several studies have showed that the outdoor and airborne/satellite imaging spectroscopy have potentials for the estimation of soil properties (Stevens et al. Citation2010; Viscarra Rossel & Behrens Citation2010). This opens further perspectives to airborne hyperspectral sensors, such as the HyMap and AVIRIS sensors, which characterize soil reflectance in the Vis/NIR spectral region (400–2500 nm) with a relatively high spectral and spatial resolution (Udelhoven et al. Citation2003). Therefore, the proposed strategies for removing moisture effects on the Vis/NIR estimation of TN based on outdoor and airborne/satellite data should be explored to provide accurate information for precision agriculture.

Acknowledgments

The authors are thankful to Prof. Yanfang Liu for her support in the laboratory analyses and Prof. Guofeng Wu for his valuable comments. We would also like to thank Mr Kai Guo, Mr Pujiang Huang, and Ms JingQian for their helpful advices on the methodology.

Funding

This work is supported by the National 12th Five-Years Key Technology R&D Program of China [grant number 2011BAB01B06].

Additional information

Funding

Funding: This work is supported by the National 12th Five-Years Key Technology R&D Program of China [grant number 2011BAB01B06].

References

  • Barnes R, Dhanoa M, Lister SJ. 1989. Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra. Appl Spectrosc. 43:772–777. 10.1366/0003702894202201
  • Bernhard A. 2010. The nitrogen cycle: processes, players, and human impact. Nat Educ Knowl. 2:12.
  • Boulet JC, Doco T, Roger JM. 2007. Improvement of calibration models using two successive orthogonal projection methods. Application to quantification of wine mannoproteins. Chemom Intell Lab Syst 87:295–302. 10.1016/j.chemolab.2007.03.008
  • Chang C-W, Laird DA. 2002. Near-infrared reflectance spectroscopic analysis of soil C and N. Soil Sci. 167:110–116. 10.1097/00010694-200202000-00003
  • Choe E, van der Meer F, van Ruitenbeek F, van der Werff H, de Smeth B, Kim K-W. 2008. Mapping of heavy metal pollution in stream sediments using combined geochemistry, field spectroscopy, and hyperspectral remote sensing: a case study of the Rodalquilar mining area, SE Spain. Remote Sens Environ. 112:3222–3233. 10.1016/j.rse.2008.03.017
  • Chong I-G, Jun C-H. 2005. Performance of some variable selection methods when multicollinearity is present. Chemom Intell Lab Syst. 78:103–112. 10.1016/j.chemolab.2004.12.011
  • Ciarlo E, Conti M, Bartoloni N, Rubio G. 2007. The effect of moisture on nitrous oxide emissions from soil and the N2O/(N2O+ N2) ratio under laboratory conditions. Biol Fert Soil. 43:675–681.
  • Demattê JAM, Campos RC, Alves MC, Fiorio PR, Nanni MR. 2004. Visible–NIR reflectance: a new approach on soil evaluation. Geoderma. 121:95–112. 10.1016/j.geoderma.2003.09.012
  • Elmi AA, Madramootoo C, Hamel C, Liu A. 2003. Denitrification and nitrous oxide to nitrous oxide plus dinitrogen ratios in the soil profile under three tillage systems. Biol Fert Soil. 38:340–348. 10.1007/s00374-003-0663-9
  • Erhart E, Feichtinger F, Hartl W. 2007. Nitrogen leaching losses under crops fertilized with biowaste compost compared with mineral fertilization. J Plant Nutr Soil Sci. 170:608–614. 10.1002/jpln.200625181
  • Fang J, Rao S, Zhao S. 2005. Human-induced long-term changes in the lakes of the Jianghan Plain, Central Yangtze. Front Ecol Environ. 3:186–192.
  • Felipó M, Garau M. 1987. Comparison of biological and chemical methods to determine available nitrogen in sewage sludge amended soil. Biol Fert Soil. 5:26–30.
  • Fu Q, Wang J, Lin G, Suo H, Zhao C. 2012. Short-wave near-infrared spectrometer for alcohol determination and temperature correction. J Anal Methods Chem. 2012: Article ID 728128, 7.
  • Fystro G. 2002. The prediction of C and N content and their potential mineralisation in heterogeneous soil samples using Vis–NIR spectroscopy and comparative methods. Plant Soil. 246:139–149. 10.1023/A:1020612319014
  • Galvão LS, Vitorello Í. 1998. Variability of laboratory measured soil lines of soils from southeastern Brazil. Remote Sens Environ. 63:166–181. 10.1016/S0034-4257(97)00135-1
  • Gomez C, Lagacherie P, Coulouma G. 2008. Continuum removal versus PLSR method for clay and calcium carbonate content estimation from laboratory and airborne hyperspectral measurements. Geoderma. 148:141–148. 10.1016/j.geoderma.2008.09.016
  • Haubrock SN, Chabrillat S, Lemmnitz C, Kaufmann H. 2008. Surface soil moisture quantification models from reflectance data under field conditions. Int J Remote Sens. 29:3–29. 10.1080/01431160701294695
  • Hummel J, Sudduth K, Hollinger S. 2001. Soil moisture and organic matter prediction of surface and subsurface soils using an NIR soil sensor. Comput Electron Agric. 32:149–165. 10.1016/S0168-1699(01)00163-6
  • Idso S, Jackson R, Reginato R, Kimball B, Nakayama F. 1975. The dependence of bare soil albedo on soil water content. J Appl Meteor. 14:109–113. 10.1175/1520-0450(1975)014 < 0109:TDOBSA > 2.0.CO;2
  • Isaksson T, Næs T. 1988. The effect of multiplicative scatter correction (MSC) and linearity improvement in NIR spectroscopy. Appl Spectrosc. 42:1273–1284. 10.1366/0003702884429869
  • Keeney DR. 1982. Nitrogen—availability indices. Methods of soil analysis part 2 chemical and microbiological properties. Madison (WI): American Society of Agronomy; p. 711–733.
  • Krishnan P, Alexander JD, Butler B, Hummel JW. 1980. Reflectance technique for predicting soil organic matter. Soil Sci Soc Am J. 44:1282–1285. 10.2136/sssaj1980.03615995004400060030x
  • Lin W-S, Kuo B-J. 2013. Using the orthogonal projections methods for predicting rice (Oryza sativa L.) yield with canopy reflectance data. Int J Remote Sens. 34:1428–1448. 10.1080/01431161.2012.721939
  • Marco N, Antoine S, Carole N, Basl vW. 2013. Prediction of soil organic carbon for different levels of soil moisture using Vis–NIR spectroscopy. Geoderma. 199:37–42.
  • Michel K, Terhoeven-Urselmans T, Nitschke R, Steffan P, Ludwig B. 2009. Use of near-and mid-infrared spectroscopy to distinguish carbon and nitrogen originating from char and forest floor material in soils. J Plant Nutr Soil Sci. 172:63–70. 10.1002/jpln.200700246
  • Minasny B, McBratney AB, Bellon-Maurel V, Roger J-M, Gobrecht A, Ferrand L, Joalland S. 2011. Removing the effect of soil moisture from NIR diffuse reflectance spectra for the prediction of soil organic carbon. Geoderma. 167–168:118–124.
  • Morra M, Hall M, Freeborn L. 1991. Carbon and nitrogen analysis of soil fractions using near-infrared reflectance spectroscopy. Soil Sci Soc Am J. 55:288–291. 10.2136/sssaj1991.03615995005500010051x
  • Mouazen A, Kuang B, De Baerdemaeker J, Ramon H. 2010. Comparison among principal component, partial least squares and back propagation neural network analyses for accuracy of measurement of selected soil properties with visible and near infrared spectroscopy. Geoderma. 158:23–31. 10.1016/j.geoderma.2010.03.001
  • Neema D, Shah A, Patel A. 1987. A statistical optical model for light reflection and penetration through sand. Int J Remote Sens. 8:1209–1217. 10.1080/01431168708954765
  • Nelson DW, Sommers L. 1980. Total nitrogen analysis of soil and plant tissues. J Assoc Official Anal Chem. 63:770–778.
  • Olfs HW, Blankenau K, Brentrup F, Jasper J, Link A, Lammel J. 2005. Soil- and plant-based nitrogen-fertilizer recommendations in arable farming. J Plant Nutr Soil Sci. 168:414–431. 10.1002/jpln.200520526
  • Preys S, Roger JM, Boulet JC. 2008. Robust calibration using orthogonal projection and experimental design. Application to the correction of the light scattering effect on turbid NIR spectra. Chemom Intell Lab Syst. 91:28–33.
  • Reeves III JB. 2010. Near-versus mid-infrared diffuse reflectance spectroscopy for soil analysis emphasizing carbon and laboratory versus on-site analysis: where are we and what needs to be done? Geoderma. 158:3–14.
  • Rinnan Å, Berg Fvd, Engelsen SB. 2009. Review of the most common pre-processing techniques for near-infrared spectra. Trends Anal Chem. 28:1201–1222. 10.1016/j.trac.2009.07.007
  • Roger J-M, Chauchard F, Bellon-Maurel V. 2003. EPO–PLS external parameter orthogonalisation of PLS application to temperature-independent measurement of sugar content of intact fruits. Chemom Intell Lab Syst. 66:191–204. 10.1016/S0169-7439(03)00051-0
  • Saeys W, Mouazen AM, Ramon H. 2005. Potential for onsite and online analysis of pig manure using visible and near infrared reflectance spectroscopy. Biosyst Eng. 91:393–402. 10.1016/j.biosystemseng.2005.05.001
  • Savitzky A, Golay MJ. 1964. Smoothing and differentiation of data by simplified least squares procedures. Anal Chem. 36:1627–1639. 10.1021/ac60214a047
  • Shi T, Cui L, Wang J, Fei T, Chen Y, Wu G. 2013. Comparison of multivariate methods for estimating soil total nitrogen with visible/near-infrared spectroscopy. Plant Soil. 366:363–375.
  • Soriano-Disla JM, Janik LJ, Viscarra Rossel RA, Macdonald LM, McLaughlin MJ. 2013. The performance of visible, near and mid-infrared reflectance spectroscopy for prediction of soil physical, chemical and biological properties. Appl Spectrosc Rev. 49:139–186.
  • Stevens A, Udelhoven T, Denis A, Tychon B, Lioy R, Hoffmann L, Van Wesemael B. 2010. Measuring soil organic carbon in croplands at regional scale using airborne imaging spectroscopy. Geoderma. 158:32–45. 10.1016/j.geoderma.2009.11.032
  • Terhoeven-Urselmans T, Michel K, Helfrich M, Flessa H, Ludwig B. 2006. Near-infrared spectroscopy can predict the composition of organic matter in soil and litter. J Plant Nutr Soil Sci. 169:168–174. 10.1002/jpln.200521712
  • Udelhoven T, Emmerling C, Jarmer T. 2003. Quantitative analysis of soil chemical properties with diffuse reflectance spectrometry and partial least-square regression: a feasibility study. Plant Soil. 251:319–329. 10.1023/A:1023008322682
  • Vinogradov B. 1981. Remote sensing of the humus content of soils. Sov Soil Sci. 11:114–123.
  • Viscarra Rossel RA, Behrens T. 2010. Using data mining to model and interpret soil diffuse reflectance spectra. Geoderma. 158:46–54. 10.1016/j.geoderma.2009.12.025
  • Viscarra Rossel RA, Jeon YS, Odeh IOA, McBratney AB. 2008. Using a legacy soil sample to develop a mid-IR spectral library. Aust J Soil Res. 46:1–16. 10.1071/SR07099
  • Viscarra Rossel RA, McGlynn RN, McBratney AB. 2006. Determining the composition of mineral–organic mixes using UV–vis–NIR diffuse reflectance spectroscopy. Geoderma. 137:70–82. 10.1016/j.geoderma.2006.07.004
  • Viscarra Rossel RA, Walvoort DJJ, McBratney AB, Janik LJ, Skjemstad JO. 2006. Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma. 131:59–75. 10.1016/j.geoderma.2005.03.007
  • Wang J, Cui L, Gao W, Shi T, Chen Y, Gao Y. 2014. Prediction of low heavy metal concentrations in agricultural soils using visible and near-infrared reflectance spectroscopy. Geoderma. 216:1–9. 10.1016/j.geoderma.2013.10.024
  • Weidong L, Baret F, Xingfa G, Qingxi T, Lanfen Z, Bing Z. 2002. Relating soil surface moisture to reflectance. Remote Sens Environ. 81:238–246. 10.1016/S0034-4257(01)00347-9
  • Williams P. 2004. Near-infrared technology: getting the best out of light: a short course in the practical implementation of near-infrared spectroscopy for the user. Nanaimo (BC): PDK Projects Incorporated.
  • Wise BM, Gallagher N, Bro R, Shaver J, Windig W, Koch RS. 2007. PLS Toolbox 4.0. Wenatchee (WA): Eigenvector Research Incorporated; p. 3905.
  • Wold S, Antti H, Lindgren F, Ohman J 1998. Orthogonal signal correction of near-infrared spectra.pdf. Chemom Intell Lab Syst. 44:175–185.
  • Wold S, Sjöström M, Eriksson L. 2001. PLS-regression: a basic tool of chemometrics. Chemom Intell Lab Syst. 58:109–130. 10.1016/S0169-7439(01)00155-1
  • Wu C-Y, Jacobson AR, Laba M, Baveye PC. 2009. Alleviating moisture content effects on the visible near-infrared diffuse-reflectance sensing of soils. Soil Sci.174:456–465. 10.1097/SS.0b013e3181b21491
  • Yakovchenko V, Sikora LJ, Kaufman DD. 1996. A biologically based indicator of soil quality. Biol Fert Soil. 21:245–251.
  • Zhao S, Fang J, Ji W, Tang Z. 2003. Lake restoration from impoldering: impact of land conversion on riparian landscape in Honghu Lake area, Central Yangtze. Agric Ecosyst Environ. 95:111–118.
  • Zhu Y, Deng D, Lu Y, Chen M. 1984. Study on the rule of soil reflectance varied with different soil moisture. Soil J (China). 25:194–202.
  • Zorzetti BM, Shaver JM, Harynuk JJ. 2011. Estimation of the age of a weathered mixture of volatile organic compounds. Anal Chim Acta. 694:31–37. 10.1016/j.aca.2011.03.021

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.