Publication Cover
Canadian Journal of Remote Sensing
Journal canadien de télédétection
Volume 47, 2021 - Issue 6
6,885
Views
8
CrossRef citations to date
0
Altmetric
Articles

Multi-Sensor Aboveground Biomass Estimation in the Broadleaved Hyrcanian Forest of Iran

Estimation multi-capteurs de la biomasse aérienne de la forêt de feuillus hyrcanienne d’Iran

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 818-834 | Received 11 Jul 2020, Accepted 10 Aug 2021, Published online: 07 Sep 2021

Abstract

In this study, the capability of Landsat-8 (L8), Sentinel-2 (S2), Sentinel-1 (S1), and their combination was investigated for estimating aboveground biomass (AGB). A pure stand of Fagus Orientalis located in the Hyrcanian forest of Iran was selected as the study area. The performance of a parametric approach, i.e., Multiple Linear Regression (MLR) model and non-parametric approaches, i.e., k-Nearest Neighbor (k-NN), Random Forest (RF), and Support Vector Regression (SVR), were also evaluated for AGB estimations. Our results indicated that among S2 metrics, the FAPAR canopy biophysical index and NDVI index based on the red-edge band (NIR-b8a) have the highest correlation coefficient (r) of 0.420 and 0.417, respectively. The results of AGB estimation showed that a combination of S2 and S1 datasets using the k-NN algorithm had the best accuracy (R2 of 0.57 and rRMSE of 14.68%). The best rRMSE using L8, S2, and S1 datasets was 18.95, 16.99, and 19.17% using k-NN, k-NN, and MLR algorithms, respectively. The combination of L8 with S1 dataset also improved the rRMSE relative to L8 and S1 separately by 0.96 and 1.18%, respectively. We concluded that the combination of optical data (L8 or S2) with SAR data (S1) improves the broadleaved Hyrcanian AGB estimation.

RÉSUMÉ

Dans cette étude, la capacité de Landsat-8 (L8), Sentinel-2 (S2), Sentinel-1 (S1) et leur combinaison ont été étudiées pour estimer la biomasse aérienne (AGB). Un peuplement pur de Fagus Orientalis situé dans la forêt hyrcanienne d’Iran a été choisi comme zone d’étude. Le rendement d’une approche paramétrique, c'est-à-dire, le modèle de régression linéaire multiple (MLR) et les approches non paramétriques, c'est-à-dire, k-Nearest Neighbor (k-NN), Random Forest (RF) et Support Vector Regression (SVR), ont été évalués pour les estimations de la biomasse. Nos résultats indiquent que parmi les mesures S2, l’indice biophysique de la canopée FAPAR et l’indice NDVI basé sur la bande red-edge (NIR-b8a) ont les coefficients de corrélation les plus élevés (r) soit 0,420 et 0,417 respectivement. Les résultats de l’estimation de l’AGB montrent qu’une combinaison des données S2 et S1 utilisant l’algorithme k-NN donne la meilleure précision (R2 de 0,57 et rRMSE de 14,68%). Le meilleur rRMSE en utilisant les ensembles de données L8, S2 et S1 était de 18,95%, 16,99% et 19,17% en utilisant respectivement les algorithmes k-NN, k-NN et MLR. La combinaison des ensembles de données L8 et S1 a également amélioré le rRMSE de 0,96% et 1,18% par rapport aux données L8 et S1 séparément. En conclusion, la combinaison des données optiques (L8 ou S2) avec les données SAR (S1) améliore l'estimation de l'AGB de la forêt de feuillus hyrcanienne.

Introduction

Forests contain 80% of carbon stocks in terrestrial ecosystems (Wani et al. Citation2015). Forests play a crucial role in carbon sequestration and mitigating the impact of climate change (Olson et al. Citation1983). Forest aboveground biomass (AGB) is the main pool of total biomass in a forested area. It is also used as an indicator to monitor forest health (Su et al. Citation2020; Pandey et al. Citation2019; Chen et al. Citation2018; Brown et al. Citation1997). An accurate AGB estimation at different spatial and temporal scales is essential for reducing uncertainties in the terrestrial carbon budget; also, it provides critical information for forest management planning (Pan et al. Citation2011). Although field measurement provides the most accurate AGB information, it is destructive, costly, and time-consuming. Also, due to limited accessibility from terrain features, field measurements may be limited in application (Wu et al. Citation2016). Integration of field measurement and remote sensing data is an alternative approach for AGB estimation over large areas with a reliable accuracy (Kumar and Mutanga Citation2017; Zhao et al. Citation2016).

Advances in remote sensing technology offer new opportunities to quantitatively estimate the forest attributes, i.e., AGB using Light Detection and Ranging (LiDAR), Synthetic Aperture Radar (SAR), and optical remotely sensed data. The sole use of optical and SAR data or a combination of both datasets has been frequently used for forest structural prediction (Lu et al. Citation2016). In addition, the different modeling approaches, including parametric and non-parametric algorithms, have been assessed. The main findings can be summarized as (1) the combination of optical and SAR datasets improved the performance of AGB estimations and, (2) non-parametric approaches were more accurate for AGB estimation than parametric approaches (Poorazimy et al. Citation2020; Chen et al. Citation2018; Mura et al. Citation2018; Ghosh and Behera Citation2018; Pandit et al. Citation2018; Castillo et al. Citation2017; Chrysafis et al. Citation2017; Poorazimy et al. Citation2017; Vafaei et al. Citation2017; Fuchs et al. Citation2009). Although many studies have addressed forest AGB estimation, accurate estimation is still challenging (Poorazimy et al. Citation2020; Astola et al. Citation2019; Moradi et al. Citation2018; Motlagh et al. Citation2018; Ronoud and Darvishsefat Citation2018; Korhonen et al. Citation2017; Fernández-Manso et al. Citation2016; Immitzer et al. Citation2016; Yadav and Nandy Citation2015; Amini Baneh Citation2013; Wijaya et al. Citation2010; Khorrami et al. Citation2008; Hall et al. Citation2006; Lu Citation2005; Zheng et al. Citation2004).

The freely available satellite data such as Landsat and Sentinel have increased the necessity for more studies on the estimation of forest biophysical attributes. Landsat-8 (L8), launched in 2013, provides more accurate radiometric and spectral images than the previous Landsat TM and ETM + sensors (Zhu et al. Citation2019). Sentinel-2A and Sentinel-2B were launched in 2015 and 2017, respectively. Sentinel-2 (S2) acquires images from the terrestrial ecosystems with a five-day temporal resolution and a swath width of 290-km (Drusch et al. Citation2012). Its Multi-Spectral Imager (MSI) sensor offers 13 spectral bands with a spatial resolution of 10–60 m. In addition to temporal resolution and the ability in multi-purposes applications, S2 provides three novel spectral bands in the red-edge region placed at 705, 740, and 780-nm at 20-m spatial resolution; thus, it may increase the accuracy of forest biophysical parameters estimation (Sentinel-2_Team Citation2015; Delegido et al. Citation2011). Due to the red-edge spectral bands, S2 data can be compared to other commercial satellites such as Worldview-2 and RapidEye. Therefore, they are valuable for assessing and monitoring of forested areas (Pandit et al. Citation2018). Polarimetric acquisitions, wide-area coverage, and shorter revisit times are among unique SAR data specifications and play an important role in AGB estimation (Poorazimy et al. Citation2020; Periasamy Citation2018; Laurin et al. Citation2018; McNairn and Shang Citation2016). Sentinel-1A and Sentinel-1B were among SAR satellites launched in 2014 and 2016. Sentinel-1 (S1) has a C-band (5.405 GHz) and spatial and temporal resolution of 5 × 20-m and 12 days, respectively (Sentinel-1_Team Citation2013). S1 operates with two polarization channels of VV and VH and has been used for AGB estimation (Kumar et al. Citation2019; Navarro et al. Citation2019; Berninger et al. Citation2018; Ghosh and Behera Citation2018; Huang et al. Citation2018; Laurin et al. Citation2018; Periasamy Citation2018; Omar et al. Citation2017). It is essential to mention that the long-wavelength SAR data are more sensitive to AGB (Ouchi Citation2013). However, these data are not freely available. Hence, many attempts have been made to predict AGB based on the short wavelength SAR data, i.e., S1 imagery.

In addition to freely remotely sensed datasets that can be used for AGB estimation, a comparison scheme on the performance of each parametric and non-parametric modeling approaches can enroute to accurate AGB estimation. Because each of the prediction algorithms for example parametric multiple linear regression (MLR), and non-parametric k-Nearest Neighbor (k-NN), Random Forest (RF) and Support Vector Regression (SVR) have their own region of best performance. So, the results are specific to each study area.

Hyrcanian forests of Iran are distributed along the Caspian Sea and the northern slopes of the Alborz mountains. These forests are remnants of the Pleistocene period and play an important role in multiple aspects, including biodiversity, commercial products, and climate change (Marvi Mohadjer Citation2007). Much attention has been given to quantify Hyrcanian forests using remote sensing data. However, the capability of SAR data in these forests has not been well established, and only a few studies exist using optical data. Besides that, investigating multi-source remotely sensed data for Hyrcanian forests has emerged in recent years as a promising scheme to estimate forest AGB. The objectives of this study are to (1) evaluate the capability of L8, S2, S1 and their combination for forest AGB estimation in the Hyrcanian forests, and (2) compare the performance of different AGB estimation approaches, including a parametric approach (i.e., MLR), and non-parametric approaches (i.e., k-NN, RF, and SVR).

Materials and methods

Study area

The study area is the Kheyruod research forest, which is located in the Hyrcanian forests, North of Iran. The Kheyruod forest research station was established in 1967 and is managed by the Department of Forest and Forest Economics, University of Tehran. It has an 8000-ha area and situated between Longitude 51°.32′–51°.43′ E, and Latitudes 36°.27–36°.40′ N. The kheyruod forest research consist of seven management districts. shows the distribution of the field plots over the Gorazbon and Namkhaneh districts. The elevation of the selected area ranges from 1000-m to 1500-m a.s.l. According to Nowshahr synoptic station, the mean annual precipitation is 1300-mm. The dominant species include Fagus Orientalis, Carpinus Betulus, Acer sp., and Alnus Subcordata.

Figure 1. Study area in Iran and the distribution of field plots (red dots) in the Gorazbon and Namkhaneh districts.

Figure 1. Study area in Iran and the distribution of field plots (red dots) in the Gorazbon and Namkhaneh districts.

Field data

A nondestructive sampling method was conducted to estimate AGB in the field. Based on a typology map, we applied a stratified random sampling approach across the study area. We measured 65 field sample plots with an area of 2025-m2 (45 m × 45 m) in beech dominant tree stands (i.e., stands with beech fraction more than 80%). Field sampling was performed in August 2014 (). On each plot, tree species and diameter at breast height (DBH) were recorded. All trees with DBH larger than 7.5-cm were considered.

AGB estimation

The volume of each individual tree was calculated using a Tariff table. The Tariff table was developed for Gorazbon and Namkhaneh forest districts to predict tree volume based on DBH attribute by the Forestry and Forest Economics Department, University of Tehran. Tree biomass was calculated using Equationequation (1) (Enayati Citation2011; Brown and Lugo Citation1984). (1) AGB=V×WCD (1) where AGB is aboveground tree biomass (Mg.ha−1), V is the volume of a tree derived from the Tariff table, and WCD is wood-critical density. The value of 0.56 Mg/m3 was used for Fagus Orientalis as wood critical density (Tarmian et al. Citation2009). Individual tree biomass was summed up to calculate plot-level AGB (Mg·ha−1). The field data were split randomly into a training dataset (i.e., 70% of the field sample plots) and a validation dataset (i.e., 30% of the field sample plots). Plot level AGB for training and validation datasets are summarized in .

Table 1. Summarized plot level AGB statistics.

Remote sensing data

Three remote sensing datasets, including L8, S2, and S1, were used for Hyrcanian forest AGB estimation (). The L8 data were downloaded from the United States Geological Survey (USGS) Earth Explorer data portal (https://earthexplorer.usgs.gov/). The sentinel data was obtained from the European Space Agency (ESA) Copernicus Open Access Hub (https://scihub.copernicus.eu/dhus/#/home). The Sentinel Application Platform (SNAP) (version. 6) (http://step.esa.int/main/toolboxes/snap/) and IDRISI Selva software packages were used for L8 and Sentinel data processing. The digital topographic maps provided by the National Cartographic Center (NCC) of Iran at 1/25,000 scale were used to check the geometric accuracy of images. The detailed information on optical and SAR data processing is presented in the following sections. The spatial resolution of all images was resampled to 5-m resolution using Nearest Neighbor interpolation.

Table 2. Satellite imagery acquisition dates and metrics derived from L8, S2, and S1.

Optical data processing

The radiometric quality of data was assessed. S2 Level-1C data were corrected to obtain a level-2A dataset using the SEN2COR atmospheric processor (http://step.esa.int/main/third-party-plugins-2/sen2cor/). In addition to the spectral bands, previous studies recommended using the transformed procedures to generate more spectral metrics sensitive to the forest structural attributes variation, i.e., vegetation indices, Tasseled Cap transformation (Greenness component) (Nedkov Citation2017; Ali Baig et al. Citation2014), Principle Component Analysis (PCA), Fusion of spectral bands with the panchromatic band (applied only to OLI data), and canopy biophysical and biochemical indices such as Leaf Area Index (LAI), Leaf Chlorophyll Content (Cab), Canopy Water Content (CWC) and Fraction of Absorbed Photosynthetically Active Radiation (FAPAR) (applied to MSI data) (). These mentioned biophysical indices are computed using PROSAIL radiative transfer model (For detailed information, please refer to Weiss and Baret Citation2016; Jacquemoud et al. Citation2009). Many studies have shown the efficiency of these transformed spectral metrics for vegetation attributes estimation (Liu et al. Citation2019; Putzenlechner et al. Citation2019; Chen et al. Citation2018; Castillo et al. Citation2017; Frampton et al. Citation2013).

SAR data processing

The Ground Range Detection (GRD) images were radiometrically calibrated and the values were converted to the Ƴ° backscatter coefficient according to the local incidence angle (Poorazimy et al. Citation2017; Tsui et al. Citation2012; Kellndorfer et al. Citation1998). The Refined Lee filter was applied to reduce the speckle effect. The terrain correction procedure was implemented on all images and finally inverted to dB using Equationequation (2). (2) DN(dB)=10log10(N) (2) where N is the value extracted from the preprocessed SAR images. Many studies have shown a direct relationship between polarization channels and vegetation AGB (Liu et al. Citation2019; Chen et al. Citation2018; Castillo et al. Citation2017). Therefore, we also used VHVV, VHVV, VH×VV, VH+VV2and VH×VV  as predictor variables ().

Correlation analysis and AGB modeling

We used the Pearson correlation analysis to determine the strength of relationships between AGB and remote sensing derived metrics. To predict forest AGB (dependent variable) from remote sensing metrics (independent variables), parametric and non-parametric approaches were applied. We used the stepwise multiple linear regression (MLR) model as the most common parametric approach (Poorazimy et al. Citation2020; Lu et al. Citation2016; Kumar et al. Citation2013). In addition, different non-parametric approaches, i.e., k-NN (Tomppo Citation1990; Tomppo and Halme Citation2004), RF (Breiman Citation2001), and SVR (Cortes and Vapnik Citation1995; Vapnik Citation1995) were also assessed. Before implementing the MLR, the normality assumption of the dataset was checked using the Kolmogorov–Smirnov Test (Tojal et al. Citation2019; Kleinbaum et al. Citation2013). The collinearity was assessed using the Variance Inflation Factor (VIF) and Tolerance Index to ensure that the predictors were not highly correlated (Tojal et al. Citation2019; Kleinbaum et al. Citation2013). We also used the Durbin-Watson statistic to investigate the residual’s autocorrelation (Tojal et al. Citation2019; Kleinbaum et al. Citation2013). It is important to mention that only statistically significant predictors obtained from the Pearson correlation analysis were used in AGB modeling.

We used four different distance metrics, i.e., Euclidean, Euclidean Squared, Mahalanobis, and Manhattan, for determining the best number of k nearest neighbors with the k-NN method. In the case of the RF algorithm, the optimal k predictors were calculated as a square root of the predictor variables number ±2. Also, the optimal number of decision trees was determined based on the average squared errors of training and validation datasets. We considered four different kernels for the SVR algorithm, including Linear, Polynomial, Radial Basis Function (RBF), and Sigmoid. The statistical analysis was implemented using Statistica (version 10) and SPSS (version 22) software.

Accuracy assessment

The Coefficient of determination (R2), Root Mean Square Error (RMSE), relative RMSE (rRMSE), and Akaike Information Criterion (AIC) were used as criteria metrics for selecting the best fitting models for the validation dataset (EquationEquations 3–6). (3) r2=[i=1N(OiO¯)(PiP¯)]2(i=1N(OiO¯))(i=1N(PiP¯))(3) (4) RMSE=[N1i=1N(PiOi)2]0.5 (4) (5) rRMSE=RMSEO¯×100(5) (6) AIC=Nln(RMSE)+2t(6) where N is the number of field data, Oi is the observed value, Pi is the predicted value, O¯ is the average of observed values, and t is the number of predictors in the model. The flowchart of the methodology is presented in .

Figure 2. The flowchart of applied methodology to predict forest AGB.

Figure 2. The flowchart of applied methodology to predict forest AGB.

Results

Correlation analysis

The applied normality test showed that the data are normally distributed (p = 0.85). The Pearson correlation coefficients computed for AGB and remote sensing derived metrics are provided in . The most important metrics to estimate forest AGB were the first component of Principle Component Analysis (PCA) using the spectral bands for L8 (r = 0.367), FAPAR for S2 (r = 0.42), and VH Ƴ° backscatter coefficient (r=-0.351) for S1. It can be seen from that S2 metrics showed the highest correlation for AGB estimates, and as we will explain in the next section, S2 was the best dataset in both individual and combination forms.

Table 3. Correlation analysis between AGB and remote sensing derived metrics.

AGB modeling using MLR algorithm

The best models obtained from remotely sensed derived metrics and stepwise MLR are shown in . A combination of S2 and S1 datasets, i.e., FAPAR canopy biophysical index and VH Ƴ° backscatter coefficient as predictors, explained more variability in forest AGB (R2=0.34 and rRMSE = 17%). shows that the residual graph is normally distributed. A combination of L8 and S1, and S2 datasets were in the second and third order of accuracy with rRMSE 53.33 and 54.54%, respectively.

Figure 3. Histogram and normal P–P plot of residuals for normality assessment.

Figure 3. Histogram and normal P–P plot of residuals for normality assessment.

Table 4. Selected models to estimate forest AGB based on stepwise MLR algorithm and multi-sensor datasets.

AGB modeling using non-parametric approaches

The results of k-NN for five sources of remote sensing datasets are summarized in . S2 dataset with rRMSE 16.99% showed higher potential for AGB estimation than S1 dataset with rRMSE 19.37%. The incorporation of S2 and S1 datasets performed better than other datasets. Among distance metrics, Manhattan produced more accurate results (i.e., R2 of 0.57 and an rRMSE of 14.68%).

Table 5. The results of the k-NN algorithm for AGB estimation using multi-sources remote sensing datasets.

The RF algorithm gained better results again for a combination of the S2 and S1 datasets (). The final RF model consists of 8 predictors with 500 trees that showed the highest predictive accuracy (R2=0.5 and rRMSE = 18.6%). Unlike our expectations, the S1 dataset had lower performance in comparison with other datasets for AGB estimation with R2=0.126 and rRMSE 20.02%.

Table 6. RF algorithm performance to estimate AGB using multi-sources remote sensing datasets.

The SVR models showed that a combination of S2 and S1 datasets had better performance than other datasets (). The selected SVR model with a sigmoid kernel explained 17.307% of forest AGB variation. The low R2=0.052 for L8 showed no meaningful relationship for AGB estimates, while the S2 dataset had a second order of accuracy with rRMSE 17.93% among other datasets.

Table 7. SVR algorithm performance to estimate AGB using multi-sources remote sensing datasets.

In general, the integration of the S2 and S1 datasets with the k-NN algorithm produced the best results for AGB estimation in our study area. As expected, S2 was more complementary with S1 rather than L8. The scatter plot of predicted versus measured AGB using the best combination dataset has been reported in . As it is observed, the fitted model had more ability to predict AGB until 200 trees/ha. The generated AGB map using the best model has shown in .

Figure 4. The estimated versus measured AGB based on S2 and S1 dataset (k-NN algorithm).

Figure 4. The estimated versus measured AGB based on S2 and S1 dataset (k-NN algorithm).

Figure 5. Predicted AGB over the study area using S2 and S1 dataset and k-NN algorithm.

Figure 5. Predicted AGB over the study area using S2 and S1 dataset and k-NN algorithm.

Discussion

The relationships between AGB and remote sensing derived metrics

L8 dataset showed that the first component of PCA applied to spectral bands (b1–b7) is most relevant to forest AGB estimation than other L8 variables with r = 0.367. Then, the first PCA component using bands 5-6, band-5 (NIR), band-6 (SWIR1), and Greenness component were highly correlated with AGB, respectively. Data fusion between band-5 (NIR) and panchromatic band significantly improved AGB estimation, but other fused bands did not show a high correlation. For the S2 dataset, the highest correlation observed for FAPAR biophysical index (r = 0.42), following by NDVI (based on the red-edge spectral band (NIR-b8a) and FCOVER canopy biophysical index with the correlation of 0.417 and 0.403, respectively. In several studies, the efficiency of canopy biophysical indices derived from the S2 dataset to predict vegetation attributes has been proved (Liu et al. Citation2019; Chen et al. Citation2018; Castillo et al. Citation2017). Also, we found that S2 spectral bands are positively correlated with AGB. Among them, a high correlation was obtained for band-8 (785–900-nm) and Band-8a (855–875-nm) with a correlation of 0.395. Three red-edge spectral bands acquired by S2 seem promising data to estimate vegetation properties. The first red-edge band (b5) did not show any significant correlation with AGB. This is in accordance with Korhonen et al. (Citation2017) for LAI estimation and in contrast to Chrysafis et al. (Citation2017) for AGB estimation. Red band (b4, r =0.283) and second and third red-edge bands (b6, r = 0.334 and b7, r = 0.382) had positive correlation with AGB. According to obtained results, there is a need for more investigation about this phenomenon. The sensitivity of vegetation indices to AGB changes was observed in our study (), which is in line with the results of other studies (Liu et al. Citation2019; Pham and Brabyn Citation2017; Sousa et al. Citation2015; Zhu and Liu Citation2015). Also, Vafaei et al. (Citation2017) reported a lower RMSE for AGB estimation using vegetation indices than ALOS-2 data in the Hyrcanian forest.

Most of the metrics derived from optical remotely sensed datasets showed significantly correlation with forest AGB in these broadleaved temperature forests. This means that with increasing forest AGB value, the reflectance is also increased. Because of the high canopy density and multi-storied Fagetum community in our study area, there was not any reflectance from the ground and floor vegetation. So, this can be a reason for our significant results. Our results were in the range of some other studies (Yadav and Nandy Citation2015; Amini Baneh Citation2013; Lu Citation2005).

The correlation analysis for the S1 dataset also showed a significant negative correlation between AGB and VH and VV Ƴ° backscatter coefficients (i.e., with the correlation of −0.351 and −0.295, respectively). Such significant relationships between the polarimetric channels (i.e., VH and VV) and AGB have been reported in other studies (Kumar et al. Citation2019; Omar et al. Citation2017; Suzuki et al. Citation2013). Van Pham et al. (Citation2019) showed that metrics computed by mathematical operation on different polarization channels were important for AGB estimation. We also found that multiplication, average, and root of multiplied polarimetric channels with the correlation of 0.341, −0.345, and −0.336, respectively produced more relevant metrics than VV Ƴ° backscatter coefficient for AGB estimation. In contrast, we did not observe any significant relationships between forest AGB and two metrics of ratio and difference between two polarimetric channels.

The performance of MLR and non-parametric approaches for AGB estimation

The best MLR model with rRMSE = 17% was based on a combination of S2 and S1 datasets, in which the FAPAR canopy biophysical index and VH Ƴ° backscatter coefficient were selected as predictors. In terms of accuracy for AGB estimation, the second and third models were a combination of L8 and S1, and S2 datasets, respectively. For L8 and S1 combination, the first component of PCA transformation applied to bands of 5–6 and VH Ƴ° backscatter coefficient were selected as predictors and yielded an rRMSE of 18%. For the S2 dataset, the metric of FAPAR was the most effective variable for AGB estimation and achieved an rRMSE of 18.4%. Parametric models might fail to provide good performance for estimating forest structural attributes because of their restricted assumptions. In practice, the relationships between AGB and remote sensing metrics are very complex, which resulted in low accuracy for parametric models. In contrast, non-parametric approaches have a predefined simple data structure and, with their flexibility, showed more potential for AGB estimation (Lu et al. Citation2016). We found a better performance for non-parametric approaches compared to the parametric MLR approach. Our results showed that the k-NN algorithm using a combination of S2 and S1 datasets produced the most accurate results than the other datasets and algorithms. The observed R2 and rRMSE were 0.57 and 14.68%, respectively. Our results showed that the k-NN method improved the AGB rRMSE relative to SVR, RF, and MLR by 2.62, 3.92, and 2.32%. Chirici et al. (Citation2016) summarized the results of 148 studies from 26 different countries in which forest structure attributes have been estimated using remote sensing datasets. They showed that the k-NN algorithm was a reliable approach for predicting forest structural attributes at different scales (i.e., local to global). Our k-NN related results are in accordance with findings of previous studies (Persson et al. Citation2021; Poorazimy et al. Citation2020; Mura et al. Citation2018; Bilous et al. Citation2017; Chirici et al. Citation2016; McRoberts et al. Citation2015; Yadav and Nandy Citation2015; Beaudoin et al. Citation2014; Gagliasso et al. Citation2014; Jung et al. Citation2013; Tian et al. Citation2014; Tomppo et al. Citation2008; McRoberts et al. Citation2007; Maselli et al. Citation2005; Tomppo and Halme Citation2004). Application of RF algorithm for AGB estimation had a better result for the combination of S2 and S1 dataset with R2 = 0.5 while it was worst for S1 dataset. However, there are many successful reports that show the performance of the RF algorithm for AGB estimation within different biophysical conditions (Liu et al. Citation2019; Ghosh and Behera Citation2018; Pandit et al. Citation2018; Chrysafis et al. Citation2017; Pflugmacher et al. Citation2014; Tanase et al. Citation2014; Latifi et al. Citation2010). Also, we found an rRMSE of 17.93% for AGB estimation using the S2 dataset and SVR algorithms. Similar results have been reported by Navarro et al. (Citation2019), Chen et al. (Citation2018), López-Serrano et al. (Citation2016), Mountrakis et al. (Citation2011), and Camps-Valls (2009). One of the advantages of the SVR algorithm is its capacity to deal with a low number of field sample plots (Lu et al. Citation2016). Also, SVR can predict the non-linear relationships between dependent and independent variables. Vafaei et al. (Citation2017) have reported an R2 of 0.61 for AGB estimation using the SVR algorithm in a small part of the Hyrcanian forest. They used the S2 dataset, and their reported accuracy is similar to our results. The performance of different approaches and datasets for AGB modeling is shown in . It demonstrates that the k-NN algorithm is well suited for forest AGB prediction compared to other algorithms. The lowest rRMSE was obtained using a combination of S2 and S1 datasets. Our results provide supporting evidence that a combination of active and passive datasets offers the optimal capability and sensitivity to model structural attributes, particularly over complex forest ecosystems (Fatehi et al. Citation2015). It is worth mentioning that there was a two-year time lag between S2 data and field sample plot collection while the S2 dataset showed its notable performance. In addition, significant revisit time of the S2 dataset may have a great potential for monitoring structural developments. As Mura et al. (Citation2018) reported a better performance for S2 compared to the Landsat-8 and RapidEye. Chrysafis et al. (Citation2017) and Astola et al. (Citation2019) also confirmed that S2 was more successful than L8 for predicting structural attributes.

Figure 6. The performance of each dataset with different modeling algorithm for AGB estimation.

Figure 6. The performance of each dataset with different modeling algorithm for AGB estimation.

S1 dataset has been used more in the sparse forests with low biomass and pastures (Castillo et al. Citation2017; Sinha et al. Citation2015). In our study area, the forest has a complex structure and high density of biomass, which may negatively affect SAR data’s sensitivity. One of the reasons for weak results obtained from SAR data is the saturation of the C-band in high biomass levels. Our minimum value of AGB is close to 200 Mg.ha−1. Although the non-parametric approaches performed better than MLR for the S1 dataset, the uncertainty is still high.

In our study, some uncertainties have been included in the AGB estimation procedure. First, the limited penetration into forest vertical structure caused some errors because most of the AGB concentration is in the trunk of trees (Lu et al. Citation2016). The temporal distance between remote sensing images and field data collection is the second influencing factor in our results. There was a two-year time lag between S2 data and field measurement sample plots. Third, we did not have access to the species-specific allometric equations for our study area. Therefore, there are uncertainties with using the general equation. Furthermore, the possible errors in the volume table could affect the results. Moreover, the GPS positional errors have a substantial impact on the results obtained from remote sensing studies. Finally, all metrics derived from spectral reflectance are affected by the atmosphere, soil moisture, phenology, and vegetation growth (Lu et al. Citation2016). All metrics place emphasis on the necessity of uncertainty analyses before formulating any final conclusion.

According to previous studies and our results, the L8, S2, and S1 datasets individually are not good enough for estimating forest AGB over the pure Fagus Orientalis at the plot level. The results support Moradi et al. (Citation2018), which also stated that Landsat-8 datasets for AGB estimating in Carpinus Betulus stands in Hyrcanian forests have limitations. However, some studies have reported an acceptable performance of these datasets in the mixed forest stands (Amini Baneh Citation2013; Rostami Andargoli Citation2008). The results present the differences in the type of sensors, sampling method, size and number of field sample plots, tree species, and structure of forest stands play important roles in the comparison of results. Therefore, there is a need for more research over the temperate broad-leaved forests. In accordance with past literature, the integration of different remote sensing datasets can improve the precision of results (Zhang et al. Citation2019; Vafaei et al. Citation2017; Chang and Shoshany Citation2016; Sinha et al. Citation2016; Shen et al. Citation2016; Laurin et al. Citation2013) and this strategy is recommended for future studies in Hyrcanian mixed forests of Iran.

Conclusion

Forest ecosystems play a crucial role in mitigating and adapting to climate change as they are the largest terrestrial carbon sink. Conversely, climate change can drive forest ecosystem loss and therefore there is a need for accurate and timely forest ecosystems monitoring. In this study, we evaluated the capability of spectral and transformed bands of L8, S2, and S1 for AGB estimation. The limited ability of optical and short-wavelength SAR data to penetrate the vertical structure of forests resulted in low sensitivity for forest AGB estimation. In comparison, the combination of optical and SAR datasets improved the forest AGB estimation accuracy when they were used individually. In this regard, S2 was more complementary than L8 when used in combination with S1. Very likely, it is because of a higher spatial resolution of S2 and the presence of red-edge bands and derived canopy biophysical indices. By combining remotely sensed datasets, the selected algorithm should be able to accommodate the different characteristics of multi-source data for AGB estimation. In addition, the relationship between AGB and remote sensing-based metrics is often complex, so comparative analyses to select the most accurate prediction technique is a common and necessary approach. We found that the k-NN algorithm has better performance than MLR, RF, and SVR algorithms. It is worth mentioning that each of the prediction algorithms has its own region of best performance, and the results are specific to each study area. Still, any generalization should be performed with caution and not without local validation. The use of LiDAR data and long-wavelength SAR data is recommended for future studies because they penetrate the vertical structure of the forest, which includes the most relevant component for AGB estimation. Also, providing species-specific allometric equations for AGB estimating is essential to predict accurate forest AGB.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References