1,741
Views
47
CrossRef citations to date
0
Altmetric
Original Articles

Modelling and prediction of crop losses from NOAA polar-orbiting operational satellites

, , , , &
Pages 886-900 | Received 12 Jan 2015, Accepted 14 Jan 2015, Published online: 10 Feb 2015

Abstract

Weather-related crop losses have always been a concern for farmers, governments, traders, and policy-makers for the purpose of balanced food supply/demands, trade, and distribution of aid to the nations in need. Among weather disasters, drought plays a major role in large-scale crop losses. This paper discusses utility of operational satellite-based vegetation health (VH) indices for modelling cereal yield and for early warning of drought-related crop losses. The indices were tested in Saratov oblast (SO), one of the principal grain growing regions of Russia. Correlation and regression analysis were applied to model cereal yield from VH indices during 1982–2001. A strong correlation between mean SO's cereal yield and VH indices were found during the critical period of cereals, which starts two–three weeks before and ends two–three weeks after the heading stage. Several models were constructed where VH indices served as independent variables (predictors). The models were validated independently based on SO cereal yield during 1982–2012. Drought-related cereal yield losses can be predicted three months in advance of harvest and six–eight months in advance of official grain production statistic is released. The error of production losses prediction is 7%–10%. The error of prediction drops to 3%–5% in the years of intensive droughts.

1. Introduction

Drought is a direct cause of famine killing hundreds of thousands of people and disrupting livelihood of millions in the developing countries (WFP Citation2014). The developed countries suffered from droughts as well due to an increase in the cost of population living following some shortages of food and price increase. From the most recent events, the droughts of 2010 in Russia and 2012 in the USA produced considerable local and global economic impacts. During the 2010, grain production in Russia dropped to 75 million metric tons (mmt) compared to 97 mmt in 2011. The 2012 drought in the USA costs the country's taxpayers nearly $35 billion in economic losses from considerable yield reduction, food and farmland price increase, water shortages, and fire-related ecosystem destruction. Both droughts have also led to global grain reduction, shortages of grain supply compared to the demands, lack of grain on the international market, and corresponding price increase (Goldenberg Citation2012; U.S. Drought Citation2012).

In the years when drought affects some of the major grain growing countries and regions (USA, China, India, European Union, Russia) global production falls below the consumption, the later has a high rate of increase due to a fast growing Earth population. In 8 of the first 13 years of the twenty-first century, global grain production falls below the consumption (PotashCorpo Citation2012). Therefore, assessment of drought-related losses of grain well in advance of harvest in the principal producing countries is an important task not only for that country but also for prediction of the global supply/demand, import/export, prices and evaluation of assistance to the developing nations. The goal of this paper is to investigate the potential of modelling commercial grain production of several grain crops (cereals) using space observations and estimation of drought-related losses of cereals in advance of harvest. Russia was selected for this experiment providing cereal production data and USA provided operational satellite data. Saratov oblast' (SO, administrative province) was selected as one of the important grain producing districts in the Lower Volga region located in the southeastern European Russia ().

Figure 1. Geographic location of Saratov oblast (SO), Russia: cereal crops are growing all over and VH satellite data were collected from the black area.

Figure 1. Geographic location of Saratov oblast (SO), Russia: cereal crops are growing all over and VH satellite data were collected from the black area.

2. Grain in Russia and environment

Russia is the fourth largest grain producing country in the world after China, India, and USA. Grain harvested in Russia is entirely consumed inside the country. However, since the demands for grain are higher than the supplies, Russia purchases on the average nearly 3 million metric ton (mmt) of grain annually from the international market. In some years, when large-scale drought causes huge crop losses, this amount is doubled as it was in 1999 and 2000, when the purchases reached 7.8 and 5.8 mmt, respectively (FAO Citation2012). Therefore, advanced knowledge about the amount of grain collected in a particular year is very important not only for estimation of domestic grain use, but also projections of global supply and demand, international trade and consequently food security.

Southern Russia is the main grain basket in the country producing nearly 70% of high quality grain. Agricultural land of SO covers nearly 9 million hectares (ha), with 6.1 million ha occupied primarily with grain crops, which fields are traditionally huge and have not changed over years in order to receive maximum production with the best grain quality (Voeikov & Gortsev Citation2014; Zernoimport Citation2014). This region has fertile, well drained and nutritional Chernozem soils, which are good for cultivation of grain crops such as wheat, barley, oats, and corn. However, climate of this area is semi-arid, and the shortage of water puts considerable constraint on grain production. The situation is complicated by hot weather accompanied by desiccative winds (suhovei). Moreover, the unusually light snow cover does not make up for summer moisture deficit. Although this area has a few rivers, the area of commercial grain crops is so large that the amount of available water is not sufficient to offset dry climate. Therefore, grain crops are generally not irrigated. In drought years, up to 40% of grain production might be lost in that area.

One of the major geographic features of SO is Volga river, which divides the territory into two parts. The western part, (the right bank, of the river, excluding south), belongs to forest-steppe zone and the eastern part (the left river bank) belongs to steppe zone. The amount of grain produced each year in SO depends on the amount of natural water supply. Annual amount of precipitation (Pr) in SO is not large, changing from 500 mm in the north to 300 mm in the south. Most precipitation falls during the warm period. Moreover, since summer is hot, potential evapotranspiration (PET) in this area is large exceeding the amount of precipitation. As a result, the natural annual water balance (Pr-PET) is 200–400 mm short. Ninety per cent of this deficit occurred during warm period. The water shortage is usually accompanied by hot weather, droughts, and desiccative wind (suhovei). Droughts are typical for this region, affecting crops severely as it was in the past 30 years of the twentieth century (1972, 1975, 1980, 1984, 1995, and 1998). Light snow cover in this region does not offset the spring and summer moisture deficit (Kogan Citation1983).

Since climate controls grain losses, weather data are normally used for the modelling of individual grain crop production and prediction of losses. However, the weather-station network in Russia is not dense enough for accurate assessments of weather impacts, considering a very large area under grain, the number of farms growing crops, and high variability of rainfall. Moreover, since disintegration of the Soviet Union in the 1990s and deterioration of economic and social situation, the number of weather-observing stations in Russia reduced three times. Therefore, satellite data were used as proxy for weather variables in this area. These data were used before to model and monitor individual grain crops (wheat, corn, soybeans, and other) but have never been used for a group of crops, called cereals and grown in spring-summer (Liu & Kogan Citation2002; Dabrowska-Zielinska et al. Citation2002; Domenikiotis et al. Citation2004; Kogan et al. Citation2012).

3. Data

This study employs satellite and in situ data. In situ data were presented by production in tons (t), area in hectares (ha), and yield (Y, t/ha) for spring-planted cereals, which included wheat, barley, oats, corn, and pulses. Spring wheat, barley, and oats occupy 90% of the area. All crops are planted in late April early May and are harvested in August–September (USDA Citation1994). Therefore, cereal analysis in this research is represented by these three crops. The data were obtained from the Russia's Central Statistical Administration. Total regional production and area were summarized from the reports of all farms to SO level at the end of agricultural year; SO yield was calculated by dividing production by area.

Satellite data were collected from the National Oceanic and Atmospheric Administration (NOAA) Global Vegetation Index (GVI) data set during 1981–2012. Spatial and temporal data resolution was 16 km and 7-day composite, respectively (Kidwell Citation1997). Since grain crops occupy slightly over 6 million ha in SO, satellite data were aggregated from around 1100 (hundred times exceeding the number of weather stations used for spatial aggregation of SO precipitation and temperature) 16-km pixels in the SO area shown in . Such a large number of pixels are sufficient for characterizing average SO land cover conditions (Kogan et al. Citation2003). The GVI-based counts in the visible (VIS), near infrared (NIR), and infrared (channel 4, 10.3-11.3 µm) spectral regions were used in this research because they characterize conditions of vegetation, including crops. The first two changes with chlorophyll and water content in green vegetation and the last one helps to monitor thermal conditions. Pre-launch calibration was applied to VIS and NIR counts to convert them to reflectance, and post-launch calibration was applied to correct reflectance for sensor and orbit degradation (Kidwell Citation1997). Finally, these data were used to calculate normalized difference vegetation index (NDVI = (NIR - VIS)/(VIS + NIR)) and convert Ch4 counts to brightness temperature (BT).

4. Methodology

Both yield and satellite data time series were processed in the following way:

4.1. Yield

Yield time series of any crop over a period of more than 15 years can be approximated by the following equation (Obukhov Citation1949):(1) where Yn is actual yield; Ŷn is a slow-changing yield's deterministic component controlled by the agricultural technology (breeding, mechanization, fertilizers, weeds control, etc.); dYn is yield's random component regulated by weather fluctuation from year-to-year; n is the number of years in yield time series.

The technology-related yield trend (Ŷn) can be approximated by a polynomial (either linear or non-linear depending on longevity of yield series), regressing Y against year number. For yield time series longer than 20 years, non-linear polynomial approximation (for example, quadratic polynomial) is regularly used in order to represent appropriately the rate of yield growth and impacts of the interaction between climate and agricultural technology on yield (Fisher Citation1922; Obukhov Citation1949).

At a general background of the slow-moving technology-driven trend, fluctuations of yield around the trend are mainly explained by weather variations from year to year. In years of favourable weather for crop growth, yield exceeds the level estimated by the trend line and in the opposite situation of unfavourable weather yield drops below the trend. Following these considerations, dYn is normally expressed as a ratio or difference (for a shorter time series). In this research, we used the ratio, characterizing deviation from agricultural technology related yield trend. This method allowed us to represent adequately variation of the yield around trend at the beginning and at the end of 20-year time series (Fisher Citation1922; Obukhov Citation1949).(2)

4.2. VH indices from the advanced very high resolution radiometer (AVHRR)

The principle for constructing these indices stems from the properties of green vegetation to reflect visible and emit thermal solar radiation. If vegetation is healthy, it reflects little radiation in the VIS part of solar spectrum (due to high chlorophyll absorption), much in the NIR part (due to scattering the light by leaf internal tissues and water content), and emit less thermal radiation in IR spectral bands (because the transpiring canopy is cooler). As the result, NDVI becomes large and BT small. Oppositely, for unhealthy vegetation NDVI becomes smaller due to reduced chlorophyll (less absorption and greater reflection) and water content (greater reflection) and BT larger because vegetation surface becomes hotter following reduced transpiration (Cracknell Citation1997).

It is important to emphasize that NDVI and BT quantify both spatial difference between productivity of different ecosystem (ecosystem component, EC) and weather-related variations in each ecosystem (weather component, WC). The EC is normally influenced by long-term environmental factors such as climate, soils, topography, landscape, geology, etc., while the WC is controlled by weather fluctuations in each ecosystem. Since weather-related NDVI and BT variations are much smaller than variations due to ecosystem differences, separation of WC (by calculating deviation of NDVI (BT) from their climatology) is an important procedure to do prior to comparing their each year weekly variations with each year yield fluctuations (Kogan Citation1997).

Following these considerations, the Vegetation health (VH) indices calculated from NDVI and BT were introduced. Details of the algorithm are presented in Kogan Citation(1997). Here, the important steps are indicated. They include (a) complete elimination of high frequency noise from NDVI and BT annual time series, (b) approximation of annual cycle, (c) calculation of multi-year climatology, and (d) estimation of medium-to-low frequency fluctuations in NDVI and BT associated with weather variations (expressed as departures from seasonal cycle climatology). The VH indices were represented by NDVI-based Vegetation Condition Index (VCI), BT-based Temperature Condition Index (TCI) and Vegetation Health Index (VHI), aggregating VCI and TCI together. The equations (Equation3Equation5) provide these approximations.(3) (4) (5) where NDVI, NDVImax, and NDVImin (BT, BTmax and BTmin) are no noise (statistically smoothed) weekly NDVI (BT) and their 1982–2003 absolute maximum and minimum representing climatology of these indices, respectively; a is a coefficient quantifying a share of VCI and TCI contribution in the VHI. Since this share is generally not known for a specific location and time of the year, it was assumed that VCI and TCI contributions are equal (a = 0.5).

The values of these indices change from 0 to 100. Zero quantifies severe vegetation stress (moisture-based from VCI and thermal-based from TCI) associated with the lowest weekly NDVI (BT) values, which correspond to the 1982–2006 absolute minimum of weekly climatology (NDVImin and BTmin). The VH value 100 quantifies very favourable conditions or the highest weekly NDVI (BT) corresponding to the 1982-2006 absolute maximum value of weekly climatology (NDVImax and BTmax; Kogan Citation1995). The application of these indices as a weather proxy in a few countries (Poland, Brazil, Argentina, Zimbabwe, Morocco, Russia, India, Mongolia) showed that they correlate highly with productivity of crops and pastures and can be used as numerical weather indicators of agricultural losses in advance of harvest (Kogan Citation1997, Citation2002; Kogan et al. Citation2005; Liu & Kogan Citation2002; Dabrowska-Zielinska et al. Citation2002; Unganai & Kogan Citation1998). Weekly area-average indices (VCI, TCI and VHI) were calculated for the entire SO cereals growing area indicated in Atlas Citation(1960) and USDA (Citation1994). This area was inside the coordinates 42.5 − 51.5 N, 50.5 − 53.0 E. Mean values of VH indices were calculated from 1102 16-km pixels.

4.3. Statistical procedures

Since dY yield and VH indices were similarly expressed as a deviation from climatology (from trend for cereals and from max-min for VH), correlation and regression analysis were used to: select the index type, time of their strongest impacts on dY and develop dY-VH models to be used for assessment of yield losses. The data selected for statistical analysis included 20 years of cereal yield (derived after harvest time) and 20 years of VH data for 52 weeks in each year (total 1040 images were used to calculate mean spatial value of each index for the entire period). The Pearson correlation coefficients (CC) were used as a criterion to initiate this selection. Since the number of variables is large (considering three indices and 30-week intervals for each) and they are inter-correlated (collinear), two methods of variable (predictors in statistical model) reduction were applied (a) partial correlation coefficients (Snedecor Citation1965) and (b) weighting variable contribution and aggregating them to one variable following Obukhov Citation(1949).

The partial correlation coefficients (PCC) method estimates the contribution of each independent variable when other variables are fixed at the average level (Snedecor Citation1965). The PCC indicator is an important criterion to use for the selection of predictors, which are collinear with others and have the largest contribution to variability of a dependent variable (predictant). Models, which include collinear variables, become statistically unstable and less accurate. In order to improve such model's stability, the variables with the lowest PCC should be removed. The procedure of predictors' selection consists of several steps. In the first one, a dY-VH model is built based on all weeks with the highest CC for a particular index. In the following steps, the predictors with the lowest PCC are removed and for the remaining predictors PCCs are re-estimated. This procedure is finalized when the remaining predictors have PCC ≥ 0.6.

After PCC-based predictors' selection is completed, the method (b) is applied: predictors from several weeks are aggregated into one variable with the weights calculated from CC values using approximation in equation (Equation6). These weights are applied to the corresponding variable following Equationequation (7), showing for VCI example.(6) (7)

In equations (Equation6) and (Equation7), W – weight, CC – correlation coefficient, CC2 – determination coefficient, i and n – first and last weeks of predictors' aggregation, respectively, k – the new aggregated variable.

4.4. Validation

An important step in building statistical models is to test model predictions outside of a training set, because if the same training data set is used the results of model performance would be very optimistic due to instability of model (Snedecore Citation1965). Since the data sample used in this study is limited, a cross-validation technique was applied for independent model verification. In this approach, a single year was left out one-by-one from the data, a model was built for each set without one year and prediction was made for the removed year. The models were evaluated based on determination coefficient (R2) for predicted (P) versus observed (O) yield values, bias ((B) - difference between (P) and (O) yields), mean B (MB = (∑B)/N), relative bias ((RB) = 100*(B)/(O)), and Mean bios error (MBE = (∑(B)2/N), both systematic (SMBE) and non-systematic (NMBE). The SMBE and NMBE estimate what portion of MBE is unaccounted (due to predictors not included in the model) and accounted (due to included predictors), respectively (Willmott Citation1982).

5. Results

shows mean SO cereal yield (t/ha) and its linear trend during 1971–2001. The trend was approximated from the 10-year longer than modelling data set in order to avoid errors at the ends of shorter-period time series (Snedecore Citation1965). Following Equationequation (1), trend was approximated as Ŷn = 1.08 + 0.000013*year; dYn was estimated from Equationequation (2). Average VCI, TCI, and VHI for SO area (USDA Citation1994) were calculated for each year of the 1982–2001 period from 1102 16-km pixels. It is important to mention that the time series of indices (not shown) did not experience any trend due to a comprehensive processing procedure applied to satellite data. Since these indices represent weakly deviation from their climatology for each year, they were further correlated with the mean SO cereal yield anomaly.

shows dynamics of the Pearson correlation coefficient for the end of season dYn (the numbers for the collected yield are normally available in January–February of the year following crop harvesting) with the indices (upper) and with SO area mean total monthly precipitation (dP) and mean temperature (dT) anomaly. As seen, the dynamics for three indices and weather parameters is smoothed and indicates near zero correlation prior to crops planting and high correlation in the middle of the growing season.

demonstrates five steps of predictors' selection for the dYn – TCI and VHI models based on the PCC values. Principally, the lowest PCC are eliminated from the models (steps 1–4) and the final model is built when PCC > 0.6 for all predictors, which is shown in step 5. results were used to develop four models with all predictors (Equationequations (8) and (Equation9)) selected in and cumulative predictors (Equationequations (10) and (Equation11)) following the equations (Equation6) and (Equation7) approximation. Finally, and demonstrate independent models' validation following (Willmott Citation1982). and illustrate drought area and intensity during 2010–2013. These years were unusual, especially 2010 with almost 100% drought area; in two other years, drought area was estimated at 40% but thermal stress in spring of 2013 covered almost 80% of SO area.

Table 1. Selection of TCI and VHI predictors for cereal's dY models based on partial correlation coefficients (PCC), Saratov oblast, Russia.

Table 2. Statistics of an independent testing of cereal models (1982--2001), Saratov oblast, Russia.

6. Discussion

6.1. Yield trend

The cereal trend () indicates that from the 1970s, agricultural technology-related yield did not increase and remained at the level of 1.1 t/ha. This means that the technology applied to spring grain crops in SO does not stimulate yield growth. Some of the causes of long-term yield hiatus are deterioration of environmental resources (exhaustion of soil fertility, destruction of upper soil layer by heavy machinery, weeds in the fields, etc.) as the result of their uncontrollable exploitation over a long period of time, worsening of the economic situation, ill policies, etc. Besides, deficient agricultural technology, shortage of water, and frequent droughts limit agricultural production even more. At the background of flat yield tendency, weather-related fluctuations of yield around the trend (dY) remained large. As seen in , the extreme deviations of cereal yield from the trend in SO were in 1975 (below the trend) and 1978 (above the trend) with dY estimated fraction at 0.34 versus 1.98, respectively. These values can be interpreted as yield reduction by 66% in 1975 due to unfavourable weather (64% reduction in mean SO summer rainfall) and increase by 98% in 1978 due to favourable weather (48% above climatic norm of SO spatial mean summer rainfall (Dronin & Bellinger Citation2005)).

Figure 2. Cereal yield time series in SO, Russia during 1972–2001.

Figure 2. Cereal yield time series in SO, Russia during 1972–2001.

6.2. Correlation of dY with VH indices and weather variables

shows correlation coefficient (CC) dynamics of dY versus VCI (moisture condition), TCI (thermal condition) and VHI (vegetation health) between weeks 10 (mid-March) and 40 (end of October). Our interests were to (a) investigate the strength of this relationship and (b) whether the strongest correlation coincides with the cereals’ critical period (the period of the highest crops’ requirements to moisture and thermal conditions) and (c) if dY-VH correlation matches with dY-weather correlation (both time and intensity).

Figure 3. Dynamics of correlation coefficient for Saratov oblast's cereal yield departure from trend with (a) SO spatial mean VCI, TCI, and VHI and (b) with SO spatial monthly mean precipitation and temperature anomaly (deviation from climatic normal).

Figure 3. Dynamics of correlation coefficient for Saratov oblast's cereal yield departure from trend with (a) SO spatial mean VCI, TCI, and VHI and (b) with SO spatial monthly mean precipitation and temperature anomaly (deviation from climatic normal).

As seen in , in the early spring, which is cereals’ pre-planting, planting, and emergence (weeks 10–15, March–April), the correlation for all indices is near zero. It increases sharply during the green biomass accumulation in April and May (weeks 17–22). Thermal conditions during this period show much stronger correlation. It means that they play greater role because at the general background of year-to-year moisture deficit in SO, productivity of cereals depends much on air temperature: if weather is hot, crops become stressed thermally and the end-of-season production is reduced; oppositely, cooler weather stimulates higher production. The maximum correlation in occurs in June–July (weeks 26–30) for VCI and early June (weeks 23–25) for TCI. The timing of these maximums coincides with the critical period of spring-planted cereals’ development when they experience enhanced sensitivity to weather conditions: yield is below trend for VH indices below 40 and above trend if VH > 60.

It is important to emphasize, that correlation with in situ data (departure from climatology of total monthly precipitation (dP) and mean monthly temperature (dT)) shows similar to satellite data results. Correlation coefficients' time series of dY versus dP and dY versus dT in (Kogan 1986) are complimentary (in timing and strength of the correlation) to dY versus VH correlation dynamics. Differences in the sign of the correlation for temperature anomaly (negative) and TCI (positive) is because TCI was designed to reflect a reverse temperature scale (from high to low, Equationequation 4), while in situ temperature reflect changes from low to high.

Although the correlation dynamics in were in line with cereals’ response to weather, it should be indicated that even for the weeks of the highest correlation, VCI explained only 15%–30% of dY variance (CC = 0.36–0.41, ), for TCI from 35% to 50% (CC = 0.47–0.75). The relationship of yield anomaly with VHI was the strongest ( and ) explaining up to 64% of dY variance (CC = 0.68–0.80). The largest correlation of dY with VHI emphasizes the importance of combining VCI (moisture conditions) and TCI (thermal conditions) together even if their contribution to VHI was approximated as equal (Equationequation 5).

Both methods (a) PCC and (b) Equationequations (6Equation7) were applied for selection of variables (predictors) for dY-VH regression models. presents the 4–6 steps process of the method (a) for removing weekly predictors with the lowest PCC values from TCI and VHI models. For example, in the first step for TCI model, a regression equation was developed for all variables (weeks 22–29) with the highest CC (the top line for each index). The PCC values in the first step indicated that TCI23 has the lowest contribution (PCC = 0.15) to the model; this variable was removed from the model and PCC were re-estimated for the remaining variables. Similarly, in the following steps, PCC values indicated that TCI 25, 22, and 24 do not provide statistically significant contribution (PCC = 0.42, 0.47, and 0.30, respectively) as predictors of dY (due to their collinearity with other predictors) and they were removed from the models step-by-step. Finally, after the TCI24 (PCC = 0.30 in step 4) was removed from the model, the remaining variables (TCI26–TCI29) had the highest values of PCC (0.75–0.81). These variables were included in dY model, Equationequation (8). Similarly, dY–VHI model in Equationequation (9) included VHI26–VHI29 variables (PCC = 0.71–0.80).

The estimates for Equationequations (8) and (Equation9) are statistically significant: the models' Multiple CC (MCC) is large; 0.83 and 0.86, respectively, R2 explains 69% and 74% of the dY variance, respectively, N is the number of data records, and DF is degree of freedom. The underlined variables and all MCC are statistically significant at 1% and the remaining at 5%.(8) (9)

Meanwhile, small DF values indicate that further reduction of variables in these equations is needed. Therefore, the method (b) was applied to equations (Equation8) and (Equation9). To improve the stability of regression coefficients (Snedecor Citation1965), the number of independent variables was reduced to one only resulting in the maximum DF. Following the transformation in equations (Equation6) and (Equation7), Equationequations (8) and (Equation9) were re-written as:(10) (11)

The MCC, R2, and DF in equations (Equation10) and (Equation11) are statistically sound, however, not much different from models (Equation8) and (Equation9). Therefore, at the final step, all Equationequations (8)–(Equation11)) were tested independently.

6.3. Independent validation

The statistics of an independent testing of (P) versus (O) cereal yield in SO is shown in . Following R2 and MBE values, the best models were (Equation9) and (Equation11), which are based on VHI variables (both weighted and non-weighted). This again emphasizes importance of combining together VCI (calculated from NDVI and estimating moisture conditions) and TCI (calculated from BT and estimating thermal conditions). Although the R2 for these equations are slightly different, Equationequation (11) (weighted VHI) has 7% smaller systematic MBE (0.009 versus 0.012) and larger non-systematic error (0.052 versus 0.042), which is an important indicator of how good might be model performance. It is important to mention that in the best models SMBE should approach to zero while NMBE should approach to MBE (Willmott Citation1982).

Finally, two points should be emphasized: (a) the independently validated models () have relatively high R2 (0.518–0.598), although it is smaller than the R2 (0.69–0.81) from the Equationequations (8)–(Equation11), what means that the R2 alone cannot be considered as a complete and reliable indicator of a well-performing model; the MBE and especially SMBE should always be estimated for this goal; (b) in April 2013, we received additional Saratov cereal yield data for the period 2002–2012 (11 years) and tested Equationequation (11) during this completely independent period of simulation. As seen in , Equationequation (11)-based simulated P and observed cereal yield match quite well: R2 = 0.611, MB = −0.029, MBE = 0.082, SMBE = 0.013 (16%), and NMBE = 0.069 (84%). The completely independent model verification showed quite similar to statistical evaluation.

Figure 4. Observed and independently simulated (Equationequation 11) SO cereal yield during 2002–2012.

Figure 4. Observed and independently simulated (Equationequation 11(11) ) SO cereal yield during 2002–2012.

Losses of cereals were independently estimated quite accurately during 2010–2012, especially in 2010, when drought reduced cereal yield to 0.4 t/ha from the general trend level for that year of 1.2 t/ha (FAO Citation2012). Comparison of 2010-2012 droughts in European Russia is shown in and . The drought of 2010 was high intensity and covered large area compared to other two years (). In the middle of summer of all three years, SO experienced thermal stress (TCI < 40). This stress was extreme in 2010 when TCI was below 20, for almost three months and near zero (exceptional intensity) for 1.5 months, while in the other two years the stress was weaker (TCI around 25) and shorter, continued less than two weeks ((a)). Thermal stress of severe-to-exceptional and exceptional (the strongest) intensity affected 100 and nearly 50% of SO area, respectively ((b)). In 2011 and 2012, the area of similar intensity thermal stress was much smaller

Figure 5. Drought area and intensity for the European Russia and western Kazakhstan in 2010–2012 (Rectangle indicates SO location).

Figure 5. Drought area and intensity for the European Russia and western Kazakhstan in 2010–2012 (Rectangle indicates SO location).

Figure 6. Saratov oblast VHI-based thermal stress dynamics (a) and per cent SO area under droughts of severe-to-extreme and exceptional intensity (b).

Figure 6. Saratov oblast VHI-based thermal stress dynamics (a) and per cent SO area under droughts of severe-to-extreme and exceptional intensity (b).

7. Conclusions

This paper discusses application of satellite-based globally universal VH technique, used originally to monitor VH and drought detection, for statistical modelling of crop yield. Previously this technique was applied and showed good results to model individual crops (wheat, corn, sorghum, rice, etc.). In this study, the VH proxies were applied to a combination of grain crops, called cereals, which included wheat, rye, barley, oats, corn, and pulses. The region was Saratov oblast, one of the major producers of grain in the southern European Russia. The developed models were quite accurate and reliable in prediction of cereal yield 3–4 months in advance of harvest and 6–8 months before official statistics of grain harvest is released. From the three indices characterizing moisture (VCI), thermal (TCI) and VHI conditions, the last two were very good predictors of cereal yield in SO, especially yield losses as it was in 2010. Further investigation of yield losses predictors might include combining satellite data with weather data, specifically during winter and early spring when vegetation is still dormant and the application of VH indices is limited. The VH indices and data are delivered every week to http://www.star.nesdis.noaa.gov/smcd/emb/vci/VH/index.php. They estimate and display global and regional vegetation health, moisture and thermal conditions, drought, and fire risk. They also discuss climate issues and VH utility in global observing system. Finally, this method will be considerably improved with observations from the new generation of operational satellite, called Suomi NPP (S-NPP) with many advanced sensors on board. The new Visible Infrared Imager Radiometer Suite (VIIRS) has started to provide radiance measurements with much higher spatial resolution (375 m) and four times more spectral bands (compared to its predecessor). Besides, VIIRS is currently providing exceptional data quality due to on board calibration of visible channels, narrow response function, sharper view and consequently, better quality NDVI, new vegetation indices (from mid to IR channels), and other products (net primary production, leaf area, vegetation fraction, and others).

Disclosure statement

No potential conflict of interest was reported by the authors.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.