624
Views
3
CrossRef citations to date
0
Altmetric
Research Article

Suitability of satellite-based hydro-climate variables and machine learning for streamflow modeling at various scale watersheds

ORCID Icon & ORCID Icon
Pages 2233-2248 | Received 19 Jul 2019, Accepted 29 Apr 2020, Published online: 31 Jul 2020

Figures & data

Figure 1. Map of the study area showing the Upper Mississippi River Basin (top) and its subwatersheds (bottom left and right). Stars indicate streamflow gauging stations used in this study. Note that the subwatershed downstream of the gauging stations are excluded in the models (shown by lighter colors). The sub-watershed IDs are labelled according to USGS Watershed Boundary Dataset (WBD) Hydrologic Units (HU). USGS gauge numbers are given for the watershed outlets.

Figure 1. Map of the study area showing the Upper Mississippi River Basin (top) and its subwatersheds (bottom left and right). Stars indicate streamflow gauging stations used in this study. Note that the subwatershed downstream of the gauging stations are excluded in the models (shown by lighter colors). The sub-watershed IDs are labelled according to USGS Watershed Boundary Dataset (WBD) Hydrologic Units (HU). USGS gauge numbers are given for the watershed outlets.

Table 1. Summary of watershed characteristics of the study sites.

Figure 2. Summary of data sources and processing applied on the dataset used in the MLM. Processed variables were resampled according to the HUs of each study site.

Figure 2. Summary of data sources and processing applied on the dataset used in the MLM. Processed variables were resampled according to the HUs of each study site.

Figure 3. Decision tree processes (a) showing an example of covariant space where Y (predictand or target) is explained by X1 and X2 (predictors). The covariant space can be approximated by rectangular spaces (b) and (c), each representing the first node and entire nodes of the decision tree (d).

Figure 3. Decision tree processes (a) showing an example of covariant space where Y (predictand or target) is explained by X1 and X2 (predictors). The covariant space can be approximated by rectangular spaces (b) and (c), each representing the first node and entire nodes of the decision tree (d).

Figure 4. Schematic diagram of the boosting process in the boosted regression tree (BRT) method.

Figure 4. Schematic diagram of the boosting process in the boosted regression tree (BRT) method.

Figure 5. Conceptual diagram of the design of training data.

Figure 5. Conceptual diagram of the design of training data.

Figure 6. Workflow and data requirements for training and testing in this study.

Figure 6. Workflow and data requirements for training and testing in this study.

Figure 7. Streamflow model performance according to K-fold numbers.

Figure 7. Streamflow model performance according to K-fold numbers.

Table 2. Performance metrics for training and testing.

Figure 8. Scatterplots of model-simulated vs observed streamflow for training: (a) RRW, (c) IRW, and (e) UMRB; and testing: (b) RRW, (d) IRW, and (f) UMRB.

Figure 8. Scatterplots of model-simulated vs observed streamflow for training: (a) RRW, (c) IRW, and (e) UMRB; and testing: (b) RRW, (d) IRW, and (f) UMRB.

Figure 9. Time series plots of model-simulated streamflow and observed streamflow for the entire study period for (a) UMRB, (b) IRW, and (c) RRW.

Figure 9. Time series plots of model-simulated streamflow and observed streamflow for the entire study period for (a) UMRB, (b) IRW, and (c) RRW.

Figure 10. Relative importance of predictor variables for each watershed: (a) RRW, (b) IRW, and (c) UMRB.

Figure 10. Relative importance of predictor variables for each watershed: (a) RRW, (b) IRW, and (c) UMRB.

Figure 11. Map showing the relative contribution of sub-watersheds in simulating the MLMs for each watershed. Darker shading shows high relative importance and lighter shading shows low or no importance.

Figure 11. Map showing the relative contribution of sub-watersheds in simulating the MLMs for each watershed. Darker shading shows high relative importance and lighter shading shows low or no importance.

Table 3. Comparison of streamflow estimation performances between existing process-based model results (SWAT) and machine learning modellng (MLM) for each study site.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.