Research Article

Automatic water-level class estimation from repeated crowd-based photos of streams

Pages 1826-1840 | Received 05 Dec 2022, Accepted 22 Jun 2023, Published online: 16 Aug 2023

Figures & data

Figure 1. Photographs illustrating the principle of the virtual staff gauge approach used in the CrowdWater project. The first observer at a location (i.e. spot) takes a photo that shows the water surface and a reference object (in this case a pile of rocks) and inserts the virtual staff gauge (a sticker) in the photo, in such a way that class 0 (blue line) is on the water surface (a). The same observer or other observers can take photos of the spot at a later time and determine the new water-level class by looking at the position of the water surface relative to the reference objects (b–g) and the virtual staff gauge on the initial photo (a). Photos and data from https://www.spotteron.com/crowdwater/spots/23445.

Figure 2. Schematic representation of the dataset construction. The dataset consisted of pairs of photos: the initial photo with the virtual staff gauge and a photo taken at the same spot at a later time (see Fig. 1). The citizen scientist who submitted the second photo also submitted a value for the water-level class. The photo pairs were then evaluated in the CrowdWater game, where multiple citizen scientists cast their vote on the water-level class for the second photo. The median of these votes was used as the label for the classification task, while the classes with the highest and second-highest number of votes were combined into a higher resolution class that was used as the label for the regression task (Equations (1) and (2): $C_{\text{high-resolution}} = C_{\max} - \frac{N_{\text{second-max}}}{N_{\max} + N_{\text{second-max}}}$ if $C_{\max} \ge C_{\text{second-max}}$, and $C_{\text{high-resolution}} = C_{\max} + \frac{N_{\text{second-max}}}{N_{\max} + N_{\text{second-max}}}$ if $C_{\max} < C_{\text{second-max}}$). For both tasks, the data for the very high and very low water-level classes were merged in the reclassification process to reduce the number of classes for which there were very few observations. The dataset was divided into a training set and a test (or validation) set, with 80% and 20% of all photos for each spot and class, respectively.
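The two labels can be made concrete with a short sketch. The following Python snippet is illustrative only (not the authors' code): it derives the classification label (median vote) and the regression label (Equations (1) and (2)) from a hypothetical mapping of water-level classes to the number of votes they received in the CrowdWater game.

```python
import statistics

def labels_from_game_votes(votes):
    """Derive the two labels used in this study from CrowdWater game votes.

    `votes` maps a water-level class (e.g. -4 to 4) to its number of votes;
    at least two classes are assumed to have received votes. Returns
    (classification_label, regression_label): the median vote and the
    higher resolution class from Equations (1) and (2), respectively.
    """
    # Classification label: median over all individual votes.
    all_votes = [c for c, n in votes.items() for _ in range(n)]
    c_median = statistics.median(all_votes)

    # Regression label: combine the classes with the most and second-most votes.
    (c_max, n_max), (c_second, n_second) = sorted(
        votes.items(), key=lambda kv: kv[1], reverse=True)[:2]
    if c_max >= c_second:                     # Equation (1)
        c_high_res = c_max - n_second / (n_max + n_second)
    else:                                     # Equation (2)
        c_high_res = c_max + n_second / (n_max + n_second)
    return c_median, c_high_res

# Example: 7 votes for class 1 and 3 votes for class 2
# -> median 1, higher resolution class 1 + 3/10 = 1.3
print(labels_from_game_votes({1: 7, 2: 3}))
```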

Figure 3. (a) Number of photos for the different water-level classes after reclassification to nine classes, and (b) the cumulative probability curve for the higher resolution water-level class data (see Equations (1) and (2)). These frequency curves show the data for all 385 spots combined.

Figure 4. Schematic illustration of the steps used in model training and evaluation in this study. In the training stage, the model used photo pairs from the training dataset (80% of all photo pairs) as input, predicted the water-level class, calculated the loss function based on the true values, and updated the model parameters using different training strategies (semi-trained or fully-trained). In the evaluation stage, the trained model predicted the water-level class for the photo pairs from the test dataset (20% of all photo pairs), and the results were compared to the true water-level class (i.e. water-level class from the game) to calculate the evaluation metrics. These metrics were also calculated for the water-level class provided by the citizen scientist who uploaded the photo.
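The caption distinguishes a semi-trained and a fully-trained strategy without defining them here. A plausible reading, assumed for the sketch below, is that the semi-trained variant freezes the pretrained feature-extraction layers and only updates the classifier, whereas the fully-trained variant updates all parameters. The `features` and `classifier` attribute names follow the architecture sketch after the Fig. 5 caption below and are illustrative.

```python
import torch

def make_optimizer(model, strategy="semi-trained", lr=1e-4):
    """Build an optimizer for one of the two training strategies (assumed
    meaning, see note above): freeze the pretrained backbone for
    'semi-trained', train every parameter for 'fully-trained'."""
    if strategy == "semi-trained":
        for p in model.features.parameters():   # keep pretrained conv layers fixed
            p.requires_grad = False
        trainable = model.classifier.parameters()
    else:                                        # "fully-trained"
        trainable = model.parameters()
    return torch.optim.Adam(trainable, lr=lr)
```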

Figure 5. Set-up of the model used in this study. The model is composed of two parts: the feature extraction module and classifier module. The initial photo with the virtual staff gauge and the photo taken at the same spot at a later time go through the feature extraction module separately. The extracted features are subsequently concatenated and processed in the second module to accomplish the classification or regression task. The feature extraction module follows the VGG 19 model and is constructed by stacking multiple convolution layers and max pooling layers. The classifier module mainly depends on a series of fully-connected layers. The ReLU function was used after each convolution layer. The height, width and channel number of the input images and vectors in each step are indicated in the figure.
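The two-input set-up in Fig. 5 can be sketched in PyTorch roughly as follows. The classifier layer sizes, the shared weights for the two photos, and the ImageNet initialization are assumptions for illustration; only the overall structure (VGG 19 feature extraction, concatenation of the two feature vectors, fully-connected classifier with ReLU activations) follows the caption.

```python
import torch
import torch.nn as nn
from torchvision import models

class WaterLevelNet(nn.Module):
    """Sketch of the two-input model in Fig. 5: a VGG 19 feature extractor
    applied to both photos, followed by fully-connected layers. Layer sizes
    of the classifier are illustrative, not taken from the paper."""

    def __init__(self, n_classes=9, regression=False):
        super().__init__()
        # VGG 19 convolution + max-pooling stack as the feature extraction module.
        self.features = models.vgg19(weights="IMAGENET1K_V1").features
        self.pool = nn.AdaptiveAvgPool2d((7, 7))
        out_dim = 1 if regression else n_classes
        self.classifier = nn.Sequential(
            nn.Linear(2 * 512 * 7 * 7, 4096), nn.ReLU(),
            nn.Linear(4096, 1024), nn.ReLU(),
            nn.Linear(1024, out_dim),
        )

    def forward(self, reference_photo, new_photo):
        # Both photos pass through the feature extraction module separately.
        f1 = self.pool(self.features(reference_photo)).flatten(1)
        f2 = self.pool(self.features(new_photo)).flatten(1)
        # Concatenate the two feature vectors and classify (or regress).
        return self.classifier(torch.cat([f1, f2], dim=1))
```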

Figure 6. Schematic representation of the convolution and max-pooling operation in the CNN model.
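A minimal numerical example of the convolution and max-pooling operations sketched in Fig. 6 (toy input and kernel values chosen arbitrarily, not taken from the model):

```python
import torch
import torch.nn.functional as F

# Toy 1-channel 5x5 "image" and a 2x2 kernel, reshaped to
# (batch, channels, height, width) as expected by conv2d.
x = torch.tensor([[1., 2., 0., 1., 3.],
                  [0., 1., 3., 1., 0.],
                  [2., 1., 0., 0., 1.],
                  [1., 0., 1., 2., 2.],
                  [0., 3., 1., 0., 1.]]).reshape(1, 1, 5, 5)
kernel = torch.tensor([[1., 0.],
                       [0., -1.]]).reshape(1, 1, 2, 2)

feature_map = F.conv2d(x, kernel)                # slide the 2x2 kernel -> 4x4 output
activated = F.relu(feature_map)                  # ReLU after each convolution (cf. Fig. 5)
pooled = F.max_pool2d(activated, kernel_size=2)  # keep the max of each 2x2 block -> 2x2
print(feature_map.squeeze(), pooled.squeeze(), sep="\n")
```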

Figure 7. Box plots of (a) the root mean square error R and (b) the Spearman rank correlation coefficient rs of the fully-trained and semi-trained models for the classification task and for the water-level class estimated by the citizen scientist who submitted the photo. R was calculated for all 385 spots, while rs could only be evaluated for the 112 spots for which the water-level class varied over at least two classes. The median vote for a photo in the CrowdWater game was assumed to be the truth. The upper and lower boundaries of the box represent the upper (0.75) and lower (0.25) quartiles, the solid line is the median, the whiskers extend to 1.5 times the interquartile range, and the dots are outliers.
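The two evaluation metrics can be computed per spot as in the following sketch. It assumes the predicted and game-derived water-level classes for one spot are available as plain sequences; `spot_metrics` is an illustrative helper, not code from the paper.

```python
import numpy as np
from scipy.stats import spearmanr

def spot_metrics(predicted, observed):
    """Root mean square error (in classes) and Spearman rank correlation for
    the water-level class time series of a single spot. `observed` is the
    median vote from the CrowdWater game; rs is only meaningful when the
    water level at the spot varied over at least two classes."""
    predicted = np.asarray(predicted, dtype=float)
    observed = np.asarray(observed, dtype=float)
    rmse = float(np.sqrt(np.mean((predicted - observed) ** 2)))
    if np.unique(observed).size > 1:
        rs, _pvalue = spearmanr(predicted, observed)
    else:
        rs = np.nan
    return rmse, rs

# Example for one spot: the model misses one of five photos by one class.
print(spot_metrics([0, 1, 1, 2, 0], [0, 1, 2, 2, 0]))
```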

Table 1. The accuracy (A [-]) and root mean square error (R [classes]) for the test dataset for images with different water-level classes.

Figure 8. Relation between the Spearman rank correlation coefficient rs for the water-level class identified by the citizen scientists who submitted the photo (x-axis) and the semi-trained model (y-axis) for the 56 spots for which observations were available for at least two different water-level classes after excluding the class 0 observations. The median of the vote for an image in the CrowdWater game was used as the truth. The shapes of the spots in (a) indicate the variance, and in (b) they show the range of water-level class observations. The result for the one spot for which the rs is negative is not shown.

Figure 9. The water-level class based on the median vote from the game (i.e. ground truth; blue line) for all the analysed photos, and the corresponding semi-trained model predictions (red dot) and original submissions (green triangle) where they do not match the ground truth for two example spots (Location 1 (a) and 2 (b)). For visualization, only the results for the wrongly classified photos are shown; those that were predicted correctly by the model or submitted correctly by the citizen scientists are not shown (and would plot on the blue line). The photo numbers are in chronological order, but the time interval between adjacent photos differs.

Figure 10. Example photos for two spots for which the model performance was not satisfactory: photos taken (a) from a moving train resulting in a negative Spearman rank correlation coefficient rs for both the model and the original observation (photos from: https://www.spotteron.com/crowdwater/spots/40237), and (b) for a spot where the Spearman rank correlation is lower for the model than for the original submission because the sight distance and visual angle differ considerably from one photo to another (photos from: https://www.spotteron.com/crowdwater/spots/22036).

Figure 11. Box plots of (a) the root mean square error R and (b) the Spearman rank correlation coefficient rs of the water-level class time series for the classification and regression models (semi-trained). R was calculated for all 385 spots, and rs only for the 112 spots for which the water-level class varied over at least two classes. For the classification task, the photos were compared with the median vote from the CrowdWater game. For the regression task, the classes with the most and the second most votes in the game were used to derive higher resolution class data for the photos (see Fig. 2 for a schematic representation). Note that the result for the one outlier for the classification model with a root mean square error of 4.8 classes is not shown.

Figure 12. The water-level class data (Equations (1) and (2)) and regression model estimates for two spots (Locations 1 and 2). The photo numbers are in chronological order, but the time interval between adjacent photos differs.

Figure 13. Influence of the number of training photos on model performance in terms of root mean square error R (a and c) and Spearman rank correlation coefficient rs (b and d) for the classification task (a, b) and regression task (c, d). Each dot represents one spot and is coloured by the number of classes over which the water level varied in the photos for that spot.

Data availability statement

The data used in this study were collected by the citizen scientists of the CrowdWater project and can be downloaded from https://crowdwater.ch/en/data/.
