494
Views
12
CrossRef citations to date
0
Altmetric
Research Article

Propagation of radar rainfall uncertainties into urban pluvial flood modeling during the North American monsoon

, , ORCID Icon & ORCID Icon
Pages 2232-2248 | Received 15 Feb 2021, Accepted 12 Aug 2021, Published online: 29 Oct 2021

ABSTRACT

Pluvial flooding in urban regions is a natural hazard that has been rarely investigated. Here, we evaluate the utility of three radar (Stage IV, Multi-Radar Multi-Sensor or MRMS, and gauge-corrected MRMS or GCMRM) quantitative precipitation estimates (QPEs) and the Storm Water Management Model (SWMM) hydrologic–hydraulic model to simulate pluvial flooding during the North American monsoon in Phoenix. We focus on an urban catchment of 2.38 km2 and, for four storms, we simulate a set of flooding metrics using the original QPEs and an ensemble of 100 QPEs characterizing radar uncertainty through a statistical error model. We find that Stage IV QPEs are the most accurate, while MRMS QPEs are positively biased and their utility to simulate flooding increases with the gauge correction done for GCMRMS. For all radar products, simulated flood metrics have lower uncertainty than QPEs as a result of rainfall–runoff transformation. By relying on extensive precipitation and basin datasets, this work provides useful insights for urban flood predictions.

Editor A. Castellarin Associate Editor M. Batalini de Macedo

1 Introduction

Flooding is the most common natural hazard, causing large property losses and fatalities worldwide (Doocy et al. Citation2013). For example, in the United States (US), major flooding events caused an annual average of $9.1 billion in losses and 71 fatalities between 2004 and 2015 (The National Academies Press Citation2019). The impacts of flooding are particularly significant in urban regions, due to population growth concentrated in cities (Cohen Citation2006, Hossain et al. Citation2015), land-cover modifications increasing surface imperviousness (W. Zhang et al. Citation2018), and climate change potentially causing more severe precipitation extremes (Trenberth et al. Citation2003, Emanuel Citation2005, Prein et al. Citation2017). Urban areas may be impacted by pluvial, fluvial, and coastal flooding. Fluvial flooding results from a river overtopping its banks. Coastal flooding occurs when storm surge or extreme high tides inundate the shore and/or cause inland flooding through the drainage network. Pluvial flooding takes place when runoff exceeds the capacity of natural and built drainage systems to collect water and safely transport it to a receiving water body (Rosenzweig et al. Citation2018). Here, we focus on urban pluvial flooding, a process that has rarely been systematically measured and modeled despite being a significant driver of urban flooding (Rosenzweig et al. Citation2018, The National Academies Press Citation2019). While pluvial flooding is often considered “nuisance” flooding, it can result in building damage, traffic impacts, power outages, and weakened infrastructure (ten Veldhuis Citation2011).

A key resource to mitigate the impacts of urban flooding is the availability of accurate hydrometeorological forecasts with sufficient lead times. In the US, operational flood and flash food forecasts are provided by the National Weather Service (NWS), an agency of the National Oceanic and Atmospheric Administration (NOAA). These forecasts rely on quantitative precipitation estimates (QPEs) and forecasts (QPFs) generated through the integration of weather radar data, raingauge observations, and numerical weather prediction models. QPEs and QPFs are used as forcings for the Sacramento Soil Moisture Accounting (SAC-SMA) hydrologic model (Z. Zhang et al. Citation2012), which produces flood forecasts at ~3600 locations across the nation (Salas et al. Citation2018). Flash flood guidance values are also generated in certain regions of the country (D. Seo et al. Citation2013). NWS River Forecast Centers (RFCs) and Weather Forecast Offices (WFOs) use these different sources of hydrometeorological forecasts to issue flood and flash flood watches and warnings to inform the public on the potential occurrence and danger of these events (NOAA Citation2020). For urban regions where catchments have short response times and no streams, flood and flash flood watches and warnings are often based solely on QPEs, QPFs, and expert knowledge of the area.

Despite the low annual precipitation depths, urban pluvial flooding is of significant concern also in desert cities (Thakali et al. Citation2016, Saber et al. Citation2020). These include the Phoenix, Arizona, metropolitan region in the southwestern US, which experiences localized convective thunderstorms with high rain rates during the North American monsoon (NAM) summer season (Adams and Comrie Citation1997), resulting in floods and flash floods (Yang et al. Citation2017, Citation2019). Due to the combined effect of increasing urbanization and concentration of population, economic activities and infrastructure, pluvial flooding events in the Phoenix metropolitan area have been increasingly impacting transportation, electricity delivery, and properties (NOAA Citation2021). Unfortunately, operational forecasts of floods and flash floods driven by NAM convective storms are challenged by the limited ability to (i) predict the exact location, timing, and rain rates of convective storms with adequate lead time (Li et al. Citation2003, Rogers et al. Citation2017) and (ii) simulate rainfall–runoff processes in urban catchments with highly heterogeneous surface properties and presence of stormwater infrastructure (Leandro et al. Citation2016).

In this study, we contribute to improving the predictability and modeling of urban pluvial flooding caused by NAM convective thunderstorms through two main activities. First, we quantify the uncertainty of three radar-based QPE products in the Phoenix metropolitan area during the NAM season (July to October of years 2015–2019). This is a necessary preliminary step to validate and improve both QPEs and QPFs used for hydrologic predictions. The QPEs include the National Centers for Environmental Prediction (NCEP) Stage IV analysis product (Y. Lin Citation2020), and the Multi-Radar Multi-Sensor (MRMS) system product and its gauge-corrected version (GCMRMS) (J. Zhang et al. Citation2011, Citation2016). They all rely on observations of S-band weather radars of the Next Generation Weather Radar (NEXRAD) network that are merged with other data sources that depend on the product. Radar-derived rainfall estimates are affected by errors in (i) reflectivity measurements, (ii) relations converting reflectivity into rainfall rate, and (iii) geometry of the radar measurement field (Villarini and Krajewski Citation2010a). Although the integration of other data sources and gauge observations in radar QPEs limits these errors, their effect can still be significant, especially at hourly and sub-hourly resolutions (Nelson et al. Citation2016). Here, we characterize the uncertainty of these errors using the multiplicative error model proposed by Ciach et al. (Citation2007). While error models of radar-derived QPEs have been applied in humid (Habib et al. Citation2008, Villarini and Krajewski Citation2009, Citation2010b) and mountainous regions (Germann et al. Citation2009, Kirstetter et al. Citation2010), their application has been limited in arid sites and, to our knowledge, they have never been tested in the desert southwestern US. Moreover, to calibrate the error model, we use observations of 168 raingauges in an area of 5930 km2, resulting in one of the largest densities of ground observations used to apply these statistical models.

The second main activity of this study is to set up a hydrologic–hydraulic model in an urban catchment in Phoenix and use it to assess the propagation of uncertainty of the QPE products into urban flooding predictions. Simulating urban pluvial flooding in detail is a complex task that requires (i) identifying and collecting data on small-scale spatial heterogeneities of catchment features (e.g. roads and buildings) and stormwater infrastructure; and (ii) incorporating these features in a numerical model that simulates rainfall–runoff processes, surface overland flow, and pipe flow. Progress has been made during the last decade to address some of the major challenges of urban hydrologic modeling (e.g. Chen et al. Citation2009, S. Zhang and Pan Citation2014, Leandro et al. Citation2016, Cristiano et al. Citation2017, Grimley et al. Citation2020). However, current operational hydrologic forecasts by the NWS still rely on models that do not explicitly simulate urban hydrologic and hydraulic processes. Here, we take advantage of the availability of high-resolution (0.25 m) terrain from Light Detection and Ranging (LiDAR) and a detailed infrastructure database provided by the City of Phoenix to explore the utility of the Environmental Protection Agency (EPA) Storm Water Management Model (SWMM) model (Koustas Citation2000, Rossman Citation2010), which is widely adopted for engineering design. After setting up both the one- and two-dimensional (1D and 2D) versions of SWMM in a basin of 2.38 km2, we select four storm events and conduct rainfall–runoff simulations using (i) gauge rainfall observations; (ii) Stage IV, MRMS, and GCMRMS QPEs; and (iii) an ensemble of rainfall fields generated with the error model that characterizes the uncertainty of radar-derived QPEs. In doing so, this study focuses on a source of uncertainty (precipitation) affecting urban flood modeling that has received less attention compared to other sources, such as topographic data and parameter specification (e.g. Deletic et al. Citation2012, Abily et al. Citation2016). This work provides valuable support for local flood management and forecasting agencies, and can be used to improve the recently launched NOAA National Water Model (NWM; NOAA Citation2016), which relies on MRMS QPEs and a distributed hydrologic model to simulate streamflow at ~2.7 million river locations over the continental US at different lead times.

2 Study area

The metropolitan area of Phoenix, Arizona ()) is one of the fastest growing urban regions in US, with a population that has risen from 1.86 million in 1985 to over 4.75 million in 2018 (Guan et al. Citation2020). It is located in the southwestern US in a region with a hot desert climate (BWh, according to the Köppen classification) where the mean annual rainfall and temperature are 204 mm and 24°C, respectively (Mascaro Citation2017). The rainfall regime is characterized by marked seasonality, including (i) a summer season from July to September, when the NAM leads to diurnally modulated convective thunderstorms with high rainfall intensities, very short durations (<1 h) and small spatial extents (Balling and Brazel Citation1987); and (ii) a dry period from late fall to early summer that is interrupted by occasional cold fronts leading to widespread storm systems with low-to-moderate rainfall intensity and relatively longer durations of up to a few days (Sheppard et al. Citation2002). The spatial variability of annual, seasonal, and extreme rainfall is moderately to significantly controlled by terrain, which varies from 220 to 2325 m above mean sea level (Mascaro Citation2017, Citation2018, Citation2020).

Figure 1. (a) Metropolitan area of Phoenix, AZ along with location of KIWA WSR-88D weather radar, study basin, and raingauges of the Flood Control District of the Maricopa County (FCDMC) network in Maricopa and Pinal counties, displayed with two different colors depending on the distance from the radar (labelled as cluster 1 and 2). (b) Boundaries of the study catchment and sub-catchments in downtown Phoenix (near the corner of W Roosevelt St and S 7th Ave), along with outlet, underground conduits of the stormwater infrastructure system, grids of Stage IV and Multi-Radar Multi-Sensor (MRMS) (same as gauge corrected MRMS (GCMRMS)) radar products, and raingauge of the FCDMC network

Figure 1. (a) Metropolitan area of Phoenix, AZ along with location of KIWA WSR-88D weather radar, study basin, and raingauges of the Flood Control District of the Maricopa County (FCDMC) network in Maricopa and Pinal counties, displayed with two different colors depending on the distance from the radar (labelled as cluster 1 and 2). (b) Boundaries of the study catchment and sub-catchments in downtown Phoenix (near the corner of W Roosevelt St and S 7th Ave), along with outlet, underground conduits of the stormwater infrastructure system, grids of Stage IV and Multi-Radar Multi-Sensor (MRMS) (same as gauge corrected MRMS (GCMRMS)) radar products, and raingauge of the FCDMC network

While floods and flash floods occur in both summer and winter in this urban area, pluvial flooding events caused by monsoonal thunderstorms occur more frequently than other types of flood events (Yang et al. Citation2017; their ) and are particularly impactful in small urban catchments with short response times. To investigate the predictability of these events, we simulate urban flooding with different radar rainfall products in an urban catchment of 2.38 km2 in downtown Phoenix ()). The basin is largely (~80%) impervious and includes the dense downtown business district, the Phoenix City Hall, and entertainment venues such as the Citi Field Baseball stadium. As detailed in ), the drainage infrastructure in the catchment includes 657 catch basins, 386 manholes and other junctions, and 1091 pipe segments totalling 24 km, all draining to one outlet that discharges to the Salt River south of downtown Phoenix. In addition to this surface discharge, there are 26 drywells that capture and infiltrate stormwater.

Figure 4. Relations between e(Rr) and Rr for the different radar products, along with quantiles of the empirical (dotted lines) and mixed gamma (solid lines) distributions for cluster 2

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS.
Figure 4. Relations between e(Rr) and Rr for the different radar products, along with quantiles of the empirical (dotted lines) and mixed gamma (solid lines) distributions for cluster 2

3 Datasets

3.1 Rainfall products

We use three radar-derived QPEs: Stage IV, MRMS, and GCMRMS. Stage IV QPEs are produced by processing and mosaicking reflectivity data from the NEXRAD network; rainfall rates are further adjusted with gauge and satellite observations and manually quality controlled (Y. Lin Citation2020). MRMS products are derived by integrating radar observations from the NEXRAD and Canadian networks, with atmospheric environmental data, satellite data, and lightning and raingauge observations (J. Zhang et al. Citation2016). We acquired Stage IV and MRMS QPEs for summers (July to October) of 2015–2019. For MRMS, we obtained version 11 of the radar-only and gauge-corrected (GCMRMS) products. Stage IV (MRMS; GCMRMS) QPEs are available in polar stereographic coordinates at 4-km, 1-h (1-km, 2-min; 1-km, 1-h) resolution. We project all radar products into Universal Transverse Mercator (UTM) Zone 12 N and aggregate MRMS at one-hour time resolution. To quantify errors in the QPEs and apply the multiplicative error model, we use rainfall records of 168 gauges of the Automated Local Evaluation in Real Time (ALERT) network managed by the Flood Control District of the Maricopa County (FCDMC; )). For these tipping-bucket gauges, we convert the tipping instants into rainfall intensities at one-hour resolution following the procedure described by Mascaro et al. (Citation2013).

3.2 Geospatial datasets for hydrologic–hydraulic simulations

We set up the EPA SWMM hydrologic–hydraulic model in the study basin using several geospatial datasets. We use the terrain description from a LiDAR product at an equivalent resolution of 0.25 m that is publicly available (ASU Citation2018); soils data from the Natural Resources Conservation Service (NRCS) Web Soil Survey (NRCS Citation2019); and percent imperviousness from a 30-m map from the National Land Cover Database (MRLC Citation2016). We obtained information on drainage infrastructure components from the City of Phoenix, including location of pipes, dry wells, and manholes; pipe material, diameter, and slope; and elevations of pipe rim and invert.

4 Methods

In the following sections, we first describe the statistical error model used to characterize the uncertainty of radar errors, and then we illustrate the setup of the hydrologic–hydraulic model. Lastly, we detail the approach used to sample the radar rainfall error model to force the hydrologic–hydraulic model and describe the metrics adopted to quantify the associated uncertainty.

4.1 Radar rainfall error model

We use a multiplicative model to characterize the radar rainfall model uncertainty. This type of model has been shown to outperform additive models in terms of predictive skill and ability to separate systematic and random errors and capture the nonlinear relationship between the magnitude of errors and measurement (Tian et al. Citation2013, Tang et al. Citation2015). Specifically, we use the model proposed by Ciach et al. (Citation2007) and recently improved by Villarini et al. (Citation2014), who suggested the use of a mixture of gamma distributions instead of the Gaussian distribution to characterize positive multiplicative errors, along with techniques to account for the spatiotemporal structure of the errors. Since the errors of radar-derived products are affected by the distance from the radar, we divide the raingauges into two clusters, as shown in ), and apply the model separately on each cluster. In the following, we provide a brief description of the model and refer the reader to Ciach et al. (Citation2007) and Villarini et al. (Citation2014) for a detailed description. We also point out that a recent study by Ciach and Gebremichael (Citation2020) has proposed an alternative parameterization of the error model based on a three-parameter modified Laplace model.

The model assumes that the difference between area-averaged radar rainfall estimates and point gauge measurements is negligible. In a previous model application in Oklahoma by Ciach et al. (Citation2007), this assumption was supported, referring to Ciach and Krajewski (Citation2006), who calculated the spatial correlogram of one-hour rainfall time series observed at pairs of gauges with distances, d, of less than 4 km. These authors found the Pearson’s correlation coefficient, ρ, to be larger than 0.88, thus suggesting the rainfall spatial variability to be small within a radar pixel. We perform a similar analysis with our dataset by computing the Kendall’s τ correlation coefficient to better measure the correspondence between samples that are not well described by the Gaussian distribution (Serinaldi Citation2008, Villarini et al. Citation2014). We then derive ρ from τ using the equation ρ=sinπτ2 (Fang et al. Citation2002) and fit the power-law relation

(1) ρd=ρ0expdd0(1)

with parameters ρ0 and d0, to capture the average behavior. We find ρ to be larger than 0.88 for d ≤ 1 km (the MRMS grid resolution), which is similar to the value of Ciach and Krajewski (Citation2006) (see Fig. S1 in the Supplementary material). The minimum ρ decreases instead to 0.72 for d = 4 km, suggesting that a single raingauge may not fully capture the rainfall variability within a Stage IV pixel. Dai et al. (Citation2018) showed that this can introduce uncertainty in the application of a different error model. Since a similar study has not been yet conducted for the model adopted here and is beyond the scope of this paper, we will continue to assume that raingauges provide a relatively accurate estimate of the area rainfall.

According to the multiplicative structure of the error model, the true rainfall value in a radar pixel, Rtrue, can be expressed as the product of a deterministic component, hRr, and a random component, eRr

(2) Rtrue=h(Rr)e(Rr).(2)

Both functions in EquationEquation (2) depend on Rr, which is the radar rainfall value corrected for unconditional bias (if present), computed as:

(3) Rr=B0Rr.(3)

B0 is in turn calculated as:

(4) B0=iRg,iiRr,i,(4)

where Rg,i is the raingauge measurement at the ith gauge, and Rr,i is the biased radar rainfall in the co-located pixel (i.e. the original Stage IV, MRMS and GCMRMS values).

In EquationEquation (2), the function hRr provides an estimate of the bias conditioned on Rr, while eRr accounts for residual errors after removing unconditional and conditional biases. We obtain the two components as follows. We use raingauge observations, Rg, to approximate Rtrue; next, following Ciach et al. (Citation2007), we compute hRr by (i) applying the Epanechnikov kernel with a minimum of 20 data points, and (ii) fitting the power relation:

(5) hRr=aRrb(5)

with parameters a and b, to the line derived from the kernel. EquationEquation (5) allows obtaining the sample of errors as the ratio of Rg (approximating Rtrue) and hRr. We use this sample to find the best parametric distribution to statistically model the random component eRr, testing different mixtures of gamma distributions, as proposed by Villarini et al. (Citation2014). For each distribution, we evaluate the possibility of using parameters that are constant or dependent on Rr and choose the most appropriate one with the Akaike information criterion (AIC; Akaike Citation1974).

We also test whether the random errors in our dataset exhibit significant spatial and temporal dependencies. For each cluster, we compute the spatial correlogram of the random errors using the Kendall’s τ correlation coefficient and the same power model of EquationEquation (1), where ρ(d) and ρ0 are now replaced by τ(d) and τ0, respectively (Fig. S2 in the Supplementary material). We find the nugget parameter τ0 to range from 0.23 to 0.37 depending on the radar product, thus indicating that the correspondence between errors at close pixels is rather low. To verify this assumption, we (i) generate 100 spatially correlated error fields, following Villarini et al. (Citation2014), as well as 100 random error fields; and (ii) compute for each field the basin mean areal rainfall during four selected storms (see Section 4.3). We find the uncertainties of the basin mean areal rainfall derived from spatially correlated and uncorrelated fields to be very similar (not shown). To quantify the temporal dependencies, we compute the autocorrelation of the errors for N = 462 storms (detected in 85 individual pixels) with durations longer than four hours. We then compare the resulting median and 90% confidence interval with those of N time series of random white noise with the same duration as the observed events. We find that – apart from MRMS, which exhibits a certain degree of autocorrelation at lag 1 – observed and synthetic autocorrelations are similar (Fig. S3 in the Supplementary material). Based on these empirical outcomes, we consider the spatiotemporal dependencies among the random errors to be negligible for the purpose of applying the error model in our basin.

4.2 The coupled hydrologic–hydraulic model

4.2.1 Brief model description

SWMM is a semi-distributed rainfall–runoff model developed by the US EPA (Koustas Citation2000) for simulating runoff in urban catchments. The model includes a hydrologic component that transforms rainfall into surface runoff in sub-catchments, and a routing component that transports this runoff through stormwater infrastructure elements, including pipes, channels, and storages. As detailed in the user’s guide (W. James et al. Citation2010), different hydrologic processes can be simulated that allow applying the model in event-based or continuous fashion. The routing component includes several hydraulic modeling capabilities largely based on the application in 1D of the momentum and continuity equations. The model domain is generated by (i) discretizing the catchment into smaller sub-catchments corresponding to each inlet into the infrastructure drainage system; and (ii) introducing different types of model elements, such as conduits, junctions (e.g. catch basins and manholes), outfall nodes, and storage units (see for a description of the main elements). Model inputs include rainfall data, provided through one or more raingauges (or radar pixels; see ), while outputs include the flow rate and hydraulic grade lines throughout the drainage system, and surface flood volumes when surcharge occurs (i.e. water rises above the crown of the highest conduit) and the hydraulic grade line exceeds surface elevation of the junctions.

Table 1. Description of the main components of the Storm Water Management Model (SWMM) hydrologic–hydraulic model

In addition to the standard components of SWMM, we also use the proprietary PCSWMM implementation to model overland routing of flood water and the resulting flood extent. This is accomplished through an integrated 1D–2D modeling approach where the fully dynamic 1D approach (i.e. SWMM) is extended to 2D free surface flow using a mesh that captures topography, geometry, and built structures. PCSWMM builds upon the SWMM 1D model and discretizes the domain into a square mesh where each 2D cell is represented with a 2D node or a junction, whose invert elevations are assigned as either the ground surface elevation or the rim elevation of adjacent coupled 1D junctions (Finney et al. Citation2012, R. James et al. Citation2013). In PCSWMM, rainfall–runoff processes and routing in the pipe system are simulated in 1D, as in SWMM. When flood conditions are simulated, water leaves the drainage system at catch basin junctions (and manholes, if not bolted) and is then routed on the 2D mesh cells. Flood water is routed across the land surface and, as the hydraulic grade line drops below the catch basin rim elevation, water can re-enter the drainage system.

4.2.2 Model setup

To set up SWMM in our basin, we extensively pre-process and quality control the LiDAR and infrastructure data through geographic information system (GIS) software and field visits. We use the LiDAR point cloud data with an average point spacing of 0.25 m and a total point count of over 37.8 million in the basin to generate a digital elevation model (DEM) at 0.6-m resolution for the catchment delineation. The elevations of the point cloud data range from 320 to 484 m, mainly constituting returns from the road surface and roofs of the buildings. Features such as buildings are initially retained as they can influence sub-catchment boundaries. Next, we use the high-resolution DEM to generate 2D computational square grid cells at a lower resolution of 4.57 m (15 ft) for overland routing of flood waters needed for PCSWMM. This lower resolution is selected due to the high computational demands of routing overland flow. We then exclude buildings through a mask of building footprints available from ASU (Citation2018), as they would artificially raise the grid cell elevation and impact overland flow routing. Using aerial imagery, we visually identify DEM and 2D cells with abnormally low or high elevations caused by the presence of overpasses or sites that were under construction when the LiDAR was obtained. To prevent ponding at these artificial depressions, those sections are excluded from the 2D computational grid by creating bounding layers over them.

As a next step, we check the reliability of the GIS data of stormwater infrastructure components through field visits. During these, we verify surface structures (e.g. catch basins, manholes) in frequently flooded areas identified by project partners in the City of Phoenix and the FCDMC, as well as areas that deviated from standard design practice. In addition, we identify a few missing or moved surface components, including catch basins and manholes, as well as missing subsurface components where stormwater could not reach the outlet from a portion of the system as documented. These missing pipe segments are specified based on design standards. Additionally, some components have missing attribute data such as rim or invert elevations, slope, pipe diameter, and pipe material. Rim elevations are estimated from the DEM, while invert elevations, slope, pipe size, and material are estimated from adjacent components and design standards. Once data gaps are assessed and filled, the infrastructure shapefiles are imported into SWMM to create the drainage network in the model. Finally, pipe material (mostly concrete) is used to specify Manning’s roughness coefficient (on average, 0.011).

We apply the model for event-based simulations using rainfall data from (i) the only gauge of the FCDMC networks installed within the basin; and (ii) the radar grids (see )). For the hydrologic component, we specify the percent imperviousness of each sub-catchment using the NRCS dataset and we simulate infiltration with the Green-Ampt method, which is parameterized with soil properties from the Web Soil Survey database. Parameter values are reported in . The sub-catchments draining to each of the 26 drywells present in the basin are modeled as disconnected from the rest of the drainage system unless flooding and overland flow cross sub-catchment boundaries (modeled only in the 2D PCSWMM). We use the 1D dynamic wave routing method to model routing in the pipes. PCSWMM simulates surface flooding by coupling the 1D model of the drainage network with a 2D overland flow model. When surcharge occurs, flow exiting catch basins (all manholes are bolted in our study basin) is routed on the 4.57 m × 4.57 m mesh surface using the St Venant flow equations.

Table 2. Parameter values of the Green-Ampt infiltration scheme adopted in Storm Water Management Model (SWMM)

Table 3. Imperviousness and ground slope values assigned to the sub-catchments, summarized as a range of values for percentages of the basin area

4.3 Hydrologic–hydraulic simulations

We identify four storm events from the 2015–2019 study period with significant areal coverage in Phoenix and relatively high average precipitation depth, and for which NOAA has reported impacts on traffic and properties (NOAA Citation2021). The storm characteristics are presented in . Two storms occurred in the middle of the NAM season and, as such, are of monsoonal origin; the other two were observed at the end of September and in October during the transition from NAM to winter weather conditions and, for one of them, the source of moisture came partially from a tropical storm (personal communication with Dr. Larry Hopper, at the NWS). Storm durations range from 6 to 30 hours and gauge total rainfall varies from 5 to 10 mm. The impacts of one of the storms (23 September 2019) on street-level flooding in the study basin are presented in the photograph () that was taken with a camera installed to monitor flooding during the event. For each storm, we conduct simulations with SWMM using different rainfall inputs at one-hour resolution, including (i) raingauge observations (Rg); (ii) original (i.e. Rr*) QPEs from Stage IV, MRMS and GCMRMS; and (iii) an ensemble of N = 100 fields generated through the error model for each radar product. The rainfall inputs derived from the raingauge are spatially uniform, while those produced by the original radar products and the error model are spatially variable with a resolution of 4 km (1 km) for Stage IV (MRMS and GCMRMS), as also shown in ). The procedure used to generate N = 100 fields with the error model is based on Monte Carlo simulations, consisting of the following steps:

  1. For a given storm with T time steps, the unconditional bias is first removed from the original radar estimates using EquationEquation (3), and the deterministic component hRr is calculated with EquationEquation (5) at all pixels and times.

  2. Since the spatiotemporal dependencies of the errors are found to be negligible, for each pixel, a total of T random errors, eRr, are randomly generated from the corresponding probability distribution.

  3. The errors and the deterministic component hRr are then used to estimate the true rainfall using EquationEquation (2), resulting in T rainfall grids for each pixel. This spatiotemporal rainfall field represents a possible realization of the true rainfall for the analyzed event, given the original radar-derived QPEs. The spatiotemporal rainfall field is then used to force SWMM.

  4. Steps 2 and 3 are repeated N times.

Table 4. Summary of storm characteristics, including start and end date and time in Coordinated Universal Time (UTC), identified as the times when (i) any of the gauge and radar products started measuring nonzero rainfall in the basin and (ii) all measured zero rainfall, respectively; type as defined by Dr. Larry Hopper from National Weather Service (NWS) (2020; personal communication); duration; rainfall totals observed by gauges (Rg); and original (Rr*) and bias corrected (in parentheses; Rr) mean areal rainfall totals of the radar-derived estimates

Figure 2. Flooding in the study catchment at Central Ave and Washington St in Phoenix, AZ during the storm on 23 September 2019

Figure 2. Flooding in the study catchment at Central Ave and Washington St in Phoenix, AZ during the storm on 23 September 2019

Since the 2D simulations with PCSWMM are computationally intensive, running this model N = 100 times is not feasible. Therefore, we select three rainfall sequences generated from the previous Monte Carlo SWMM simulations for each storm. These sequences correspond to the 25th, 50th, and 75th percentiles of the distribution of the total flood volume, Vf (i.e. the volume of water that exits the drainage system through catch basins when the hydraulic grade line exceeds the rim elevation of catch basins) across the 100 Monte Carlo runs. We then apply PCSWMM for these three realizations for each storm to illustrate the influence of precipitation uncertainty on flood extent in a computationally efficient manner.

4.4 Uncertainty quantification

Let {x1, …, xN} be the values of a given variable x (rainfall or discharge at a given time step) returned by the ensemble runs with the radar rainfall or the hydrologic model. We quantify the uncertainty of these ensemble simulations of x through two simple metrics: (i) the skewness of the distribution of {x1, …, xN}; and (ii) the interquartile range relative to the median: RIQR = (q0.75q0.25)/q0.50 × 100, where qp is the quantile of the empirical distribution of {x1, …, xN} associated with the non-exceedance probability p.

5 Results

5.1 Characterization of radar QPE uncertainty

The first step to apply the error model is to calculate the unconditional bias, B0, which is needed to compute Rr via EquationEquation (3). We find B0 to be 1.26, 0.66, 0.76 for Stage IV, MRMS, and GCMRMS, respectively, indicating that original Stage IV (MRMS and GCMRMS) QPEs underestimate (overestimate) the gauge observations. presents the scatterplots between Rr and Rg (estimating the true rainfall, Rtrue) along with the lines obtained through the Epanechnikov kernel and the power law of EquationEquation (5) for cluster 2 (results for cluster 1 are reported in Fig. S4 of the Supplementary material). All radar products exhibit a positive systematic bias that becomes larger as Rr increases, as found in previous work (e.g. Ciach et al. Citation2007, Villarini and Krajewski Citation2009, Citation2010b, Villarini et al. Citation2014). The conditional bias for Stage IV QPEs is much smaller than that found for the MRMS and GCMRMS products. The errors for GCMRMS are slightly smaller than those of MRMS for Rr ≤ 15 mm/h, where most data points lie; the opposite is true for Rr > 15 mm/h, although fewer points exist in this region.

Figure 3. Scatterplots of radar rainfall estimates, Rr, and true rainfall approximated by the gauge estimates, Rg, for cluster 2. A total of 8645, 8859, and 8605 nonzero pairs for (a) Stage IV, (b) Multi-Radar Multi-Sensor (MRMS), and (c) gauge corrected MRMS (GCMRMS) are shown using different colors based of their relative frequency. In each panel, the dashed red line is the systematic component estimated with the Epanechnikov kernel, and the solid black line is the power law of EquationEquation (5). Parameters of the power law are a = 1.12, b = 0.90 for Stage IV; a = 1.37, b = 0.77 for MRMS; and a = 1.86, b = 0.66 for GCMRMS

Figure 3. Scatterplots of radar rainfall estimates, Rr, and true rainfall approximated by the gauge estimates, Rg, for cluster 2. A total of 8645, 8859, and 8605 nonzero pairs for (a) Stage IV, (b) Multi-Radar Multi-Sensor (MRMS), and (c) gauge corrected MRMS (GCMRMS) are shown using different colors based of their relative frequency. In each panel, the dashed red line is the systematic component estimated with the Epanechnikov kernel, and the solid black line is the power law of EquationEquation (5)(5) hRr=aRrb(5) . Parameters of the power law are a = 1.12, b = 0.90 for Stage IV; a = 1.37, b = 0.77 for MRMS; and a = 1.86, b = 0.66 for GCMRMS

As a next step, we use EquationEquation (2) to compute the errors, e(Rr), quantifying the variability of radar rainfall estimates around the h(Rr) lines. shows the errors along with the empirical quantiles (dotted lines) derived with the Epanechnikov kernel for cluster 2 (results for cluster 1 are reported in Fig. S5 of the Supplementary material). For all products, the distribution of the errors is positively skewed and, importantly, does not vary with Rr. Based on this empirical outcome, we adopt a parametric distribution with constant parameters (i.e. a distribution that does not change with Rr), as in Villarini et al. (Citation2014). We find the mixture of three gamma distributions to be the most appropriate parametric distribution, as suggested by the lowest AIC values (see Supplementary material, Table S1). The quantiles derived from the mixed gamma distribution are shown in with solid lines, while the comparison between empirical and parametric distributions is reported in (see Fig. S6 in the Supplementary material for cluster 1) and parameters of the gamma distribution for each radar product are summarized in . reveals that, for all products, the distribution of the errors is very well reproduced up to about e(Rr) = 5; larger values of e are underestimated, but they only account for less than 0.5% of the sample.

Table 5. Parameters of the mixture of three gamma distributions used to characterize the distribution of errors, fe;w1,α1,β1,w2,α2,β2,w3,α3,β3=j=13wjge;αj,βj, where ge;αj,βj is the probability density function of the jth gamma distribution of the errors, e (for simplicity, we dropped the dependence on Rr). Parameter values are reported for clusters 1 (C1) and 2 (C2)

Figure 5. Comparison of survival functions of the empirical distribution and mixture of three gamma distributions for cluster 2

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS.
Figure 5. Comparison of survival functions of the empirical distribution and mixture of three gamma distributions for cluster 2

5.2 Propagation of uncertainty from rainfall to runoff

5.2.1 Rainfall uncertainty

presents, for each storm, the time series of rainfall recorded at the gauge (Rg; red dots) along with the basin mean areal rainfall estimated by the three radar QPEs (Rr*; blue dots) and simulated by the corresponding error model (box plots displaying 50% and 90% confidence intervals of N = 100 values produced by the Monte Carlo runs). For all events, the radar products estimate nonzero rainfall in a larger number of time steps as compared to the gauge. For the gauge-corrected Stage IV, Rg is very close to both Rr* and the median of the Monte Carlo simulations. As discussed, MRMS QPEs are positively biased, so that Rr* is in most cases larger than Rg and median of the error model simulations. This bias is reduced when the gauge correction is applied in GCMRMS (see e.g. the event on 10 July 2018). For all three products, higher rainfall measurements yield a higher magnitude of uncertainty for the error model. Finally, presents the mean skewness and RIQR of the ensemble hourly rainfall across each storm duration. Due to the positive skewness of the distribution of the residual error in the radar error model (see ), the mean skewness of the simulated rainfall is also positive and included between 1.0 and 3.2, with higher values for Stage IV, followed by MRMS and GCMRMS. The relative variability across the ensemble members is quantified via the RIQR, whose average ranges from 56.1% to 74.6%, with the highest (lowest) values found again for Stage IV (GCMRMS).

Table 6. Quantification of uncertainty of simulated rainfall and runoff via skewness and RIQR of the ensemble rainfall and discharge at the basin outlet (in parentheses) averaged across each event duration (i.e. skewness and RIQR are computed for each time step and then averaged in time). Values are reported for each radar product

Figure 6. Time series of rainfall recorded at the gauge (Rg), basin mean areal rainfall estimated by the three radar products (Rr*), and box plots of the N = 100 Monte Carlo simulations of the radar rainfall error model for the four events in . Boxes and whiskers of the box plots show the 50% and 90% confidence intervals, respectively, while the horizontal line is the median. Times are in UTC.

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS

Figure 6. Time series of rainfall recorded at the gauge (Rg), basin mean areal rainfall estimated by the three radar products (Rr*), and box plots of the N = 100 Monte Carlo simulations of the radar rainfall error model for the four events in Table 2. Boxes and whiskers of the box plots show the 50% and 90% confidence intervals, respectively, while the horizontal line is the median. Times are in UTC.MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS

5.2.2 Uncertainty of basin-integrated runoff

We start evaluating the propagation of rainfall uncertainty into runoff by first assessing the variations in the outlet hydrograph (i.e. outflow at the pipe that discharges to the Salt River) across radar products and storms. The hydrographs simulated by the SWMM model under the rainfall forcings of are displayed in . The use of Stage IV QPEs and raingauge observations as inputs for SWMM leads to closely aligned hydrographs, which are also very similar to the median hydrograph of the corresponding ensemble simulations. In contrast, the outflow generated by the original MRMS QPEs is consistently biased high compared to the simulation under the gauge rainfall observations, with the median of ensemble runs falling between the two. As found in for the rainfall values, the gauge correction of MRMS QPEs reduces the bias in the resulting flow, as shown in the hydrographs with GCMRMS forcings. The uncertainty of the ensemble hydrologic simulations is quantified in via the time-averaged skewness and RIQR of the discharge at the outlet. The mean skewness is still positive, with values in the range 0.4–2.5, which are lower compared to those found for rainfall. This is also true for the mean RIQR that varies from 22.4% to 66.9%. Metric values are, again, the largest (smallest) for Stage IV (GCMRMS).

Figure 7. Hydrographs simulated by the SWMM model at the pipe located at the catchment outlet under rainfall recorded at the gauge (Rg), original radar QPEs (Rr*), and an ensemble of rainfall fields generated with the error model. The corresponding N = 100 outflow simulations are plotted with shaded areas showing the 50% (light gray) and 90% (dark gray) confidence intervals, while the black line is the median.

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS

Figure 7. Hydrographs simulated by the SWMM model at the pipe located at the catchment outlet under rainfall recorded at the gauge (Rg), original radar QPEs (Rr*), and an ensemble of rainfall fields generated with the error model. The corresponding N = 100 outflow simulations are plotted with shaded areas showing the 50% (light gray) and 90% (dark gray) confidence intervals, while the black line is the median.MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS

To complement the information provided by the hydrographs at the catchment outlet and better assess flood impacts and differences across precipitation inputs, we examine additional metrics simulated by SWMM. These include Qmax: peak flow of the hydrograph at the outlet; and Vf: stormwater volume that ponds on the surface of the catchment when the hydraulic grade line reaches the catch basin rim elevation at one or more locations. In addition, we consider two metrics that characterize the hydraulic response of the stormwater pipe system under surcharge conditions; these are tmax: maximum total duration of node surcharging across all nodes; and hmax: maximum height of the hydraulic grade line above the pipe invert level. Note that nonzero values for tmax and hmax do not necessarily imply surface flooding, as mentioned in Section 4.2.1. presents the simulated metric values for each storm and radar product. Simulations under gauge rainfall, Rg, could be used as reference to compare across storms. The lowest (highest) Qmax and Vf are obtained for the storm on 23 September 2019 (7 October 2018), which are characterized by the lowest (highest) rainfall intensity (see ). hmax exhibits a certain relation with Qmax and Vf, which quantify actual surface flooding, while tmax is largely controlled by storm duration (reported in ). Turning our attention now to the radar products, all metrics modeled with Rg are close to those simulated under Stage IV Rr* and in most cases (all cases except tmax for event 1) within the 50% (90%) confidence interval of the ensemble runs. When compared to the metrics simulated under Rg, simulations with MRMS rainfall products lead to longer tmax in two events and larger hmax,Qmax,Vf in most storms. The use of GCMRMS improves MRMS performance, apart from some metrics in individual storms (e.g. hmax on 23 September 2019; Vf on 24 July 2017). Finally, when considering the spread of the ensemble distributions, there is not a specific radar product that leads to wider or narrower distributions for a given metric across all events. These metrics are further discussed in Section 6.

Figure 8. Flood metrics simulated by one-dimensional SWMM for the selected storms with rainfall recorded at the gauge (Rg), original radar QPEs (Rr*), and ensemble of rainfall fields generated with the error model plotted through box plots (boxes and whiskers show the 50% and 90% confidence intervals, respectively). See the text for definitions of the flood metrics. tmax is measured in hours since the simulations started

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS; SWMM: storm water management model.
Figure 8. Flood metrics simulated by one-dimensional SWMM for the selected storms with rainfall recorded at the gauge (Rg), original radar QPEs (Rr*), and ensemble of rainfall fields generated with the error model plotted through box plots (boxes and whiskers show the 50% and 90% confidence intervals, respectively). See the text for definitions of the flood metrics. tmax is measured in hours since the simulations started

5.2.3 Uncertainty of spatial flooding

The metrics presented in aid in comparing model output across simulations. However, information on location and extent of flooding over time is needed to alert residents to dangerous conditions, provide timely emergency response, and identify critical locations for infrastructure investment. Such information is obtained through the 2D simulations of flood routing with PCSWMM. and 10 illustrate the uncertainty in maximum flood extent (i.e. areas that are flooded for at least one time step) simulated under each radar product for the storms on 31 July 2018 and 7 October 2018, respectively (Figs S7 and S8 in the Supplementary material show results for the other two storms). Uncertainty is captured through the simulations under the rainfall fields of the error model associated with the 25th percentile (blue), median (green), and 75th percentile (red) of the distribution of Vf from the 1D SWMM simulations. Thus, if a pixel is blue, flooding has been simulated for all higher percentiles; when it is green, it has been simulated for the median and 75th percentile; and when the pixel color is red, flooding only occurred for the 75th percentile.

Figure 9. Uncertainty of flood extent maps for the storm on 31 July 2018, simulated by PCSWMM under (a–c) Stage IV, (d–f) MRMS, and (g–i) GCMRMS products

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS.
Figure 9. Uncertainty of flood extent maps for the storm on 31 July 2018, simulated by PCSWMM under (a–c) Stage IV, (d–f) MRMS, and (g–i) GCMRMS products

For the monsoonal storm on 31 July 2018, the use of rainfall fields generated by the error model with Stage IV QPEs ()) leads to simulation of street flooding in two distinct basin locations ()), with a total of 0.001, 0.03, and 0.05 km2 of flooded areas for the 25th percentile, median and 75th percentile, respectively. If rainfall forcings are provided by MRMS ()), street flooding is also predicted in a location in the upper part of the basin ()). Given the positive bias of these QPEs, the simulated flooding extent is larger and equal to 0.07 (0.23) km2 for the 25th (75th) percentile. Interestingly, the uncertainty across the percentiles is low in a relatively large region in the lower part of the catchment (blue area in ). The gauge correction of MRMS QPEs performed for GCMRMS leads to results that are substantially similar to those for Stage IV ()). Maps of flooded areas for the transition storm on 7 October 2018 are reported in . Flooding is predicted in two of the same basin regions displayed in . Flooded areas under Stage IV ()) are very similar for the median (0.16 km2) and 75th percentile (0.19 km2), while they decrease substantially for the 25th percentile (0.006 km2). As found for the monsoonal storm of , using MRMS QPEs ()) as model inputs increases the extent of street-level flooding as compared to Stage IV, which reaches 0.04 (0.4 km2) for the 25th (75th) percentiles. The adoption of GCMRMS ()) leads instead to lower flooded areas compared to Stage IV, ranging from no flooded areas for the 25th percentiles to 0.09 km2 for the 75th percentile.

Figure 10. Uncertainty of flood extent maps for the storm on 7 October 2018, simulated by PCSWMM under (a–c) Stage IV, (d–f) MRMS, and (g–i) GCMRMS products

MRMS: Multi-Radar Multi-Sensor; GCMRMS: gauge corrected MRMS.
Figure 10. Uncertainty of flood extent maps for the storm on 7 October 2018, simulated by PCSWMM under (a–c) Stage IV, (d–f) MRMS, and (g–i) GCMRMS products

6 Discussion

While focused on an urban catchment in the Phoenix metropolitan area, the results of our modeling efforts are based on extensive precipitation, land surface, and infrastructure datasets, and, as such, they provide general insights on the utility of radar QPEs and hydrologic–hydraulic models for urban flood predictions in the NAM region and other areas, as discussed in the next subsections.

6.1 Comparison of radar products and their utility for urban flood modeling

When applying the error model using the network of 168 gauges, we find that, on average, Stage IV QPEs underestimate gauge rainfall by ~20%, while MRMRS and GCMRMS overestimate it by about 51% and 31%, respectively (see values of B0 reported in Section 5.1). Qualitatively, this result is confirmed for the four simulated storm events. Our finding is consistent with the extensive comparison of six radar products, including Stage IV and earlier versions of MRMS and GCMRMS, performed by Seo et al. (Citation2018) in Iowa, but opposite to two separate analyses, by Lin et al. (Citation2018) and Gao et al. (Citation2021), in Texas during a set of flood events. The existence of regional differences in performance of Stage IV and MRMRS have been recently highlighted in another comparison study in Louisiana by Sharif et al. (Citation2020). Overall, our results and those of other studies suggest that radar-only QPEs are critically improved through gauge-based bias corrections, as done in Stage IV and GCMRMS.

shows that nonzero rainfall is observed by the radar QPEs in more time steps compared to the single point measurement of the gauge, indicating that, even in a very small basin, a single gauge may be unable to monitor localized monsoonal convective storms. Despite this, show that flood simulations under the more accurate Stage IV-derived rainfall products lead to very similar outcomes to those obtained with gauge rainfall, Rg. This suggests two considerations. First, while the basin area is small, the presence of stormwater infrastructure acts as an integrator of the rainfall spatiotemporal variability, so that, despite differences between Rg and Stage IV QPEs, the resulting flood simulations are substantially quite similar. Second, an important consequence of the first consideration is that, when gauge observations are not available, hydrologic simulations with Stage IV QPEs (available throughout the country) can provide a reliable reference for evaluating the impacts of urban flooding. On the other hand, in our study basin, the use of MRMS QPEs consistently results in the largest flood volumes and extents, while the flooding impacts simulated with GCMRMS QPEs are similar to those obtained with Stage IV (see ). Thus, our work demonstrates that the gauge correction applied to MRMS greatly improves the utility of this radar-only product for flood modeling.

6.2 Uncertainty of radar rainfall and corresponding urban flood simulations

Radar QPEs can provide key support for urban flood forecasting and management, complementing gauge observations that may only be available at sparse locations and not in real time. However, radar products are affected by different sources of errors, thus requiring models to characterize the associated uncertainty. Here, we use an error model with a multiplicative structure that leads to the largest uncertainties for the highest rainfall values (see ), as also found in previous applications (Villarini et al. Citation2014). As a consequence, the uncertainty of urban flood predictions is expected to also be relatively high, as these higher precipitation observations are more likely to result in flooding. This is in general confirmed by our results, because the largest spread of the basin outflow confidence intervals occurs right after the highest rainfall events (see ). However, a closer look at the uncertainty metrics for rainfall and discharge reported in suggests that the hydrologic processes governing the rainfall–runoff transformation have two effects: (i) they smooth the impact of the highest rainfall values simulated by the error model, so that the distribution of the ensemble outflows is more symmetric (i.e. less positively skewed) than the distribution of the rainfall inputs, as also found by Grimley et al. (Citation2020); and (ii) they reduce the relative spread of the ensemble rainfall distribution, as shown by the mean RIQR decreasing from an average of 64.7% to 48.2% when considering all events and radar products. These conclusions should be refined in future work quantifying the effect of other sources of uncertainty for the hydrologic–hydraulic model.

We also highlight that the uncertainty of the basin outflow is greater in the rising than the falling limb of the hydrographs (). Specifically, across all storms and radar products, the 50% (90%) confidence interval range in the falling limb is, on average, just 8.8% (6.3%) of the range in the rising limb. This behavior can be explained considering that, in highly impervious areas and small watersheds, the rising limb and its associated uncertainty are strongly shaped by the rainfall rate. In contrast, during the falling limb of the hydrograph, catchment and drainage network properties exert a larger control that results in an uncertainty reduction. When considering the other flood metrics characterizing surface flooding and surcharge conditions in the stormwater infrastructure, we find that their uncertainty varies substantially (). As expected, the variability of Qmax is related to the QPE uncertainty, although the ensemble distributions for Qmax are more symmetric for the same reasons mentioned above. The distribution of Vf is highly skewed with several zero values, despite the fact that tmax and hmax are never zero. This implies that a fraction of QPEs with lower values lead to surcharge conditions but not to surface flooding. The uncertainty of tmax is practically negligible, indicating that all precipitation inputs lead to surcharge conditions over very similar maximum durations. In contrast, hmax, which quantifies the severity of surcharge conditions, exhibits a variability (mainly in the range of 5 m) due to the different QPE values simulated by the error model.

6.3 Utility for urban flood management

Urban pluvial flooding is characterized by localized damage to property and impacts to pedestrian and vehicle travel. Timely information on the location and severity of these hazards can enable response and limit travel delays and risks (NWS Citation2021). Our simulations with PCSWMM show that, despite uncertainties and biases across radar products, all of them robustly identify the same basin regions experiencing flooding conditions (, S7 and S8). This is a promising result supporting the operational use of QPFs derived from radar QPEs combined with hydrologic–hydraulic models, as also highlighted by Grimley et al. (Citation2020) when discussing their simulation of pluvial flooding in an urban area in Iowa. However, the extent of flooding areas exhibits differences both for the same radar product and across radar products. The latter differences can be significant when the QPE bias is high, as found for MRMS. Therefore, flood response can confidently target the correct areas within a few blocks, though the precise flood extent remains uncertain. Improvements in QPEs accuracy are then expected to greatly benefit prioritization and response measures during floods. These findings are driven by both catchment topography and the design of the drainage network; therefore, the flood management implications of the residual uncertainty may vary by location.

It must also be noted that the SWMM model has its own structural and parametric uncertainties (Deletic et al. Citation2012, Abily et al. Citation2016). While fully assessing these is beyond the scope of the current analysis, it is important to consider that the uncertainty in flood estimates simulated in our study for an ensemble of QPEs incorporates other uncertainty sources in the SWMM model. The flood metrics and maps () also point to the need for flood extent and depth data in urban areas to properly validate the accuracy of these hydrologic and hydraulic simulations. New hydrological sensing approaches such as citizen science (Assumpção et al. Citation2018, Fava et al. Citation2019) and image-based sensing (Tauro et al. Citation2017, Hostache et al. Citation2018) offer opportunities to sense flooding as it occurs at the urban scale. These emerging approaches could enable error assessment of the urban hydrological model, providing further context to interpret the propagation of precipitation uncertainty into urban flooding.

7 Conclusions

The small size and heterogeneity of urban catchments, combined with the short, high-intensity precipitation events that drive pluvial flooding, make accurate and efficient urban pluvial flood modeling an ongoing research challenge (Blanc et al. Citation2012, van Dijk et al. Citation2014, Bermúdez et al. Citation2018). This is especially true in the NAM region, which is dominated by high-intensity convective thunderstorms. In this work, we provide insights on pluvial flooding by assessing the propagation of radar precipitation uncertainty into flood modeling in a 2.38-km2 urban catchment in Phoenix, Arizona. Our results, based on simulations for four events during the NAM, can be summarized as follows:

  1. While Stage IV QPEs slightly underestimate gauge rainfall (on average, by 20%), the resulting flood predictions are similar to those obtained with raingauge records, suggesting that Stage IV QPEs are reliable products for modeling urban flooding. MRMS QPEs are positively biased (+51%), and so are the flood peaks and extent of flooded areas simulated with these products. The gauge correction performed in GCMRMS leads to results similar to Stage IV.

  2. The QPE uncertainty simulated by the error model increases with rainfall rate and is rather alike across the three radar products. This is also true for the resulting flood metrics.

  3. As a result of the rainfall–runoff transformation processes and the presence of stormwater infrastructure, uncertainty of flood metrics is reduced and less positively skewed when compared with the QPE uncertainty. The uncertainty may vary significantly depending on the metrics that capture different flooding features (e.g. pipe flow, surface flooding, and pipe surcharge conditions).

  4. All radar products lead to the identification of the same flooded areas within the extent of a few blocks in the basin. This is a promising result for the use of radar-derived QPEs and hydrologic–hydraulic models for urban pluvial flood modeling. However, the exact location and extent of flooded areas may vary significantly across products, suggesting that increasing QPE accuracy can greatly benefit flood management and forecasting agencies.

Supplemental material

Supplemental Material

Download MS Word (4.6 MB)

Acknowledgements

We thank two anonymous reviewers whose comments helped improve the quality of the manuscript. We also thank Dr. Larry Hopper and Mr. Paul Iñiguez from the NWS Phoenix Office for their insights on quality of radar QPEs and selection of storm events in the Phoenix metropolitan regions.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed here.

Additional information

Funding

This work was supported by the National Science Foundation (NSF) Award “SCC: Community-Based Automated Information for Urban Flooding” (Award 1831475), and by the UCAR COMET Award “Assessing the Accuracy of Multi-Radar/Multi-Sensor (MRMS) Precipitation Estimates in the Phoenix Metropolitan Area to Support Flash Flood Warning Operations” (Award SUBAWD001574).

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.