297
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Integrating spatial clustering with predictive modeling of pipe failures in water distribution systems

&
Pages 465-476 | Received 15 Jul 2022, Accepted 09 Feb 2023, Published online: 25 Feb 2023
 

ABSTRACT

Pipe failures in water distribution infrastructure (WDI) have significant economic, environmental and public health impacts. To alleviate these impacts, repair and replacement decisions need to be prioritized to effectively reduce failure rates. In this study, a computational framework is proposed for WDI asset management that couples spatial clustering analysis with predictive modeling of pipe failures. First, hotspot/coldspot clusters of statistically significant high/low failure rates are identified using local indicators of spatial association. Second, the predictive abilities of eight statistical learning techniques are systematically tested, and the best-performing method is implemented to forecast failure rates,(breaks/(km.year)) within different sectors of the WDI. Third, the framework is implemented to compare the impact of adopting proactive instead of reactive pipe replacement strategies. Applying the framework to a real-life, large-scale WDI revealed that spatial clustering of pipe failures improves the accuracy of the prediction models.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability

Some or all data, models, or code generated or used during the study are proprietary or confidential in nature and may only be provided with restrictions (e.g. anonymized data). Pipe failure data and network GIS records were provided by the water utility under a confidentiality agreement between the water utility and the second author. The data are available by request from the authors and may only be provided after obtaining the utility’s approval and undergoing potential anonymization. The computational framework developed in this study was implemented using the Python programming language. The spatial autocorrelation analysis is conducted using the Python Spatial Analysis Library (PySAL) (Rey and Anselin Citation2010). Predictive modeling is conducted by scikit-learn (Pedregosa et al. Citation2011). Other Python libraries used for data analysis and visualization include numpy, pandas, geopandas and matplotlib (Hunter Citation2007; McKinney Citation2010; Van Der Walt, Colbert, and Varoquaux Citation2011).

Supplementary material

Supplemental data for this article can be accessed https://doi.org/10.1080/1573062X.2023.2180393

Additional information

Funding

This work was supported by the University of Texas at Austin Startup Grant and by Cooperative Agreement No. 83595001 awarded by the U.S. Environmental Protection Agency to The University of Texas at Austin. This work has not been formally reviewed by EPA. The views expressed in this document are solely those of the authors and do not necessarily reflect those of the Agency. EPA does not endorse any products or commercial services mentioned in this publication. Partial funding by the National Science Foundation (NSF) awards No. 2015603 and 2015658 is gratefully acknowledged.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 239.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.