Blog

Advancing the analysis of water pipe failures: a probabilistic framework for identifying significant factors | Scientific Reports

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Scientific Reports volume  14, Article number: 19218 (2024 ) Cite this article inspect nylon tube

Water is an essential resource required for various human activities, like drinking, cooking, cleaning, and flushing, and it is transported through various types of water pipes. Water pipes often experience unprecedented failures associated with environmental, economic, and social consequences1,2,3. Regarding environmental concerns, water pipe failure leads to water loss, causing flooding and erosion. In 2014, a steel pipe with a diameter of 762 \(mm\) in California, USA, experienced a failure resulting in multiple walkways and garages flooding. This incident caused extensive damage to hundreds of vehicles, as reported by4.

Similarly, the failure of water pipes can have significant economic consequences. In addition to repairing or replacing the failed pipes, there are indirect costs associated with water loss, property damage, and disruption of services5. For instance, it was reported that repairing and rehabilitating existing water distribution networks (WDNs) in the USA would cost over $1 trillion over the next few decades6. Additionally, Xu et al7 stated that China spent more than 10 billion RMB (or 1.45 billion USD) in 2014 to rehabilitate its WDNs. The failure of pipes poses a social concern as it increases health risks due to the potential contamination of water and the subsequent incidence of waterborne diseases. As per the study by8, the World Health Organization has estimated that globally, 3.41 million deaths occur annually due to water-related diseases.

Numerous factors influencing the failure of water pipes have been reported in the extant literature9,10,11. These factors have been broadly categorized into three: pipe-related, environment-related, and operational-related factors12. Pipe-related factors include age, material, diameter, length, manufacturing flaw, installation quality, and buried depth. Environment-related factors include corrosion, soil type, soil pH, atmospheric temperature, road traffic, soil composition and microbiological agents, and external loads. Similarly, internal pressure, water type, water hammering, and water quality are some of the operational-related factors influencing water pipe failure13,12,13,14,17. However, categorizing these factors is not always clear-cut, and some factors may overlap among different categories. For instance, corrosion can be influenced by both the environment and the material of the pipe. In addition, some factors may interact with each other, leading to complex and interdependent causes of pipe failure.

It is worth noting that the relative importance of the factors influencing pipe failure may vary depending on the specific context and location of the WDN. For instance6 found interval to last break, temperature changes, pipe length, and pipe age as the most significant factors leading to water pipe failure in the Cleveland WDN, USA. On the other hand, pipe material, length, age, and the number of previous failures were found to be the most influencing factors for a WDN located in Seville, Spain18. Therefore, a comprehensive and context-specific understanding of the potential failure factors is necessary to manage and maintain the WDN effectively.

As mentioned earlier, it is evidenced that water pipe failure is a severe problem that leads to several negative impacts. Hence, water pipe failure needs to be prevented as much as possible. One of the ways to mitigate the failure is to obtain the probability of failure of certain pipe material in the network given a particular factor and vice versa. For instance, the probability of cast iron (CI) pipe (i.e., material type) given its age can be obtained, and the probability that a pipe failed at a certain age given its material type is CI can be computed from the historical failure data of a WDN. These kinds of computations are referred to as conditional probabilities and can be derived using Bayes’ theorem. Singh19 and Tchórzewska-Cieślak et al.20 adopted Bayes’ theorem to determine the conditional failure probabilities of pipes in Honolulu’s WDN, USA, and Powiat city’s WDN, Poland, respectively. However, these studies are limited because the bias associated with the failure of historical data of a typical WDN was not addressed. The frequency distribution of the assets (i.e., pipe segments) in a WDN is often correlated with the frequency distribution of the failed assets18. For instance, if 80% of a network consists of ductile iron (DI) and the other 20% is made of several other pipe materials, there is a high possibility that a higher percentage of failure may be associated with DI in the network even if DI is not the most susceptible material. This kind of bias can be eliminated by investigating the frequency distribution of the total assets and that of the failed assets. Suppose a significant difference exists between the two distributions. In that case, the assets can be said to have failed due to the investigating factor and not because of their mere abundance in the network.

Zangenehmadar & Moselhi21 employed the Delphi method with the aid of mean ranking score analysis to prioritize factors influencing the failure of water pipes. The mean ranking score analysis depended on a questionnaire survey that was distributed among WDN experts. Furthermore, El Chanati et al.22, developed performance assessment models for water pipelines. These models were developed using data gathered from questionnaires distributed to water pipeline experts in Qatar. The weights of the factors were determined using four distinct methods: AHP, fuzzy AHP (FAHP), ANP, and fuzzy ANP (FANP). Similar qualitative approaches were adopted by Elshaboury et al.23 to rank the factors contributing to the failure of pipes in WDNs.

It is worth noting that the existing methods in the extant literature for identifying significant factors influencing the failure of pipes in WDNs are subjective in nature, and the results cannot be directly attributed to a specific WDN, as they are based on the general opinion of experts working in the field of WDNs. To fill the existing gaps in the literature, this study proposes a novel framework for analyzing the probability of failure of water pipes in a WDN. Specifically, the objectives of this study are as follows.

To develop a framework (for numerical and categorical data) for selecting the most significant factors affecting water pipe failure.

To quantify the reliability of WDN pipes (assets) in terms of probability computed from historical failure data while minimizing the abundance bias that commonly arises in calculating the failure probabilities using empirical methods.

To validate the proposed frameworks with the historical data of a real WDN.

This study has significant theoretical and practical implications. The proposed framework can advance the understanding of factors affecting water pipe failure in WDNs by providing a more systematic and data-driven approach. Given any material type, the framework can assist water utilities and municipalities in prioritizing their maintenance and replacement efforts by identifying the critical factors contributing to pipe failure. Hence, the contributions of this study to the field of WDNs can be summarized as follows:

Identification of significant factors influencing water pipe failure

The study introduces two new algorithms for selecting the most significant factors affecting water pipe failure. One algorithm is designed for numerical data, while the other is tailored for categorical data.

Development of a new metric for failure probability

A novel method for calculating failure probability is presented, accounting for the number of failures and the total length of pipes for each material. This approach provides a more balanced representation of failure risk, offering a more accurate vulnerability assessment across different pipe categories.

Multi-factor analysis using conditional probability

The study applies conditional probability to analyze multiple factors influencing pipe failures simultaneously. This comprehensive approach allows for a more nuanced understanding of how various factors interact to influence failure likelihood, moving beyond single-factor analyses common in previous studies.

Application of the method to a complex, real-world WDN

The methodology is applied to data from a large and diverse urban water network, demonstrating its applicability and effectiveness in real-world scenarios. This practical application provides relevant insights to practitioners, bridging the gap between theoretical models and utility management needs.

Analyzing the failure probability of water pipes gives insight into the range of factors that could be associated with a high or low probability of failure. This method only relies on historical failure data of a network. For example, it is well-established that the probability of failure will be higher for the short-length pipes in a WDN and lower for longer pipes. Installation errors or ground movement can be the most influential reason for this behavior. A similar trend can be found in the diameter of pipes, i.e., a short diameter tends to have a higher failure probability than a larger diameter. The thicker wall thickness in large-diameter pipes and thinner in small-diameter pipes are primarily responsible for this trend.

Similar trends can be observed when examining additional parameters in any WDN. However, it is important to consider whether these trends are useful if they are consistent across all WDNs. While they may not provide definitive answers, they offer a probabilistic perspective by indicating which range of a specific factor is more likely to result in failures for a given material. However, these failure probability distributions are highly influenced by their frequency distributions. For example, the dataset analysis here shows a higher failure probability for shorter-length pipes, a well-reported fact in the literature24,25. However, the frequency distribution of the length of pipes shows that the WDN has a higher number of shorter-length pipes than pipes with longer lengths. This indicates that the failure distributions are highly influenced by their frequency distribution. Furthermore, the conclusions derived from all parameters’ failure distributions may not point to the root cause. In order to select the parameters whose failure distributions are not completely influenced by their frequency distribution, a generic framework is presented to test the significance of the parameters.

Figure 1 shows the flow diagram of the proposed algorithm. The algorithm can test the significance of numerical and categorical data based on their similarity or dissimilarity of cumulative failure distribution with their cumulative frequency distributions. Here, we compute the cumulative failure distribution from the data of failed pipes and the cumulative failure distribution from all pipes in the network (assets). In Fig. 1, the algorithm first determines the type of data present in the specified field. If the data is identified as numeric, the algorithm subsequently conducts three statistical tests to assess the null hypothesis \({(H}_{o})\) . The three statistical tests are the Kolmogorov–Smirnov test (\(ks\) -test), Cramer-von Mises (\(cv\) -test), and Dunn’s test (\(dn\) -test) to check the goodness of fit of a cumulative failure distribution with its cumulative frequency distribution. The null hypothesis \({(H}_{o})\) in these tests, asset and failure datasets share the same cumulative probability distribution. Similarly, an alternative hypothesis \(({H}_{a})\) states that the datasets of assets and failures are from different probability distributions. The null hypothesis is tested based on the \(p\) -value computed by each statistical test. If the \(p\) -value is smaller than \(0.05\) , the null hypothesis will be rejected, and the alternative hypothesis will hold. The null hypothesis will not be rejected if it equals or exceeds \(0.05\) . These tests’ cumulative scores are called the total \(p\) score \(\left({p}_{t}\right)\) , as shown in Eq. (1)

where \(\left[.\right]\) represents the Iverson brackets, which yields \(1\) when the underlying condition is true and \(0\) otherwise. \({p}_{ks}\) , \({p}_{cv}\) and \({p}_{dn}\) denote the \(p\) -values from \(ks\) , \(cv,\) and \(dn\) statistical tests.

Generic Framework for Testing the Significance of a Parameter.

To establish the significance of a factor, Fig. 1 shows that at least the \(p\) -values of two tests should satisfy the condition (i.e., \(<0.05).\) The factor will be considered insignificant if \({p}_{t}\) is smaller than \(2\) . Figure 2 shows three possible cases while comparing the cumulative distributions of assets and failure data. In the first case, both distributions closely follow each other, meaning cumulative failure distribution is highly influenced by its cumulative frequency distribution, as shown in Fig. 2(a). In this case, \({p}_{t}\) would be less than 2, and the factor will be considered insignificant. In the second case, the cumulative distribution of assets and failure do not follow a similar trend. \({p}_{t}\) would be equal to or greater than 2 in this case, as shown in Fig. 2(b). However, the cumulative failure distribution of failures is below the cumulative frequency distribution, \(\int \left({F}_{a}\left(x\right)-{F}_{f}\left(x\right)\right)dx\ge 0\) , showing that the tested factor is not significant. Figure 2(c) shows the case where \({p}_{t}\) will be equal to or greater than 2 and \(\int \left({F}_{a}\left(x\right)-{F}_{f}\left(x\right)\right)dx<0\) . Here, the tested factor will be considered significant if the cumulative failure distribution shows a higher failure over the range of a factor than its cumulative frequency distribution. The empirical rationale behind this hypothesis is that the probability density distribution of failure data is narrower than that of frequency (assets) data when the number of failures in a specific range of a field is not proportionate to other ranges (proportionate according to failure density distribution). This situation leads to a steeper failure cumulative distribution than its frequency cumulative distribution, as shown in Fig. 2.

Three possible cases while comparing the cumulative distribution of assets and failure data. (a) If \({F}_{f}\left(x\right)\) closely follows the \({F}_{a}(x),\) the factor is considered insignificant, (b) If \({F}_{f}\left(x\right)\) reaches 1 slower than \({F}_{a}(x)\) , the factor is considered insignificant. (c) If \({F}_{f}\left(x\right)\) reaches 1 faster than \({F}_{a}(x)\) , the factor is deemed significant. These three graphs show real scenarios from our case study. (a) and (b) are a case of DI pipes where (c) is of S pipes.

Three statistical significance tests are employed to find the significant categorical fields. For the categorical data, Pearson Chi Test (\(pc\) -test), log likelihood (\(ll\) -test), and modified log likelihood (\(pmll\) - test) tests are used to test the relevance of assets and failure data. The total score \({p}_{t}\) from the three statistical tests are denoted in Eq. (2). To determine the significance of a factor, Fig. 1 demonstrates that the p-values of at least two tests should meet the condition of being less than 0.05.

where \(\left[.\right]\) represents the Iverson bracket, which equals \(1\) when the underlying condition is true and equals \(0\) otherwise. \({p}_{pc}\) , \({p}_{ll}\) and \({p}_{mll}\) denote the \(p\) -values from \(pc\) -, \(ll\ge\) -, and \(mll\) - statistical tests.

Conventionally, the probability of the failure of a certain material pipe is calculated by counting the number of failures in that material pipe divided by the total number of failures in the whole WDN.

\(P\left({M}_{i}\right)\) is the probability of a pipe failure occurring in material i, calculated using the conventional method, \({N}_{i}\) denotes the total number of failures in \(i\) -th material and \({N}_{T}\) the total number of failures among other material pipes in the WDN. This method of computing the probability of failures from historical failure data indicates that each failure in the WDN has an equal likelihood of failure. While this method of calculating failure probabilities offers a broad perspective on the condition of a WDN, it is important to acknowledge that this assumption may oversimplify the complexities and interdependencies involved in computing failure probabilities within a WDN.

The total number of assets (pipe segments) in the water pipeline in Hong Kong is around 1.1 million. The number of failures observed in these assets in the last eleven years (2010–2020) is merely 40 thousand, representing about 4% of total WDN assets. Thus, the relative frequency method in Eq. (3) cannot provide a realistic estimate of failure probabilities for different material pipes. The total number of failures is much lower than the total assets. This means that the probability values from Eq. (3) cannot justify the failure probabilities of 1.1 million assets from only \(4\text{\%}\) of failures. This implies that the failure probabilities need to be inferred from a smaller number of failures (as compared to the total number of assets in the WDN).

The common feature method is proposed to tackle the problem of computing failure probability from the lower percentage of failure out of the total assets. This method incorporates assigned weights instead of counting the pipe failures. The probability of failure with the empirical method can be expressed as:

where \({w}_{i,j}\) is the weight associated with \(j\) -th failed asset of \(i\) -th material, and \({w}_{k}\) is the weight of \(k\) -th failed asset, including all the other material pipes in a WDN. \({w}_{i,j}\) is computed by averaging the failure rate (failure/km/year) for the last 11 years for all materials and assigned this averaged failure rate to each pipe based on its length.

Here, \({f}_{i}\) denotes the average failure rate of \(i\) -th material and \({l}_{ij}\) the length of the pipe \(i\) -th material and \(j\) -th failed material.

Figure 3 shows the failure rate of different pipe materials found in the WDN. In Fig. 3, the failure rate is expressed as the number of failures per kilometer per year. The failure rates provide a rough estimate of the reliability and performance of different material types in a WDN. Also, it identifies the type of pipes where improvements in repair or replacement may be necessary to improve the overall performance of the WDN. We propose utilizing the sum of failure rates for different material types as a weight \(({w}_{i,j})\) , as depicted in Eq. (5). The rationale behind this approach is that higher failure rates indicate a greater probability of failure for a specific material type. Conversely, in the event of a failure within the WDN, it is reasonable to anticipate a higher likelihood of failure in the pipe material with a higher failure rate. The failure rate in a WDN (failures/km/year) allows for an accurate assessment of the network’s condition by considering failures relative to the network length. Analyzing failure data per kilometer per year helps identify vulnerable sections, prioritize maintenance, and optimize renewal strategies based on failure patterns27,28,29. Another metric for failure rate found in the literature is failures/year/km. The difference lies in their interpretation: failures/year/kilometer is more suitable for a spatial context, while failures/kilometer/year is more intuitive from a temporal perspective. The dataset provided by the WSD is historical failure data of the WDN, and failures/kilometer/year appear to be a better candidate for assessing the reliability of different pipe materials in the temporal context.

Failure rates among different pipe materials in the last eleven years.

Figure 4(a) presents the material distribution within the network, representing the percentages of each material. Meanwhile, Fig. 4(b) and (c) display pie charts illustrating the probability of failures in the Water Distribution Network (WDN) using conventional and proposed methods, respectively. In Fig. 4(b), it is observed that pipes made of galvanized iron (GI) are more prone to failures compared to pipes of other materials, while the probability of failure for cast iron (CI) and steel (S) pipes is relatively lower. However, Fig. 4(c) reveals a significant difference, showing that CI pipes may experience three times more failures than S pipes. This disparity in failure probabilities between CI and S pipes results from the proposed method, which incorporates historical data and failure rates. The higher failure rate experienced by GI pipes contributes to their elevated failure probability, as determined by Eq. (4).

(a) Pie chart of the constituent of HK’s WDN; all constituent portion is shown in percentage. Pie Chart of the probability of failure of each material pipe in the WDN using the (b) common feature method and (c) the proposed method.

Calculating failure probabilities based on historical data emphasizes the extent of failures in specific pipe materials within the network. However, it offers only a surface-level understanding of failures across different materials. Considering that the number of pipes in the WDN can impact the failure probabilities is justifiable. In other words, if the failure of pipes follows a uniform univariate random process, it is likely that the greater the number of pipes made of a particular material in the WDN, the higher the likelihood of failure for that material. However, water pipeline failures are highly complicated and multivariate random processes. Moreover, deducing or concluding from the univariate failure probability analysis could not suffice for an informed decision.

The failure probabilities obtained from historical data using Eq. (4) do not account for the influence of intrinsic, extrinsic, or operational factors of the pipes. To address this limitation, conditional probability offers a means to redistribute the likelihood of failure in pipes made of a specific material, considering various factors. In this context, conditional probability can be interpreted as a normalized failure probability relative to the total failure probability of pipes belonging to a particular material type, as demonstrated in Eq. (6).

where \(P\left({F}_{l}|{M}_{i}\right)\) is the conditional probability of failure due to \(l\) -th factor given the pipe of \(i\) -th material, and \(P({F}_{l}\cap {M}_{i})\) is the joint probability of failure due \(l\) -th factor and \(i\) -th material type.

Using conditional probability enables the evaluation of failure probability for a specific factor range, considering the material of the pipes within the WDN. This approach provides a normalized metric for analyzing pipe failures, allowing for comparisons across different materials on a consistent scale ranging from 0 to 1. By employing this normalized metric, it becomes possible to analyze Polyethylene (PE) and Cast Iron (CI) pipes on an equal scale, regardless of their varying numbers within the WDN. Figure 4(a) illustrates this comparison, where PE pipes constitute approximately 42% of the WDN while CI pipes comprise only around 5%.

The pipeline data analyzed in this study is provided by the Water Supply Department (WSD) of Hong Kong. WSD is a governmental department responsible for managing, installing, and repairing water infrastructure, including water reservoirs, pumping stations, and water pipelines in Hong Kong. Hong Kong, positioned in the southern part of China, is a coastal city that shares its borders with Guangdong Province. It is situated at the mouth of the Pearl River Delta, south of Shenzhen. Geographically, Hong Kong is divided into three main regions: Hong Kong Island, Kowloon, and the New Territories. The city has a hilly and mountainous terrain with steep slopes and lowlands in the northern part of Hong Kong. It is famous for its highest population density, around 6300 persons per square kilometer. This encourages the governments to reclaim the land area for residential and economic zones. Around 70 km-square of land in Hong Kong is sea-reclaimed, about 7% of Hong Kong’s surface area (Development Bureau and Construction Industry30.

The data obtained from HK WSD encompasses a comprehensive 11-year record (2010–2020) of water pipe failures, including leaks and bursts, as displayed in Fig. 5. A leak is characterized by a smaller, often undetected water loss that may not immediately disrupt service, typically occurring at joints, fittings, or through small cracks. Leaks can persist for extended periods and are often identified during routine inspections. In contrast, a burst is a more severe failure involving a significant and sudden water loss, usually resulting from a complete fracture or large rupture in the pipe. Bursts often lead to immediate service disruptions and require urgent repair31,26,33.

Geographical locations of failures in HK WDN from 2010–2020. (a) Leaks. (b) Bursts. The red line denotes the Mass Transit Railway (MTR), and the blue line represents the light rail line. This map was generated from the data provided by WSD HK using ArcGIS Pro 3.0 accesible at Esri official website (Esri26).

The WSD database presents this information in a tabular format, capturing intrinsic and extrinsic parameters of each asset (pipe segment), along with details on the type, cause, and nature of the failures. The WDN in Hong Kong spans a total length of approximately 8000 km. Within this network, around 80% is dedicated to distributing fresh water, while the remaining 20% serves as conduits for saltwater supply to consumers. The primary constituents of Hong Kong’s WDN consist of pipes made from polyethylene (PE), galvanized iron (GI), and ductile iron (DI), accounting for approximately 42%, 23%, and 21%, respectively. The remaining 11% comprises steel (S), cast iron (CI), and other types of pipes, as depicted in Fig. 4(a).

HK’s WDN comprises about 1.1 million pipe segments, treated as individual pipe segments or assets in this study based on the proprietary data from WSD HK. The water supply department (WSD) provides the data associated with these pipes. The data provides information about the network, which is grouped into pipe-related, environment-related, and operation-related factors. Data cleaning was done to select relevant columns out of the 150 columns in the raw data.

Numerical data was transformed into ordinal data using a binning approach to facilitate the computation of failure probabilities. The K-means algorithm was employed to determine optimal bin ranges, resulting in variable bin lengths tailored to the data distribution34,35. This automated clustering method ensures that the discretization process captures meaningful patterns in the continuous variables. Figure 6 illustrates the outcomes of this K-means-based clustering for various numerical factors, including age, diameter, traffic (AADT), distance from roads and railways, and pressure. Each subplot demonstrates how the continuous data has been effectively categorized into distinct, data-driven bins, providing a foundation for subsequent probability calculations and analysis. K-means binning strategy ensures automatic binning of numerical data and forms the bins based on the data spread. This feature enables automatic binning of highly sparse data, e.g., water pipes available in the market are discrete diameters, making the diameter data sparse.

Automated clustering of numerical data using K-Means binning. (a) age, (b) diameter of Group A, (c) diameter of Group B, (d) distance from the roads, (e) annual average daily traffic, (g) distance from the mass transit railway system, and (g) operating pressure.

To convert numerical data into categorical data using the K-means algorithm, one begins by choosing the number of clusters \(k\) , and initializing \(k\) centroids (here, \(k\) was set to 6). Each data point \({x}_{i}\) is then assigned to the nearest centroid using the Euclidean distance, defined as \({c}_{i}= \text{arg mi}{\text{n}}_{j} {\left| {x}_{i} - {\mu }_{j} \right|}^{2}\) . Subsequently, centroids are recalculated as the mean of all points in each cluster, \({\mu }_{j} =\frac{1}{|{C}_{j}|} {\sum }_{{x}_{i}\in {C}_{j}}{x}_{i}\) . This assignment and update process is repeated until convergence. Finally, each data point is assigned to a cluster, which can then be labeled with categorical labels, thereby transforming numerical data into categorical data.

Further, the probability of failures of water pipes due to a factor given its material is computed using Eq. (6) and presented in tabular forms in the subsequent sections. The last column of Tables1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 2122 represents the cumulative length of each row.

\(({l}_{r})\) , with the percentage of the cumulative length depicted in parentheses for each cell. It should be noted that for the probability of a certain factor given a material, the sum of the rows for each material will be 1. Conversely, for the probability of failure of a certain pipe material given a certain factor, the sum of the rows will be 1 for each factor range. The reason for this is rooted in the nature of conditional probabilities. When calculating the probability of a factor being given a material, we consider all possible factor outcomes for that specific material. These outcomes must collectively account for 100% of the possibilities, hence summing to 1.

Similarly, when calculating the probability of a pipe material failing given a certain factor, we consider all possible material outcomes for that specific factor range. Again, these outcomes must represent all possibilities, summing to 1. This summing pattern is a direct result of the axioms of probability, specifically that the sum of probabilities for all possible outcomes in a given scenario must equal 1. In this study, each row represents a complete set of possible outcomes for a given condition (either a specific material or a specific factor range), thus adhering to this fundamental principle of probability theory.

Table 1 illustrates the trends of failure probability after a certain duration for different pipe materials, while Table 2 presents the probability of failure of different pipe materials given their age. In Table 1, the rows represent the different materials, and the columns represent various age ranges. The values in the table are interpreted as the probability of failure. CI pipes show a low probability of failure at an early age, and a higher probability of failures can be expected at the end of their service life. The increasing trend of failure probabilities of CI pipes can be associated with corrosion, as damage due to corrosion increases with time. This trend agrees with previous studies36,31,38. DI shows the highest failure probability within 15–27 years of installation, an observation that tallies with the result reported by Singh19.

The probability of experiencing a failure for PE, GI, and S pipes was found to be highest within the first 15 years of their installation. This trend gradually decreases in the later part of the pipes’ age. One of the reasons for the high failure probability of the pipes at an early age may be due to installation and manufacturing flaws. This justification is well established in the extant literature5,11. According to Yang et al.39, an inefficient tapping process during the installation of PE pipes is one of the problems that can cause catastrophic failure at an early stage. Yang et al.39 reported a burnt mark on a failed PVC pipe hole attributed to high temperature during tapping. This poses a significant issue since PE pipes cannot dissipate heat quickly when exposed to high temperatures. Therefore, it is essential to ensure proper tapping processes when installing plastic pipes to prevent catastrophic failures. Plastic pipes located in Honolulu WDN were also found to exhibit the highest failure probabilities within 20 years of installation19.

Table 2 presents the failure probability for various pipe materials based on their age, which aligns with the findings in Table 1. The first row suggests that GI and PE pipes are more likely to fail during their early stages in a WDN than other materials. Except for the last row, all other rows indicate that GI pipes have a higher probability of failure than any other material. Additionally, CI pipes are expected to have more failures after 62 years of service life.

The pipes are divided into two groups based on their diameter distribution. Group A consists of GI and PE pipes with diameters ranging from 20 to 175 mm, while Group B consists of CI, DI, and S pipes with diameters ranging from 20 to 500 mm. Given the material types, the probability of failure due to pipe diameter is presented in Tables 3 and 5. Generally, the probability of failure decreases in GI, PE, CI, and DI pipes as the diameter increases. This finding is consistent with previous studies and may be attributed to several factors5,40,41. Firstly, small-diameter pipes exhibit thinner wall thickness, making them more susceptible to damage from external factors such as ground and soil movements or nearby construction activities.

Further, small-diameter pipes are more prone to clogging and blockages, potentially increasing the pressure and stress on the pipes causing their ultimate failure12. Conversely, S pipes exhibit a higher probability of failure in larger diameters. This is primarily because these pipes are often used as water mains, transporting substantial volumes of water and enduring higher pressures, leading to increased structural stress and potential failure. Additionally, large-diameter pipes of this type tend to be older and more prone to corrosion and other forms of deterioration, further elevating the likelihood of failure11.

Tables 4 and 6 show the probability of failure based on material types for given pipe diameters. The results indicate that GI and DI pipes exhibit the highest failure probability regardless of diameter, except for pipes exceeding diameters of 150 mm and 450 mm, respectively. This suggests that GI and DI pipes may need more frequent inspections and maintenance to reduce the risk of failures.

Tables 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 present information regarding the environment-related factors of HK’s WDN, including factors such as land use (urban, rural, waterbody, and sea), soil resistivity, different geographical locations, and the effect of dynamic loading in HK.

Table 7 shows the probability of failure due to land use given pipe material. The results typify that urban areas have higher failure probabilities for most pipe materials, with CI, PE, DI, S, and GI having probabilities of 0.96, 0.73, 0.90, 0.81, and 0.48, respectively. The high density of civil and electrical infrastructure in urban areas may contribute to these failures, possibly due to dynamic loading from traffic, stray current from electrical infrastructure, or malpractices during construction or repair work42. GI pipes show similar failure probabilities in both rural and urban areas.

The failure probability of the material types given land use is shown in Table 8. According to the results, GI and PE have the highest failure probabilities in the urban and rural areas of HK. At the same time, GI and S pipes exhibited the highest failure probabilities for pipes located in waterbody and sea. This can be attributed to the higher chances of corrosion in metallic pipes used in wet areas. Exposure to elements such as saltwater can exacerbate the corrosion process, leading to higher failure probabilities43.

The failure probability of the pipes due to soil corrosivity, given their material types, is reported in Table 9. A closer look at the table indicates that pipes in mildly-corrosive soil exhibited the highest failure probability. Although one would have expected that pipes in highly-corrosive soil may experience the most failures44, it is essential to note that mildly-corrosive soil may cause gradual and prolonged damage to pipes, leading to an increased probability of failure over time. Conversely, highly corrosive soil may cause rapid and severe damage to pipes, resulting in early failures that are quickly detected and repaired. Therefore, the results highlight the need for regular monitoring and maintenance of pipes located in all types of corrosive soil to prevent failures and ensure the longevity of the water distribution network. WSD HK uses Linear Polarization Resistance (LPR) as a metric to define the corrosivity of the soil.

Moreover, the probability of failure of pipe materials, given their soil corrosivity, is indicated in Table 10. The results show that GI and PE have the highest failure probabilities for highly and mildly corrosive soil pipes. This suggests that the susceptibility to corrosion is a significant factor affecting the performance of GI and PE pipes, regardless of the level of soil corrosivity. This is consistent with previous studies showing that corrosion is a significant cause of failure in metal pipes12,45, 46. The higher failure probabilities in highly-corrosive soil can be attributed to the accelerated corrosion rate caused by the corrosive soil environment. Although PE is a plastic pipe that is typically resistant to corrosion5, it can still be impacted by environmental factors such as UV radiation, high temperatures, and chemical exposure, leading to the formation of weak spots that are more susceptible to corrosion.

Hong Kong’s surface area keeps increasing due to several reclamation projects (Development Bureau and Construction Industry26). Most of the reclaimed areas were occupied by sea before the reclamation. Table 11 shows the failure probabilities of pipes due to reclamation given their pipe materials, while Table 12 presents the failure probability due to the pipe materials given their reclamation status. N and Y represent pipes in non-reclaimed and reclaimed areas, respectively. DI, CI, and S pipes were the most affected, as they exhibit failure probability of 0.37, 0.33, and 0.27, respectively. The higher failure in the sea-reclaimed land associated with these pipes can be attributed to the higher susceptibility of these materials to external stresses and movements, such as settlement and ground deformation, common in reclaimed areas47. These stresses can cause damage to the pipes, such as cracks or leaks, which increases the likelihood of failure. Additionally, the changes in soil characteristics and composition in reclaimed areas can lead to increased corrosion rates, further contributing to the higher failure probabilities of these materials in such areas48. Water authorities need to consider these factors when selecting materials and designing pipelines in reclaimed areas to ensure the long-term reliability and safety of the water distribution network.

Hong Kong is known for having one of the highest population densities globally, resulting in a highly concentrated urban infrastructure network to meet the population’s needs49,50. This network comprises surface and buried infrastructures, including water pipelines, gas pipelines, power cables, roads, underground/surface transit railways, and traction power lines. Due to vehicular traffic and transit railway networks, the buried pipelines experience continuous dynamic loading and unloading cycles.

Table 13 presents the failure probability of the pipes located within 25 \(m\) from the road, given the pipe materials. The result demonstrates that brittle pipes like S and CI have a higher probability of failure if they are within a 10-m distance of roads. The probability of failure decreases as the distance between the pipes and roads increases. On the other hand, materials like DI and PE are comparatively flexible and exhibit no significant increase or decrease in the failure probability across various distance ranges, except for the last row (20–25 \(m\) ). The failure probability for different pipe materials given various ranges of the distance between roads and water pipes is reported in Table 14. According to the result, GI and PE pipes do not seem to be affected by the distance from the road as their failure probabilities increase as the pipes get farther from the road, showing that other factors may have influenced the higher failure probabilities.

The location of buried pipes is critical in determining their susceptibility to failure. In some cases, pipes are buried directly above highways, raising concerns about vehicular traffic’s impact on their integrity. The effect of annual average daily traffic (AADT) on pipe failure probability was examined to investigate this issue. The probability of failure due to AADT given the pipe materials and the failure probability of the various pipe materials given the AADT are recorded in Table 15 and Table 16, respectively. The results indicate that all the pipe materials experienced the highest failure probability when subjected to AADT of about 8–16 \(\times {10}^{3}\) AADT suggests that vehicular traffic intensity significantly impacts the pipes’ structural integrity42. AADT is known to cause dynamic loading and unloading on buried pipelines, which can result in cyclic stresses that may weaken the pipes over time5. Moreover, the magnitude and frequency of traffic-induced loads can be influenced by vehicle speed, axle load, and pavement conditions51.

Hong Kong’s comprehensive mass transit railway (MTR) system is a crucial transportation network, connecting most population centers through surface and subway trains. The railway system employs three primary power contact lines, namely 550 V DC, 1.5 kV DC, and 25 kV AC. This section investigates the influence of the MTR system on pipe failure.

Table 17 shows the probability of failure when the pipe is located at certain distances from the MTR system, given the pipe materials. In contrast, Table 18 represents the failure probability of the different pipe materials, given their proximity to the MTR system. Generally, the result shows that the proximity of the MTR system affects the pipe failure for metallic pipes such as CI, DI, and GI. The increased failure probability when the MTR system is close to the pipe’s location can be attributed to several reasons. First, the construction and maintenance of the railway infrastructure, such as tunnels and underground stations, can cause ground settlement and soil displacement, which can, in turn, damage and even rupture buried pipes. Second, the operation of trains, particularly heavy freight trains, can cause ground vibration and dynamic loading on buried pipes, leading to fatigue failure over time11. Third, the traction power lines that supply power to the trains can create electric and magnetic fields that may induce stray currents in nearby buried pipes, leading to corrosion and eventual failure52.

Based on the framework developed in this study, internal water pressure and water type are the two significant factors influencing water pipe failure in HK WDN.

The failure probability due to internal water pressure given the pipe materials is reported in Table 19. It reveals that the probability of failure increases for all pipe materials as the internal water pressure rises. This trend continues until the pressure reaches a range of 83–106 psi, where the probability of failure decreases. While this shows that an increase in pressure increases the likelihood of failure to a certain point, the decrease in failure probability of the pipes beyond 106 psi may be attributed to some reasons. It is possible that the pressure range is not uniformly distributed, and certain sections of the pipes are subjected to higher pressures than others. In such cases, the failure probability may increase in those sections where the pressure is highest but decrease in other sections where the pressure is lower5,10. It is also important to note that the decrease in failure probability may not be linear and that other factors may be at play that influence the behavior of the pipes under different pressure levels. Table 20 presents the pipe materials’ failure probability given the water pressure. As seen in Table 20, GI and PE pipes are more susceptible to failure than other pipes in the WDN. This highlights the need for proper material selection for pipes likely to experience higher water pressure levels.

The pipes in HK’s WDN either carry freshwater or saltwater. This section investigates the influence of water type on the probability of pipes’ failure. The probability of failure due to water type given the pipe materials is provided in Table 21, which denotes that the probability of failure in saltwater-carrying pipes is higher than that of freshwater pipes. This could be attributed to several factors, such as the chemical composition of salt water, which may cause corrosion or other forms of degradation in the pipes over time5. Moreover, saltwater pipes may be exposed to more extreme environmental conditions, such as tidal movements or high levels of sunlight, which can further contribute to their deterioration and subsequent failure53.

On the other hand, Table 22 indicates the failure probability of the different types of pipe material, given that their water type shows GI to be more susceptible to failure than other pipe materials, followed by PE for freshwater pipes. However, PE exhibited the highest failure probability for saltwater. Saltwater can cause PE pipes to swell and absorb water, reducing the pipe’s structural integrity. Additionally, saltwater can cause the material to become brittle over time, leading to cracks and fractures that increase the likelihood of failure54.

The proposed framework for analyzing the probability of failure of water pipes in a WDN has significant theoretical implications. By selecting the critical factors affecting failure through two different algorithms, one for numerical and one for categorical data, the framework provides a more comprehensive understanding of the factors influencing water pipe failure. The algorithms utilized in the framework ensure precise and accurate identification of the critical factors, thereby enhancing the reliability and validity of the results obtained. Further, the framework employs Bayes’ theorem and the likelihood feature method to determine the conditional failure probabilities of pipes, addressing the bias associated with historical failure data. This approach can be applied to any WDN or asset experiencing failure, regardless of size and complexity, making it a valuable tool for researchers and practitioners in water infrastructure management.

The proposed framework offers significant practical implications for managing and maintaining WDNs. At its core, this approach enables a more nuanced and accurate assessment of pipe failure probabilities, which in turn facilitates more informed decision-making processes across various aspects of WDN management.

One of the primary advantages of this framework is its ability to identify and prioritize critical factors influencing water pipe failures. Water utility managers can adopt a more targeted and efficient maintenance strategy by pinpointing these key determinants. This focused approach allows for the optimal allocation of often limited resources, potentially leading to substantial cost savings while simultaneously enhancing the overall reliability of the network. The framework’s capacity to provide a more reliable assessment of pipe failure probabilities is particularly valuable for asset management. Armed with this refined information, decision-makers can make more informed choices regarding pipe replacement or rehabilitation. This enhanced predictive capability allows for prioritizing high-risk assets, ensuring that maintenance efforts are directed where they are most needed. Consequently, this approach can significantly reduce the risk of unexpected failures, minimizing service disruptions and associated costs.

Moreover, the probabilistic nature of this framework aligns well with risk-based asset management strategies. It provides a quantitative basis for risk assessment, allowing managers to balance the likelihood of failure against the potential consequences. This risk-based approach can be instrumental in justifying budget allocations and investment decisions, as it provides a clear, data-driven rationale for prioritizing certain assets or areas of the network over others. The framework also holds potential for long-term planning and scenario analysis. By understanding how different factors contribute to failure probabilities, managers can model various future scenarios, such as the effects of climate change or urban development on their networks. This forward-looking capability can inform long-term infrastructure planning and investment strategies, helping utilities to build more resilient and sustainable networks.

The negative impacts of water pipe failure are numerous, including flooding, disruption of business and services, erosion, increased maintenance, and rehabilitation costs. The factors affecting the failure of the pipes are often categorized into pipe-related, environment-related, and operation-related factors. Therefore, it is necessary to mitigate the failure of these pipes by analyzing the historical failure data of a WDN to estimate the failure probability of each pipe material and the factors influencing the failure. The existing literature is limited by the absence of a systematic selection of significant factors affecting water pipes and addressing the bias associated with the frequency distribution of assets in a WDN. Hence, this study aims to fill these gaps by proposing a new framework that systematically identifies the most significant factors influencing the failure of pipes in a particular WDN. The framework is composed of two algorithms, with each algorithm consisting of three statistical tests. One algorithm is meant for numerical data, while the other is developed for categorical data.

Applying the proposed framework to HK WDN shows that 10 out of 21 factors influencing pipe failure are significant. For the pipe-related factors, it was generally found that the failure probability of the pipes increases with age, except for brittle pipes such as PE, which experience a higher failure probability within 15 years of installation. The results also indicate that smaller-diameter pipes are prone to more failure than larger ones. The significant environment-related factors include land use, soil corrosivity, reclaimed area, distance from the road, traffic, and distance from mass transit railway, and their associated justification for influencing pipe failure has been discussed. Concerning the operation-related factors, failure probability was found to increase generally with internal pressure, provided all other conditions remain constant. The result also typifies that higher failure probabilities are associated with saltwater pipes than freshwater pipes. Regarding the pipe materials, GI and PE were the most vulnerable pipes in the network.

Although the results of this analysis are directly applicable to the case study, the framework developed in this study can be applied to any other WDN. By utilizing the proposed framework, decision-makers in the water infrastructure sector can prioritize the most influential failure factors in their networks and allocate resources effectively, thereby reducing the risk of failures and minimizing the associated economic, environmental, and social consequences. While the framework developed in this study contributes to the field of water pipe failure analysis, it is noted that future research can be conducted to explore integrating the framework into existing asset management systems to provide a more holistic approach to water pipe management.

The data used in this study is confidential and cannot be shared with the public due to the confidentiality agreement between the authors and the funding agency. This agreement ensures that the data remains confidential and is only used for the purpose of the research project. As such, the authors do not have the right to share the data with the public. We have taken steps to ensure that the methods used in this study are clear and replicable.

Fan, X., Zhang, X., Yu, A., Speitel, M. & Yu, X. Assessment of the impacts of climat change on water supply system pipe failures. Sci. Rep. 13(1), 1–21. https://doi.org/10.1038/s41598-023-33548-7 (2023).

Gheibi, M. et al. A risk-based soft sensor for failure rate monitoring in water distribution network via adaptive neuro-fuzzy interference systems. Sci. Rep. 13(1), 1–27. https://doi.org/10.1038/s41598-023-38620-w (2023).

Article  ADS  CAS  Google Scholar 

Lee, J. S., Zeng, W., Lambert, M., Hilditch, T. & Gong, J. Fatigue analysis of metallic-plastic-metallic pipeline systems: A numerical study. Results Eng. https://doi.org/10.1016/j.rineng.2023.100986 (2023).

Piratla, Kr et al. Empirical Analysis of Water-Main Failure Consequences. Procedia Eng. Https://doi.org/10.1016/J.Proeng.2015.08.507 (2015).

Taiwo, R., Shaban, I. A. & Zayed, T. Development of sustainable water infrastructure: A proper understanding of water pipe failure. J. Cleaner Prod. https://doi.org/10.1016/j.jclepro.2023.136653 (2023).

Fan, X., Wang, X., Zhang, X. & Xiong, A. S. C. E. Machine learning based water pipe failure prediction: The effects of engineering, geology, climate and socio-economic factors. Reliability Eng. Syst. Safety https://doi.org/10.1016/j.ress.2021.108185 (2022).

Xu, M., Mei, Z., Luo, S. & Tan, Y. Optimization algorithms for construction site layout planning: A systematic literature review. Eng. Constr.Arch. Manag. 27(8), 1913–1938. https://doi.org/10.1108/ECAM-08-2019-0457 (2020).

Paradkar, A. B. An evaluation of failure modes for cast iron and ductile iron water pipes (Issue December). https://rc.library.uta.edu/uta-ir/handle/10106/11660. (2012).

Mortula, M. M. et al. Improving Resiliency of Water Supply System in Arid Regions: Integrating Centrality and Hydraulic Vulnerability. Journal of Management in Engineering 36(5), 1–13. https://doi.org/10.1061/(asce)me.1943-5479.0000817 (2020).

Taiwo, R., Ben Seghier, M. E. A. & Zayed, T. Toward Sustainable Water Infrastructure: The State-Of-The-Art for Modeling the Failure Probability of Water Pipes. Water Resourc. Res. https://doi.org/10.1029/2022WR033256 (2023).

Shaban, I. A., Eltoukhy, A. E. E. & Zayed, T. Systematic and scientometric analyses of predictors for modelling water pipes deterioration. Autom. Constr. https://doi.org/10.1016/j.autcon.2022.104710 (2023).

Barton, N. A., Farewell, T. S., Hallett, S. H. & Acland, T. F. Improving pipe failure predictions: Factors effecting pipe failure in drinking water networks. Water Res. https://doi.org/10.1016/j.watres.2019.114926 (2019).

Creaco, E., Giudicianni, C. & Mottahedin, A. Improved community detection for WDN partitioning in the dual topology based on segments and valves. J. Hydroinform. 25(4), 1341–1357. https://doi.org/10.2166/hydro.2023.209 (2023).

Dawood, T., Elwakil, E., Novoa, H. M. & Delgado, J. F. G. Soft computing for modeling pipeline risk index under uncertainty. Eng. Fail. Anal. https://doi.org/10.1016/j.engfailanal.2020.104949 (2020).

Ellison, D. & Spencer, D. The true causes of ac pipe failures—according to the data. Pipelines 2016, 637–647. https://doi.org/10.1061/9780784479957.058 (2016).

Mottahedin, A., Giudicianni, C., Brentan, B. & Creaco, E. Clustering-Based Maintenance Strategy of Isolation Valves in Water Distribution Networks. ACS EST Water 4, 1798–1807 (2024).

Zhou, S., Ng, S. T., Yang, Y. & Xu, J. F. Delineating infrastructure failure interdependencies and associated stakeholders through news mining: The case of hong kong’s water pipe bursts. J. Manag. Eng. 36(5), 04020060. https://doi.org/10.1061/(asce)me.1943-5479.0000821 (2020).

Robles-Velasco, A., Cortés, P., Muñuzuri, J. & Onieva, L. Prediction of pipe failures in water supply networks using logistic regression and support vector classification. Reliability Eng. Syst. Safety https://doi.org/10.1016/j.ress.2019.106754 (2020).

Singh, A. Bayesian analysis for causes of failure at a water utility. Built Environ.Project Asset Manag. 1(2), 195–210. https://doi.org/10.1108/20441241111180433 (2011).

Tchórzewska-Cieślak, B., Rak, J. R. & Szpak, D. Bayesian Inference in the Analysis of the Failure Risk of the Water Supply Network. J. Konbin 49(3), 433–450. https://doi.org/10.2478/jok-2019-0066 (2019).

Zangenehmadar, Z. & Moselhi, O. Prioritizing deterioration factors of water pipelines using Delphi method. Measurement 90, 491–499. https://doi.org/10.1016/j.measurement.2016.05.001 (2016).

El Chanati, H. et al. Multi-criteria decision making models for water pipelines. J. Perform. Construct. Facilities 30(4), 1–12. https://doi.org/10.1061/(asce)cf.1943-5509.0000842 (2016).

Elshaboury, N., Attia, T. & Marzouk, M. Comparison of Several Aggregation Techniques for Deriving Analytic Network Process Weights. Water Resourc. Manag. 34(15), 4901–4919. https://doi.org/10.1007/s11269-020-02698-y (2020).

Taiwo, R., Yussif, A. M., Ben Seghier, M. E. A. & Zayed, T. Explainable ensemble models for predicting wall thickness loss of water pipes. Ain Shams Eng. J. https://doi.org/10.1016/j.asej.2024.102630 (2024).

Taiwo, R., Zayed, T. & Ben Seghier, M. E. A. Integrated intelligent models for predicting water pipe failure probability. Alexandria Eng. J. 86, 243–257. https://doi.org/10.1016/j.aej.2023.11.047 (2024).

Esri Canada. Desktop GIS Software | Mapping Analytics | ArcGIS Pro. Retrieved July 27, 2024, https://www.esri.com/en-us/arcgis/products/arcgis-pro/overview. (2024).

Gheibi, M. et al. Assessing the failures in water distribution networks using a combination of geographic information system, EPANET 2, and descriptive statistical analysis: A case study. Sustain. Water Resour. Manag. 8(2), 47. https://doi.org/10.1007/s40899-022-00638-9 (2022).

Nugroho, W., Utomo, C. & Iriawan, N. A bayesian pipe failure prediction for optimizing pipe renewal time in water distribution networks. Infrastructures 7(10), 136. https://doi.org/10.3390/infrastructures7100136 (2022).

Nunes, R., Arraut, E. & Pimentel, M. Risk assessment model for the renewal of water distribution networks: A practical approach. Water (Switzerland) 15(8), 1509. https://doi.org/10.3390/w15081509 (2023).

Development Bureau and Construction Industry. Guildeline for considerate construction sites award scheme. https://devb.gov.hk/filemanager/en/content_175/Guideline_23rd_CCSAS_NonPublic_Works_Sites_English_20160229.pdf. (2017).

Abd Rahman, N., Muhammad, N. S. & Wan Mohtar, W. H. M. Evolution of research on water leakage control strategies: where are we now?. Urban Water J. https://doi.org/10.1080/1573062X.2018.1547773 (2018).

Lo Presti, J. et al. Combining clustering and regularised neural network for burst detection and localization and flow/pressure sensor placement in water distribution networks. J. Water Process Eng. https://doi.org/10.1016/j.jwpe.2024.105473 (2024).

Wéber, R., Huzsvár, T. & Hős, C. Vulnerability analysis of water distribution networks to accidental pipe burst. Water Res. https://doi.org/10.1016/j.watres.2020.116178 (2020).

Hartigan, J. A. & Wong, M. A. Algorithm AS 136: A k-means clustering algorithm. J. R. Stat. Soc. 28(1), 100–108 (1979).

KBinsDiscretizer. In scikit-learn. Retrieved June 27, 2024, from https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.KBinsDiscretizer.html (n.d.).

Folkman, S. Water Main Break Rates In the USA and Canada: A Comprehensive Study. Mechanical and Aerospace Engineering Faculty Publications. Paper 174. https://digitalcommons.usu.edu/mae_facpub/174 (2018).

Folkman, S., Rice, J., Sorenson, A. & Braithwaite, N. Survey of water main failures in the United States and Canada. J. Am. Water Works Assoc. 104(10), 70–79. https://doi.org/10.5942/jawwa.2012.104.0135 (2012).

Rezaei, H., Ryan, B. & Stoianov, I. Pipe failure analysis and impact of dynamic hydraulic conditions in water supply networks. Proc. Eng. 119(1), 253–262. https://doi.org/10.1016/j.proeng.2015.08.883 (2015).

Yang, W., Xue, Y., Wu, S., Xiao, Y. & Zhou, M. Performance investigation and environmental application of basic oxygen furnace slag – Rice husk ash based composite cementitious materials. Constr. Build. Mater. https://doi.org/10.1016/j.conbuildmat.2016.07.051 (2016).

Hekmati, N., Rahman, M. M., Gorjian, N., Rameezdeen, R. & Chow, C. W. K. Relationship between environmental factors and water pipe failure: an open access data study. SN Appl. Sci. 2(11), 1–12. https://doi.org/10.1007/s42452-020-03581-6 (2020).

Rajeev, P., Kodikara, J., Robert, D., Zeman, P., & Rajani, B. Factors contributing to large diameter water pipe failure. Water Asset Management International, https://researchrepository.rmit.edu.au/esploro/outputs/9921862143101341, (2014).

Nguyen, H. et al. Joint maintenance planning of deteriorating co-located road and water infrastructures with interdependencies. Reliability Eng. Syst. Safety https://doi.org/10.1016/j.ress.2022.108678 (2022).

Sadiq, R., Kleiner, Y. & Rajani, B. Aggregative risk analysis for water quality failure in distribution networks. J. Water Supply Res. Technol. Aqua 53(4), 241–261. https://doi.org/10.2166/aqua.2004.0020 (2004).

Arriba-Rodriguez, LD, Villanueva-Balse, J., Ortega-Fernandez, F. & Rodriguez-Perez, F. Methods to Evaluate Corrosion in Buried Steel Structures: A Review. metals https://doi.org/10.3390/met850334 (2018).

Doyle, G., Seica, M. V. & Grabinsky, M. W. F. The role of soil in the external corrosion of cast iron water mains in Toronto Canada. Can. Geotech. J. 40(2), 225–236. https://doi.org/10.1139/t02-106 (2003).

Pȩkala, A. & Pietrucha-Urbanik, K. The influence of the soil environment on the corrosivity of failure infrastructure - Case study of the exemplary water network. Arch. Civil Eng. 64(1), 133–144. https://doi.org/10.2478/ace-2018-0009 (2018).

Trickey, S. A., Moore, I. D. & Balkaya, M. Parametric study of frost-induced bending moments in buried cast iron water pipes. Tunnell. Underground Space Technol. https://doi.org/10.1016/j.tust.2015.10.028 (2016).

Demissie, G., Tesfamariam, S. & Sadiq, R. Considering soil parameters in prediction of remaining service life of metallic Pipes: Bayesian belief network model. J. Pipeline Syst. Eng. Pract. https://doi.org/10.1061/(asce)ps.1949-1204.0000229 (2016).

Taiwo, R., Wang, K. C., Olanrewaju, O. I., Tariq, S., Abimbola, O. T., Mehmood, I., & Zayed, T. An Analysis of Employee Motivation in the Construction Industry: The Case of Hong Kong. ICEC 2022, 22(1), 11. https://doi.org/10.3390/engproc2022022011, (2022).

Yeung, H. C., Ridwan, T., Tariq, S. & Zayed, T. BEAM Plus implementation in Hong Kong: assessment of challenges and policies. Int. J. Construct. Manag. 22(14), 2830–2844. https://doi.org/10.1080/15623599.2020.1827692 (2022).

Mackey, T., Cashman, A., & Cumberbatch, R. Identification of Factors Contributing to the Deterioration and Losses in the Water Distribution System in Barbados. 68, 73. https://www.cavehill.uwi.edu/cermes/docs/technical_reports/mackey_et_al_2014_pipe_deterioration_and_water_los.aspx. (2014).

Hussein Farh, H. M., Seghier, E. A. B. & Zayed, M. T. A Comprehensive Review of Corrosion Protection and Control Techniques for Metallic Pipelines. Eng. Fail. Anal. https://doi.org/10.1016/j.engfailanal.2022.106885 (2022).

Firouzi, A., Yang, W., Shi, W. & Li, C. Q. Failure of corrosion affected buried cast iron pipes subject to water hammer. Eng. Fail. Anal. https://doi.org/10.1016/j.engfailanal.2020.104993 (2021).

Jun, H. J., Park, J. K. & Bae, C. H. Factors affecting steel water-transmission pipe failure and pipe-failure mechanisms. J. Environ. Eng. https://doi.org/10.1061/(asce)ee.1943-7870.0001692 (2020).

The authors gratefully acknowledge the support from the Innovation and Technology Fund (Innovation and Technology Support Programme (ITSP)) under grant number ITS/033/20FP, and the Water Supplies Department of Hong Kong. The statements made herein are solely the responsibility of the authors and the funding agencies do not carry any responsibilities or liabilities related to the above findings. The corresponding author acknowledges funding from the European Union’s Horizon 2021 research and innovation programme under the Marie Sklodowska-Curie project No-101061320 and the funded APC by Oslo Metropolitan University.

Open access funding provided by OsloMet - Oslo Metropolitan University.

Department of Building and Real Estate, The Hong Kong Polytechnic University, 11 Yuk Choi Road, Hung Hom, Kowloon, Hong Kong, China

Muhammad Muddassir, Tarek Zayed & Ridwan Taiwo

Department of Built Environment, Oslo Metropolitan University, Oslo, Norway

Mohamed el Amine Ben Seshier

You can also search for this author in PubMed  Google Scholar

You can also search for this author in PubMed  Google Scholar

You can also search for this author in PubMed  Google Scholar

You can also search for this author in PubMed  Google Scholar

M.M . Methodology, Validation, Formal analysis, Writing—Original Draft. T.Z. Methodology, Writing—Original Draft, Supervision, Project administration. R.T. Methodology, Validation, Formal analysis, Writing—Original Draft. M.E.A.B.S. Validation, Supervision, Writing—Review & Editing.

Correspondence to Mohamed the Amine Ben Segeier.

The authors declare no competing interests.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Muddassir, M., Zayed, T., Taiwo, R. et al. Advancing the analysis of water pipe failures: a probabilistic framework for identifying significant factors. Sci Rep 14, 19218 (2024). https://doi.org/10.1038/s41598-024-69855-w

DOI: https://doi.org/10.1038/s41598-024-69855-w

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Scientific Reports (Sci Rep) ISSN 2045-2322 (online)

detect bellow pipe Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.