2,761
Views
2
CrossRef citations to date
0
Altmetric
Research Articles

A machine learning study to improve the reliability of project cost estimates

ORCID Icon, ORCID Icon, &
Pages 4372-4388 | Received 31 Jan 2023, Accepted 14 Sep 2023, Published online: 25 Sep 2023

Figures & data

Figure 1. The research methodology.

This ML-based approach, presented in five stages and an alternative to the traditional EVM approach, is used to improve the final cost estimates in projects.
Figure 1. The research methodology.

Figure 2. A typical cost S-curve of a project.

A line graph, plotting the three curves of the planned, earned, and actual cost values in EVM, respectively, resembles a typical S-shape pattern.
Figure 2. A typical cost S-curve of a project.

Table 1. The EVM data of a hypothetical project (the tracking period – month 3).

Table 2. The EVM data and inputs to the XGBoost model (the early-stage estimation).

Figure 3. The training-testing protocol of the ML algorithm.

A flowchart, representing training and testing processes separately, depicts the process of implementing the ML algorithm to the EVM project data to estimate the final project cost.
Figure 3. The training-testing protocol of the ML algorithm.

Table 3. The ML models, their parameter settings and explanation.

Table 4. The accuracy results of the forecasting models, MAPE%.

Table 5. The accuracy results of the forecasting models, NRMSE.

Figure 4. The number of more accurate cases (frequency of projects) by the XGBoost and index-based benchmark models.

A chart, contrasting the estimates’ frequency by the index-based model and the XGBoost model, shows that the performance of XGBoost is significantly better in forecasting the final project cost.
Figure 4. The number of more accurate cases (frequency of projects) by the XGBoost and index-based benchmark models.

Data availability statement

The authors confirm that the data supporting the findings of this study are available within the article and its supplementary materials.