Estimating Road Construction Costs with Explainable Machine Learning

Published Online:https://doi.org/10.1287/inte.2023.0027

A preliminary estimation of construction costs is a crucial operation of any project related to civil engineering. An accurate estimation ensures a proper management of the available funds and helps the project managers in their decision-making processes. For instance, it is common that specific subtasks of the project are delegated to private subcontractors. Through a call for tenders, each eventual subcontractor has the opportunity to propose a bid with a price for supplying the service. Because the call is generally public, a competition may arise between subcontractors. This impacts the price proposed by the competitors to get the contract. In order to select a subcontractor, the project manager needs to have an accurate idea of a reasonable price for the subtask given. A price higher than expected is undesirable, but a price significantly lower than expected may also result in poor quality of service. The project manager must also be able to explain to stakeholders why a price is suited and justify why a specific subcontractor has been selected. Providing an estimation that is both accurate and transparent is a hard problem for the project manager. A growing trend is to leverage machine learning for this estimation, but designing a model that is both accurate and explainable is still a challenge. Another difficulty is that an approach that is accurate for estimating the cost of a subtask may not be efficient for another one. Based on this context, this paper introduces a framework for estimating construction costs while tackling both challenges. It is based on six machine learning models and on Shapley additive explanations. This project was commissioned by the Ministry of Transport and Sustainable Mobility, a public agency responsible for transport infrastructure in Quebec, Canada. Experiments were carried out on real data, covering historical road construction costs of 11,646 contracts and eight subtasks from 2014 to 2021. Results show that the framework is able to surpass the accuracy of human estimations by up to 31.56% while being able to adequately explain how the estimations have been obtained.

History: This paper was refereed.

Funding: This project was commissioned by the Ministry of Transport and Sustainable Mobility, a public agency responsible for transport infrastructure in Quebec, Canada.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/inte.2023.0027.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.