Dopazo, DA;
Mahdjoubi, L;
Gething, B;
Mahamadu, AM;
(2023)
An automated machine learning approach for classifying infrastructure cost data.
Computer-Aided Civil and Infrastructure Engineering
10.1111/mice.13114.
(In press).
Preview |
Text
automated machine learning approach for classifying infrastructure cost data.pdf - Published Version Download (872kB) | Preview |
Abstract
Data on infrastructure project costs are often unstructured and lack consistency. To enable costs to be compared within and between organizations, large amounts of data must be classified to a common standard, typically a manual process. This is time-consuming, error-prone, inconsistent, and subjective, as it is based on human judgment. This paper describes a novel approach for automating the process by harnessing natural language processing identifying the relevant keywords in the text descriptions and implementing machine learning classifiers to emulate the expert's knowledge. The task was to identify “extra over” cost items, conversion factors, and to recognize the correct work breakdown structure (WBS) category. The results show that 94% of the “extra over” cases were correctly classified, and 90% of cases that needed conversion, correctly predicting an associated conversion factor with 87% accuracy. Finally, the WBS categories were identified with 72% accuracy. The approach has the potential to provide a step change in the speed and accuracy of structuring and classifying infrastructure cost data for benchmarking.
Type: | Article |
---|---|
Title: | An automated machine learning approach for classifying infrastructure cost data |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1111/mice.13114 |
Publisher version: | https://doi.org/10.1111/mice.13114 |
Language: | English |
Additional information: | © 2023 The Authors. Computer-Aided Civil and Infrastructure Engineering published by Wiley Periodicals LLC on behalf of Editor. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10180981 |
Archive Staff Only
View Item |