CTGAN-Based Model to Mitigate Data Scarcity for Cost Estimation in Green Building ProjectsSource: Journal of Management in Engineering:;2024:;Volume ( 040 ):;issue: 004::page 04024024-1DOI: 10.1061/JMENEA.MEENG-5880Publisher: American Society of Civil Engineers
Abstract: This study presents a method for estimating construction costs, even when dealing with limited and unreliable data, to enhance decision-making in the early project stages. Owners, particularly in green building projects, often face challenges due to the scarcity of usable data, making cost estimation a complex task. They struggle to differentiate between costs associated with existing buildings and green buildings. To address this issue, we introduce a novel approach that leverages conditional tabular generative adversarial networks (CTGANs) for data augmentation, overcoming the limitations of relying solely on historical data. This involves training an artificial neural network (ANN)–based model using synthetic data, effectively addressing the scarcity and imbalance present in the original small data set. Compared to models trained exclusively on the original data set, our approach yielded a remarkable reduction of approximately 66% in root-mean-square error (RMSE), while increasing the validity from 0% to 15.09%. This study not only improves construction cost estimation but also facilitates more informed decision-making for owners, even in cases with limited and unreliable data, ultimately contributing to the efficiency of the construction project planning process.
|
Collections
Show full item record
| contributor author | Eunbin Hong | |
| contributor author | June-Seong Yi | |
| contributor author | Donghwan Lee | |
| date accessioned | 2024-12-24T10:42:20Z | |
| date available | 2024-12-24T10:42:20Z | |
| date copyright | 7/1/2024 12:00:00 AM | |
| date issued | 2024 | |
| identifier other | JMENEA.MEENG-5880.pdf | |
| identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4299399 | |
| description abstract | This study presents a method for estimating construction costs, even when dealing with limited and unreliable data, to enhance decision-making in the early project stages. Owners, particularly in green building projects, often face challenges due to the scarcity of usable data, making cost estimation a complex task. They struggle to differentiate between costs associated with existing buildings and green buildings. To address this issue, we introduce a novel approach that leverages conditional tabular generative adversarial networks (CTGANs) for data augmentation, overcoming the limitations of relying solely on historical data. This involves training an artificial neural network (ANN)–based model using synthetic data, effectively addressing the scarcity and imbalance present in the original small data set. Compared to models trained exclusively on the original data set, our approach yielded a remarkable reduction of approximately 66% in root-mean-square error (RMSE), while increasing the validity from 0% to 15.09%. This study not only improves construction cost estimation but also facilitates more informed decision-making for owners, even in cases with limited and unreliable data, ultimately contributing to the efficiency of the construction project planning process. | |
| publisher | American Society of Civil Engineers | |
| title | CTGAN-Based Model to Mitigate Data Scarcity for Cost Estimation in Green Building Projects | |
| type | Journal Article | |
| journal volume | 40 | |
| journal issue | 4 | |
| journal title | Journal of Management in Engineering | |
| identifier doi | 10.1061/JMENEA.MEENG-5880 | |
| journal fristpage | 04024024-1 | |
| journal lastpage | 04024024-17 | |
| page | 17 | |
| tree | Journal of Management in Engineering:;2024:;Volume ( 040 ):;issue: 004 | |
| contenttype | Fulltext |