A Hybrid Framework for Predicting Crash Severity in Construction Work Zones Using Knowledge Distillation and Conditional GANsSource: Journal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 002::page 04025010-1DOI: 10.1061/JCCEE5.CPENG-6043Publisher: American Society of Civil Engineers
Abstract: Construction work zone crashes represent a critical area of concern within the realm of traffic safety, posing unique challenges for both road users and transportation authorities. Common factors contributing to work zone crashes include reduced speeds, lane closures, and the presence of construction equipment and workers. Fatal crashes within work zones, while relatively rare compared to nonfatal incidents, carry substantial significance due to their severe consequences. The complexity and variability of work zone conditions, coupled with the infrequency of fatal crashes, make it challenging for both machine learning (ML) and statistical models to predict them accurately. Furthermore, existing ML models for predicting crash severity are computationally demanding, which may not be feasible in all situations. To this end, this paper investigates the potential use of conditional tabular generative adversarial networks (CTGAN) and knowledge distillation (KD) in overcoming these challenges through a comprehensive framework. The results demonstrate that synthetic data generated by CTGAN markedly boost the models’ ability to identify underrepresented classes by up to 15.2 percentage points. Moreover, the distillation process exhibited promising outcomes in enhancing the performance of simpler models, such as decision trees, which could be beneficial for deployment on devices with limited computational resources. Construction work zone crashes pose significant challenges for transportation authorities and safety managers due to the dynamic and complex nature of these environments. Common contributing factors to work zone crashes include reduced speeds, lane closures, and the presence of construction equipment and workers. The severity of crashes, particularly fatal incidents, underscores the need for accurate and efficient prediction models. To this end, this study presents a hybrid framework combining CTGAN and KD to enhance the predictive accuracy and efficiency of crash severity models in work zones. By generating synthetic data, CTGAN addresses the challenge of imbalanced data sets, particularly for low-frequency, high-severity crashes. This leads to a more comprehensive data set for training predictive models, resulting in improved accuracy. The framework also leverages KD to distill knowledge from complex models into smaller, efficient models suitable for deployment on mobile devices. This allows DOT safety managers to run predictive models in the field, enabling real-time decision-making and rapid response to safety concerns. This approach is particularly beneficial for DOT safety managers who often work in field environments with limited computational resources.
|
Collections
Show full item record
contributor author | Ali Hassandokht Mashhadi | |
contributor author | Abbas Rashidi | |
contributor author | Juan C. Medina | |
contributor author | Nikola Marković | |
date accessioned | 2025-04-20T10:11:23Z | |
date available | 2025-04-20T10:11:23Z | |
date copyright | 1/13/2025 12:00:00 AM | |
date issued | 2025 | |
identifier other | JCCEE5.CPENG-6043.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4304176 | |
description abstract | Construction work zone crashes represent a critical area of concern within the realm of traffic safety, posing unique challenges for both road users and transportation authorities. Common factors contributing to work zone crashes include reduced speeds, lane closures, and the presence of construction equipment and workers. Fatal crashes within work zones, while relatively rare compared to nonfatal incidents, carry substantial significance due to their severe consequences. The complexity and variability of work zone conditions, coupled with the infrequency of fatal crashes, make it challenging for both machine learning (ML) and statistical models to predict them accurately. Furthermore, existing ML models for predicting crash severity are computationally demanding, which may not be feasible in all situations. To this end, this paper investigates the potential use of conditional tabular generative adversarial networks (CTGAN) and knowledge distillation (KD) in overcoming these challenges through a comprehensive framework. The results demonstrate that synthetic data generated by CTGAN markedly boost the models’ ability to identify underrepresented classes by up to 15.2 percentage points. Moreover, the distillation process exhibited promising outcomes in enhancing the performance of simpler models, such as decision trees, which could be beneficial for deployment on devices with limited computational resources. Construction work zone crashes pose significant challenges for transportation authorities and safety managers due to the dynamic and complex nature of these environments. Common contributing factors to work zone crashes include reduced speeds, lane closures, and the presence of construction equipment and workers. The severity of crashes, particularly fatal incidents, underscores the need for accurate and efficient prediction models. To this end, this study presents a hybrid framework combining CTGAN and KD to enhance the predictive accuracy and efficiency of crash severity models in work zones. By generating synthetic data, CTGAN addresses the challenge of imbalanced data sets, particularly for low-frequency, high-severity crashes. This leads to a more comprehensive data set for training predictive models, resulting in improved accuracy. The framework also leverages KD to distill knowledge from complex models into smaller, efficient models suitable for deployment on mobile devices. This allows DOT safety managers to run predictive models in the field, enabling real-time decision-making and rapid response to safety concerns. This approach is particularly beneficial for DOT safety managers who often work in field environments with limited computational resources. | |
publisher | American Society of Civil Engineers | |
title | A Hybrid Framework for Predicting Crash Severity in Construction Work Zones Using Knowledge Distillation and Conditional GANs | |
type | Journal Article | |
journal volume | 39 | |
journal issue | 2 | |
journal title | Journal of Computing in Civil Engineering | |
identifier doi | 10.1061/JCCEE5.CPENG-6043 | |
journal fristpage | 04025010-1 | |
journal lastpage | 04025010-15 | |
page | 15 | |
tree | Journal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 002 | |
contenttype | Fulltext |