YaBeSH Engineering and Technology Library

    • Journals
    • PaperQuest
    • YSE Standards
    • YaBeSH
    • Login
    View Item 
    •   YE&T Library
    • ASCE
    • Journal of Construction Engineering and Management
    • View Item
    •   YE&T Library
    • ASCE
    • Journal of Construction Engineering and Management
    • View Item
    • All Fields
    • Source Title
    • Year
    • Publisher
    • Title
    • Subject
    • Author
    • DOI
    • ISBN
    Advanced Search
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Archive

    Using Text Mining and Bayesian Network to Identify Key Risk Factors for Safety Accidents in Metro Construction

    Source: Journal of Construction Engineering and Management:;2024:;Volume ( 150 ):;issue: 006::page 04024052-1
    Author:
    Jianhong Shen
    ,
    Shupeng Liu
    ,
    Jing Zhang
    DOI: 10.1061/JCEMD4.COENG-14114
    Publisher: American Society of Civil Engineers
    Abstract: Complex risk factors make metro construction safety accidents prone to occur, and there are various types of accidents. Accident reports record detailed information about different types of accidents in text form. However, effectively utilizing such unstructured data presents a significant challenge. Text mining (TM) provides a viable foundation for addressing this challenge, but related studies have limitations in risk feature extraction and lack of in-depth analysis capability. To address the deficiencies of existing studies and provide a feasible strategy for identifying key risk factors in the metro construction domain, this paper proposes an integrated model combining TM and machine learning–based Bayesian networks. Firstly, the term frequency-inverse document frequency (TF-IDF) algorithm in TM was used to separately extract the direct and indirect cause factors from the accident reports, with the missing factors supplemented using the TextRank algorithm. Then, depending on the assumption of whether to consider the conditional independence between factors, an improved naive Bayesian network (NBN) and a tree-augmented naive Bayesian network (TAN) were built based on the extracted factors and the corresponding accident types, respectively, for further in-depth analysis. Finally, the training set was divided to train the two network models, and sensitivity analysis was used to identify the key risk factors. Using 162 accident reports from China as an application example, the results showed that TAN exhibited a higher average accuracy (79.62%) in the test set compared with the improved NBN (71.75%), and the importance of risk factors for different accident types was successfully ranked from multiple perspectives using TAN. Meanwhile, some new insights into metro accidents in China were obtained, which can support decision-making for accident prevention and control. In conclusion, this paper effectively addresses the relevant limitations of accident text utilization and presents a novel approach for metro construction safety management. Analyzing accident texts can help gain insights from objective historical data to support safety management efforts. However, accident texts are often unstructured and contain a lot of irrelevant content. How to quickly extract valid information from accident text and use it to analyze accidents in depth is of continuous interest to safety managers. In particular, those models that have real-time decision support capabilities in addition to theoretical insights. This paper proposes an integrated model that combines text mining and machine-learning Bayesian networks. This model achieves comprehensive textual feature extraction, multifaceted accident causation analysis, and allows safety managers to input current accident information into the model to obtain real-time decision support for accident prevention and control. Although the proposed model is developed for metro construction, it can be slightly adapted by incorporating the characteristics of accident texts from similar domains to obtain an integrated model suitable for these domains, so as to effectively control the occurrence of safety accidents.
    • Download: (1.439Mb)
    • Show Full MetaData Hide Full MetaData
    • Get RIS
    • Item Order
    • Go To Publisher
    • Price: 5000 Rial
    • Statistics

      Using Text Mining and Bayesian Network to Identify Key Risk Factors for Safety Accidents in Metro Construction

    URI
    http://yetl.yabesh.ir/yetl1/handle/yetl/4298740
    Collections
    • Journal of Construction Engineering and Management

    Show full item record

    contributor authorJianhong Shen
    contributor authorShupeng Liu
    contributor authorJing Zhang
    date accessioned2024-12-24T10:20:27Z
    date available2024-12-24T10:20:27Z
    date copyright6/1/2024 12:00:00 AM
    date issued2024
    identifier otherJCEMD4.COENG-14114.pdf
    identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4298740
    description abstractComplex risk factors make metro construction safety accidents prone to occur, and there are various types of accidents. Accident reports record detailed information about different types of accidents in text form. However, effectively utilizing such unstructured data presents a significant challenge. Text mining (TM) provides a viable foundation for addressing this challenge, but related studies have limitations in risk feature extraction and lack of in-depth analysis capability. To address the deficiencies of existing studies and provide a feasible strategy for identifying key risk factors in the metro construction domain, this paper proposes an integrated model combining TM and machine learning–based Bayesian networks. Firstly, the term frequency-inverse document frequency (TF-IDF) algorithm in TM was used to separately extract the direct and indirect cause factors from the accident reports, with the missing factors supplemented using the TextRank algorithm. Then, depending on the assumption of whether to consider the conditional independence between factors, an improved naive Bayesian network (NBN) and a tree-augmented naive Bayesian network (TAN) were built based on the extracted factors and the corresponding accident types, respectively, for further in-depth analysis. Finally, the training set was divided to train the two network models, and sensitivity analysis was used to identify the key risk factors. Using 162 accident reports from China as an application example, the results showed that TAN exhibited a higher average accuracy (79.62%) in the test set compared with the improved NBN (71.75%), and the importance of risk factors for different accident types was successfully ranked from multiple perspectives using TAN. Meanwhile, some new insights into metro accidents in China were obtained, which can support decision-making for accident prevention and control. In conclusion, this paper effectively addresses the relevant limitations of accident text utilization and presents a novel approach for metro construction safety management. Analyzing accident texts can help gain insights from objective historical data to support safety management efforts. However, accident texts are often unstructured and contain a lot of irrelevant content. How to quickly extract valid information from accident text and use it to analyze accidents in depth is of continuous interest to safety managers. In particular, those models that have real-time decision support capabilities in addition to theoretical insights. This paper proposes an integrated model that combines text mining and machine-learning Bayesian networks. This model achieves comprehensive textual feature extraction, multifaceted accident causation analysis, and allows safety managers to input current accident information into the model to obtain real-time decision support for accident prevention and control. Although the proposed model is developed for metro construction, it can be slightly adapted by incorporating the characteristics of accident texts from similar domains to obtain an integrated model suitable for these domains, so as to effectively control the occurrence of safety accidents.
    publisherAmerican Society of Civil Engineers
    titleUsing Text Mining and Bayesian Network to Identify Key Risk Factors for Safety Accidents in Metro Construction
    typeJournal Article
    journal volume150
    journal issue6
    journal titleJournal of Construction Engineering and Management
    identifier doi10.1061/JCEMD4.COENG-14114
    journal fristpage04024052-1
    journal lastpage04024052-13
    page13
    treeJournal of Construction Engineering and Management:;2024:;Volume ( 150 ):;issue: 006
    contenttypeFulltext
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian
     
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian