Show simple item record

contributor authorJiaqi Liu
contributor authorWeijie Li
contributor authorFangchang Li
contributor authorXuefeng Zhao
date accessioned2025-08-17T22:41:04Z
date available2025-08-17T22:41:04Z
date copyright6/1/2025 12:00:00 AM
date issued2025
identifier otherJCEMD4.COENG-16019.pdf
identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4307293
description abstractBridge inspection reports are a vital source of data for bridge management and maintenance, encompassing essential structural information indispensable for damage evaluation and decision-making. However, in the process of automatically extracting unstructured textual data and identifying damage entities, because the same type of bridge damage entity often corresponds to multiple structural components, and strong correlations along with prominent nested features exist among entities, general named entity recognition (NER) methods have limited effectiveness. To address these issues, this study introduces a novel method for NER of damage and defects in bridge inspection, leveraging text mining and pretrained natural language models. First, the study constructs a specialized corpus of bridge damage and defects from a large number of bridge inspection reports, and fine-grained entity annotations are performed on sentences describing damage and defects. Next, the study proposes an advanced bridge damage entity recognition model, which integrates pretrained natural language models with deep learning models. The model leverages the Bidirectional Encoder Representations from Transformers (BERT) pretrained model to extract vector features from Chinese characters in damage-related sentences. It then utilizes a bidirectional long short-term memory (BiLSTM) network to capture sequential patterns of multitype entity labels. Finally, it integrates conditional random fields (CRF) to enforce label constraints, generating the optimal label sequence. The model is validated through experiments using the constructed Chinese bridge inspection damage and defect named entity corpus. Experimental results demonstrate that the model proposed in this study surpasses other mainstream NER models, achieving an F1 score of 98.31% and successfully identifying seven categories of fine-grained bridge damage entities. This study not only enhances the automation of extracting information from damage-related bridge inspection text sentences but also establishes a solid foundation for building knowledge graphs in the bridge domain, advancing the development of intelligent bridge management.
publisherAmerican Society of Civil Engineers
titleChinese Named Entity Recognition for Bridge Damage and Defects Based on Text Mining and Natural Language Pretraining Models
typeJournal Article
journal volume151
journal issue6
journal titleJournal of Construction Engineering and Management
identifier doi10.1061/JCEMD4.COENG-16019
journal fristpage04025060-1
journal lastpage04025060-12
page12
treeJournal of Construction Engineering and Management:;2025:;Volume ( 151 ):;issue: 006
contenttypeFulltext


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record