description abstract | Bridge inspection reports are a vital source of data for bridge management and maintenance, encompassing essential structural information indispensable for damage evaluation and decision-making. However, in the process of automatically extracting unstructured textual data and identifying damage entities, because the same type of bridge damage entity often corresponds to multiple structural components, and strong correlations along with prominent nested features exist among entities, general named entity recognition (NER) methods have limited effectiveness. To address these issues, this study introduces a novel method for NER of damage and defects in bridge inspection, leveraging text mining and pretrained natural language models. First, the study constructs a specialized corpus of bridge damage and defects from a large number of bridge inspection reports, and fine-grained entity annotations are performed on sentences describing damage and defects. Next, the study proposes an advanced bridge damage entity recognition model, which integrates pretrained natural language models with deep learning models. The model leverages the Bidirectional Encoder Representations from Transformers (BERT) pretrained model to extract vector features from Chinese characters in damage-related sentences. It then utilizes a bidirectional long short-term memory (BiLSTM) network to capture sequential patterns of multitype entity labels. Finally, it integrates conditional random fields (CRF) to enforce label constraints, generating the optimal label sequence. The model is validated through experiments using the constructed Chinese bridge inspection damage and defect named entity corpus. Experimental results demonstrate that the model proposed in this study surpasses other mainstream NER models, achieving an F1 score of 98.31% and successfully identifying seven categories of fine-grained bridge damage entities. This study not only enhances the automation of extracting information from damage-related bridge inspection text sentences but also establishes a solid foundation for building knowledge graphs in the bridge domain, advancing the development of intelligent bridge management. | |