YaBeSH Engineering and Technology Library

    • Journals
    • PaperQuest
    • YSE Standards
    • YaBeSH
    • Login
    View Item 
    •   YE&T Library
    • ASCE
    • Journal of Computing in Civil Engineering
    • View Item
    •   YE&T Library
    • ASCE
    • Journal of Computing in Civil Engineering
    • View Item
    • All Fields
    • Source Title
    • Year
    • Publisher
    • Title
    • Subject
    • Author
    • DOI
    • ISBN
    Advanced Search
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Archive

    Fusing Data Extracted from Bridge Inspection Reports for Enhanced Data-Driven Bridge Deterioration Prediction: A Hybrid Data Fusion Method

    Source: Journal of Computing in Civil Engineering:;2020:;Volume ( 034 ):;issue: 006
    Author:
    Kaijian Liu
    ,
    Nora El-Gohary
    DOI: 10.1061/(ASCE)CP.1943-5487.0000921
    Publisher: ASCE
    Abstract: Data buried in textual bridge inspection reports offer great promise for enhanced data-driven bridge deterioration prediction. However, learning from these reports is challenging because they typically use multiple concept names to refer to the same entity and typically describe multiple instances of the same type of deficiency. Such multiple names and instances increase the dimensionality and the sparsity of the feature space, which would cause overfitting to a particular feature, undermine the generalizability of the machine learning models, and compromise the performance of the data-driven prediction. To address this challenge, this paper proposes a new hybrid data fusion method. It combines an unsupervised named entity normalization method and an entropy-based numerical data fusion method to fuse concept names and numerical data, respectively. The proposed normalization method uses an n-gram model to generate candidate canonical identifier names and utilizes corpus statistics and lexical patterns to fuse the multiple concept names into a candidate name that balances abstraction and detailedness. The proposed fusion method uses data discretization and information entropy to fuse the multiple deficiency measures (of the instances) into a single representation. The hybrid fusion method was validated in fusing data extracted from textual bridge inspection reports for supporting the prediction of future bridge condition ratings. Learning from the fused data, compared to learning from the unfused data, improved the accuracies of predicting the ratings of decks, superstructures, and substructures by 8.0%, 8.5%, and 7.9%, respectively.
    • Download: (510.2Kb)
    • Show Full MetaData Hide Full MetaData
    • Get RIS
    • Item Order
    • Go To Publisher
    • Price: 5000 Rial
    • Statistics

      Fusing Data Extracted from Bridge Inspection Reports for Enhanced Data-Driven Bridge Deterioration Prediction: A Hybrid Data Fusion Method

    URI
    http://yetl.yabesh.ir/yetl1/handle/yetl/4268388
    Collections
    • Journal of Computing in Civil Engineering

    Show full item record

    contributor authorKaijian Liu
    contributor authorNora El-Gohary
    date accessioned2022-01-30T21:32:30Z
    date available2022-01-30T21:32:30Z
    date issued11/1/2020 12:00:00 AM
    identifier other%28ASCE%29CP.1943-5487.0000921.pdf
    identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4268388
    description abstractData buried in textual bridge inspection reports offer great promise for enhanced data-driven bridge deterioration prediction. However, learning from these reports is challenging because they typically use multiple concept names to refer to the same entity and typically describe multiple instances of the same type of deficiency. Such multiple names and instances increase the dimensionality and the sparsity of the feature space, which would cause overfitting to a particular feature, undermine the generalizability of the machine learning models, and compromise the performance of the data-driven prediction. To address this challenge, this paper proposes a new hybrid data fusion method. It combines an unsupervised named entity normalization method and an entropy-based numerical data fusion method to fuse concept names and numerical data, respectively. The proposed normalization method uses an n-gram model to generate candidate canonical identifier names and utilizes corpus statistics and lexical patterns to fuse the multiple concept names into a candidate name that balances abstraction and detailedness. The proposed fusion method uses data discretization and information entropy to fuse the multiple deficiency measures (of the instances) into a single representation. The hybrid fusion method was validated in fusing data extracted from textual bridge inspection reports for supporting the prediction of future bridge condition ratings. Learning from the fused data, compared to learning from the unfused data, improved the accuracies of predicting the ratings of decks, superstructures, and substructures by 8.0%, 8.5%, and 7.9%, respectively.
    publisherASCE
    titleFusing Data Extracted from Bridge Inspection Reports for Enhanced Data-Driven Bridge Deterioration Prediction: A Hybrid Data Fusion Method
    typeJournal Paper
    journal volume34
    journal issue6
    journal titleJournal of Computing in Civil Engineering
    identifier doi10.1061/(ASCE)CP.1943-5487.0000921
    page14
    treeJournal of Computing in Civil Engineering:;2020:;Volume ( 034 ):;issue: 006
    contenttypeFulltext
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian
     
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian