YaBeSH Engineering and Technology Library

    Fusion of Convolution Neural Network and Visual Transformer for Lithology Identification Using Tunnel Face Images

    Source: Journal of Computing in Civil Engineering, 2025, Volume 39, Issue 2, page 04024056-1
    Author: Jianjun Tong, Lulu Xiang, Allen A. Zhang, Xingwang Miao, Mingnian Wang, Pei Ye
    DOI: 10.1061/JCCEE5.CPENG-5997
    Publisher: American Society of Civil Engineers
    Abstract: This study proposes an intelligent method for recognizing the lithology of a tunnel working face by combining a convolutional neural network and a visual transformer. First, an efficient method for collecting high-resolution images of the tunnel working face after construction blasting is developed. Based on relevant geological data, the lithology labels of the tunnel face images are manually prepared. A data augmentation technique is then applied to expand the number of original image samples. Given the established sets of tunnel face images and corresponding lithology labels, the performances of ResNet18 and VIT-4 (which contains four transformer encoding layers) developed in this paper in identifying lithology are compared and analyzed. Subsequently, the efficiencies of using ResNet18 and VIT-4 in both parallel and successive manners are evaluated. The experimental results show that the accuracies of ResNet18 and VIT-4 are 95.7% and 95.4%, respectively. However, stacking ResNet18 and VIT-4 in a parallel manner achieves significantly improved performance in lithology recognition, with an accuracy rate of 98.3%. In contrast, the performance achieved from combining ResNet18 and VIT-4 in a serial manner depends on their structures. Achieving optimal classification performance hinges on minimizing the number of convolution blocks in ResNet18 and concatenating appropriate transformer blocks. With the optimal network structure, deploying ResNet18 and VIT-4 in a serial manner achieves the highest accuracy, 98.5%.
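
    The parallel and serial combinations described in the abstract can be illustrated with a minimal PyTorch sketch. This is not the authors' implementation: the small convolutional stack stands in for ResNet18, the four-layer encoder stands in for VIT-4, and every name, layer size, patch size, and the class count (`num_classes=4`) is a hypothetical choice made only so the example runs. Positional embeddings are omitted for brevity.

```python
import torch
import torch.nn as nn


class ConvBranch(nn.Module):
    """Hypothetical stand-in for the CNN branch (the paper uses ResNet18)."""

    def __init__(self, out_channels: int = 64):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(inplace=True),
            nn.Conv2d(32, out_channels, kernel_size=3, stride=2, padding=1),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        # Output feature map has shape (B, out_channels, H/4, W/4).
        return self.features(x)


class TransformerBranch(nn.Module):
    """Hypothetical stand-in for VIT-4: patch embedding + four encoder layers."""

    def __init__(self, in_channels=3, patch=8, dim=64, depth=4, heads=4):
        super().__init__()
        # A strided convolution splits the input into non-overlapping patches.
        self.embed = nn.Conv2d(in_channels, dim, kernel_size=patch, stride=patch)
        layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=2 * dim, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)

    def forward(self, x):
        tokens = self.embed(x).flatten(2).transpose(1, 2)  # (B, N, dim)
        return self.encoder(tokens).mean(dim=1)            # (B, dim)


class ParallelFusion(nn.Module):
    """Both branches see the image; their feature vectors are concatenated."""

    def __init__(self, num_classes: int, dim: int = 64):
        super().__init__()
        self.cnn = ConvBranch(dim)
        self.vit = TransformerBranch(in_channels=3, patch=8, dim=dim)
        self.head = nn.Linear(2 * dim, num_classes)

    def forward(self, x):
        cnn_vec = self.cnn(x).mean(dim=(2, 3))  # global average pooling
        vit_vec = self.vit(x)
        return self.head(torch.cat([cnn_vec, vit_vec], dim=1))


class SerialFusion(nn.Module):
    """CNN feature maps are fed to the transformer instead of raw pixels."""

    def __init__(self, num_classes: int, dim: int = 64):
        super().__init__()
        self.cnn = ConvBranch(dim)
        self.vit = TransformerBranch(in_channels=dim, patch=2, dim=dim)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):
        return self.head(self.vit(self.cnn(x)))


if __name__ == "__main__":
    images = torch.randn(2, 3, 64, 64)  # random tensors, not tunnel face data
    print(ParallelFusion(num_classes=4)(images).shape)
    print(SerialFusion(num_classes=4)(images).shape)
```

    In the serial variant, the depth of the convolutional stage and the number of encoder layers are the structural choices that the abstract reports as determining accuracy; here they are simply the `ConvBranch` layer count and the `depth` argument.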
    URI: http://yetl.yabesh.ir/yetl1/handle/yetl/4304685
    Collection: Journal of Computing in Civil Engineering

    contributor author: Jianjun Tong
    contributor author: Lulu Xiang
    contributor author: Allen A. Zhang
    contributor author: Xingwang Miao
    contributor author: Mingnian Wang
    contributor author: Pei Ye
    date accessioned: 2025-04-20T10:25:11Z
    date available: 2025-04-20T10:25:11Z
    date copyright: 11/22/2024 12:00:00 AM
    date issued: 2025
    identifier other: JCCEE5.CPENG-5997.pdf
    identifier uri: http://yetl.yabesh.ir/yetl1/handle/yetl/4304685
    publisher: American Society of Civil Engineers
    title: Fusion of Convolution Neural Network and Visual Transformer for Lithology Identification Using Tunnel Face Images
    type: Journal Article
    journal volume: 39
    journal issue: 2
    journal title: Journal of Computing in Civil Engineering
    identifier doi: 10.1061/JCCEE5.CPENG-5997
    journal first page: 04024056-1
    journal last page: 04024056-17
    pages: 17
    tree: Journal of Computing in Civil Engineering, 2025, Volume 39, Issue 2
    content type: Fulltext
    DSpace software copyright © 2002-2015  DuraSpace
    "DSpace" digital library software, localized into Persian by Yabesh for Iranian libraries | Contact Yabesh