YaBeSH Engineering and Technology Library

    • Journals
    • PaperQuest
    • YSE Standards
    • YaBeSH
    • Login
    View Item 
    •   YE&T Library
    • ASCE
    • Journal of Computing in Civil Engineering
    • View Item
    •   YE&T Library
    • ASCE
    • Journal of Computing in Civil Engineering
    • View Item
    • All Fields
    • Source Title
    • Year
    • Publisher
    • Title
    • Subject
    • Author
    • DOI
    • ISBN
    Advanced Search
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Archive

    Vector-Quantized Variational Teacher and Multimodal Collaborative Student for Crack Segmentation via Knowledge Distillation

    Source: Journal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 003::page 04025030-1
    Author:
    Shi Qiu
    ,
    Qasim Zaheer
    ,
    S. Muhammad Ahmed Hassan Shah
    ,
    Chengbo Ai
    ,
    Jin Wang
    ,
    You Zhan
    DOI: 10.1061/JCCEE5.CPENG-6339
    Publisher: American Society of Civil Engineers
    Abstract: This paper proposes a novel method for real-time crack segmentation in infrastructure inspection that achieves state-of-the-art performance. This approach leverages knowledge distillation, in which a vector-quantized variational autoencoder (VQ-VAE) acts as the “teacher” that extracts informative representations and learns codebook, and a multimodal collaborative student (MCS) utilizes the learned codebook for improved crack segmentation. This framework, incorporating the Teacher’s Codebook Cheating (TCC), achieves high accuracy and efficiency. With minimal parameters (0.59 million), the model demonstrates significant improvements in crack segmentation speed and precision, achieving a Dice score of 93.19, Intersection over Union (IOU) of 0.8723, and mean pixel accuracy of 97.52. Notably, the model processes frames at an impressive 89.3 frames per second (FPS), outperforming all other state-of-the-art methods despite using a smaller input size of 128×128×3; nevertheless, its efficiency stems from its simplicity, with only 0.59 million parameters, making it well-suited for resource-constrained environments. These results highlight the effectiveness of our method for real-time crack segmentation, paving the way for more automated and accessible infrastructure inspection.
    • Download: (6.124Mb)
    • Show Full MetaData Hide Full MetaData
    • Get RIS
    • Item Order
    • Go To Publisher
    • Price: 5000 Rial
    • Statistics

      Vector-Quantized Variational Teacher and Multimodal Collaborative Student for Crack Segmentation via Knowledge Distillation

    URI
    http://yetl.yabesh.ir/yetl1/handle/yetl/4307172
    Collections
    • Journal of Computing in Civil Engineering

    Show full item record

    contributor authorShi Qiu
    contributor authorQasim Zaheer
    contributor authorS. Muhammad Ahmed Hassan Shah
    contributor authorChengbo Ai
    contributor authorJin Wang
    contributor authorYou Zhan
    date accessioned2025-08-17T22:36:09Z
    date available2025-08-17T22:36:09Z
    date copyright5/1/2025 12:00:00 AM
    date issued2025
    identifier otherJCCEE5.CPENG-6339.pdf
    identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4307172
    description abstractThis paper proposes a novel method for real-time crack segmentation in infrastructure inspection that achieves state-of-the-art performance. This approach leverages knowledge distillation, in which a vector-quantized variational autoencoder (VQ-VAE) acts as the “teacher” that extracts informative representations and learns codebook, and a multimodal collaborative student (MCS) utilizes the learned codebook for improved crack segmentation. This framework, incorporating the Teacher’s Codebook Cheating (TCC), achieves high accuracy and efficiency. With minimal parameters (0.59 million), the model demonstrates significant improvements in crack segmentation speed and precision, achieving a Dice score of 93.19, Intersection over Union (IOU) of 0.8723, and mean pixel accuracy of 97.52. Notably, the model processes frames at an impressive 89.3 frames per second (FPS), outperforming all other state-of-the-art methods despite using a smaller input size of 128×128×3; nevertheless, its efficiency stems from its simplicity, with only 0.59 million parameters, making it well-suited for resource-constrained environments. These results highlight the effectiveness of our method for real-time crack segmentation, paving the way for more automated and accessible infrastructure inspection.
    publisherAmerican Society of Civil Engineers
    titleVector-Quantized Variational Teacher and Multimodal Collaborative Student for Crack Segmentation via Knowledge Distillation
    typeJournal Article
    journal volume39
    journal issue3
    journal titleJournal of Computing in Civil Engineering
    identifier doi10.1061/JCCEE5.CPENG-6339
    journal fristpage04025030-1
    journal lastpage04025030-22
    page22
    treeJournal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 003
    contenttypeFulltext
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian
     
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian