description abstract | Damage due to cracking can be detected through either manual visual methods or machine vision techniques for early prevention and maintenance. In recent years, image-based deep learning methods have emerged as potent tools for automatic crack detection. In this study, five deep learning object detection algorithms—faster R-CNN, single-shot detector (SSD), You Only Look Once (YOLO) v3 and v8, and RetinaNet—were systematically compared, and the results were analyzed. Object detection involves the generation of bounding boxes of various sizes for objects of interest. Because cracks are thin and small and thus difficult to capture in a unique bounding box, redundant measurements are common, but they compromise the accuracy and consistency of the model. Therefore, an improved confusion matrix based on linear features was employed in this study to evaluate the crack detection performance of the five object detection algorithms. In evaluation experiments, the overall accuracy levels of SSD were 90.6% on visible atmospherically resistant index (VARI) images, indicating effective concrete crack detection performance. Notably, SSD excels in cases involving small cracks and data imbalance, thus demonstrating a high level of model stability. This comparative analysis of the performances of different deep learning algorithms in crack detection contributes to the formulation of methods for automatic damage detection. | |