Two-Dimensional and Three-Dimensional CNN-Based Simultaneous Detection and Activity Classification of Construction WorkersSource: Journal of Computing in Civil Engineering:;2022:;Volume ( 036 ):;issue: 004::page 04022009DOI: 10.1061/(ASCE)CP.1943-5487.0001024Publisher: ASCE
Abstract: The type and duration of construction workers’ activities are useful information for project management purposes. Therefore, several studies have used surveillance cameras and computer vision to automate the time-consuming process of manually gathering this information. However, the three-stage method they have adopted consisting of separate detection, tracking, and activity classification modules is not fully optimized. Additionally, the activity classification module is trained per-clip/segment on trimmed video clips and fails when applied to long untrimmed construction videos. This paper aims to (1) investigate the benefits of a fully optimized method such as you only watch once (YOWO) and a per-frame and per-worker annotated untrimmed data set over the previous approach for activity recognition of construction workers; (2) propose an improved version of YOWO, called YOWO53, to improve detection performance; (3) propose a semiautomatic data set annotation; (4) conduct a sensitivity analysis to compare the performance of YOWO, YOWO53, and the three-stage method; and (5) conduct a case study to compute the percentage of different workers’ activities. YOWO53 improves the detection recall of YOWO by up to 3%, and the classification accuracy of the three-stage method by 16.3%. Although YOWO53 has a lower inference speed, it is still sufficiently fast for productivity analysis.
|
Collections
Show full item record
contributor author | Ghazaleh Torabi | |
contributor author | Amin Hammad | |
contributor author | Nizar Bouguila | |
date accessioned | 2022-05-07T20:58:05Z | |
date available | 2022-05-07T20:58:05Z | |
date issued | 2022-03-25 | |
identifier other | (ASCE)CP.1943-5487.0001024.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4283131 | |
description abstract | The type and duration of construction workers’ activities are useful information for project management purposes. Therefore, several studies have used surveillance cameras and computer vision to automate the time-consuming process of manually gathering this information. However, the three-stage method they have adopted consisting of separate detection, tracking, and activity classification modules is not fully optimized. Additionally, the activity classification module is trained per-clip/segment on trimmed video clips and fails when applied to long untrimmed construction videos. This paper aims to (1) investigate the benefits of a fully optimized method such as you only watch once (YOWO) and a per-frame and per-worker annotated untrimmed data set over the previous approach for activity recognition of construction workers; (2) propose an improved version of YOWO, called YOWO53, to improve detection performance; (3) propose a semiautomatic data set annotation; (4) conduct a sensitivity analysis to compare the performance of YOWO, YOWO53, and the three-stage method; and (5) conduct a case study to compute the percentage of different workers’ activities. YOWO53 improves the detection recall of YOWO by up to 3%, and the classification accuracy of the three-stage method by 16.3%. Although YOWO53 has a lower inference speed, it is still sufficiently fast for productivity analysis. | |
publisher | ASCE | |
title | Two-Dimensional and Three-Dimensional CNN-Based Simultaneous Detection and Activity Classification of Construction Workers | |
type | Journal Paper | |
journal volume | 36 | |
journal issue | 4 | |
journal title | Journal of Computing in Civil Engineering | |
identifier doi | 10.1061/(ASCE)CP.1943-5487.0001024 | |
journal fristpage | 04022009 | |
journal lastpage | 04022009-19 | |
page | 19 | |
tree | Journal of Computing in Civil Engineering:;2022:;Volume ( 036 ):;issue: 004 | |
contenttype | Fulltext |