Automated Human Use Mapping of Social Infrastructure by Deep Learning Methods Applied to Smart City Camera Systems

Peng Sun; Gabriel Draughon; Rui Hou; Jerome P. Lynch

Source: Journal of Computing in Civil Engineering:;2022:;Volume ( 036 ):;issue: 004::page 04022011

Author:

DOI: 10.1061/(ASCE)CP.1943-5487.0000998

Publisher: ASCE

Abstract: With the emergence of the smart city, there is a growing need for scalable methods that sense how humans interact and use infrastructure in order to model social behaviors relevant to designing sustainable and resilient built environments. Cyber-physical system (CPS) frameworks used to monitor and automate infrastructure systems in smart cities can be extended to sense people to better understand how they use infrastructure systems including social infrastructure (e.g., parks, markets). This paper adopts convolutional neural network (CNN) architectures to automate the detection and spatiotemporal mapping of people using camera data to form a cyber-physical-social system (CPSS) for smart cities. The Mask region based convolutional neural network (R-CNN) detector was adopted and tailored to identify and segment human subjects in real time using camera images with an average speed of 7 frames per second. The Mask R-CNN framework was trained end to end using the Objects in Public Open Spaces (OPOS) image data set that includes classified segmentations of people in public spaces. A two-dimensional/three-dimensional (2D-3D) lifting algorithm based on a monocular camera calibration model was also employed to accurately position detected people in space. Finally, a Hungarian assignment algorithm based on association metrics extracted from detected people was used to assign people to spatiotemporal trajectories. To demonstrate the proposed framework, this study used the Detroit riverfront parks to study how people utilize community parks, which are a form of social infrastructure. The Mask R-CNN detector is proven precise in detecting and classifying the behavior of people in parks with mean average precision well above 85% for all class types defined in the OPOS library. The framework is also shown to be effective in spatially mapping the various uses of park furnishings, leading to better management of parks.

Download: (6.327Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Automated Human Use Mapping of Social Infrastructure by Deep Learning Methods Applied to Smart City Camera Systems

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4283107

Collections

Journal of Computing in Civil Engineering

Show full item record

contributor author	Peng Sun
contributor author	Gabriel Draughon
contributor author	Rui Hou
contributor author	Jerome P. Lynch
date accessioned	2022-05-07T20:56:59Z
date available	2022-05-07T20:56:59Z
date issued	2022-04-04
identifier other	(ASCE)CP.1943-5487.0000998.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4283107
description abstract	With the emergence of the smart city, there is a growing need for scalable methods that sense how humans interact and use infrastructure in order to model social behaviors relevant to designing sustainable and resilient built environments. Cyber-physical system (CPS) frameworks used to monitor and automate infrastructure systems in smart cities can be extended to sense people to better understand how they use infrastructure systems including social infrastructure (e.g., parks, markets). This paper adopts convolutional neural network (CNN) architectures to automate the detection and spatiotemporal mapping of people using camera data to form a cyber-physical-social system (CPSS) for smart cities. The Mask region based convolutional neural network (R-CNN) detector was adopted and tailored to identify and segment human subjects in real time using camera images with an average speed of 7 frames per second. The Mask R-CNN framework was trained end to end using the Objects in Public Open Spaces (OPOS) image data set that includes classified segmentations of people in public spaces. A two-dimensional/three-dimensional (2D-3D) lifting algorithm based on a monocular camera calibration model was also employed to accurately position detected people in space. Finally, a Hungarian assignment algorithm based on association metrics extracted from detected people was used to assign people to spatiotemporal trajectories. To demonstrate the proposed framework, this study used the Detroit riverfront parks to study how people utilize community parks, which are a form of social infrastructure. The Mask R-CNN detector is proven precise in detecting and classifying the behavior of people in parks with mean average precision well above 85% for all class types defined in the OPOS library. The framework is also shown to be effective in spatially mapping the various uses of park furnishings, leading to better management of parks.
publisher	ASCE
title	Automated Human Use Mapping of Social Infrastructure by Deep Learning Methods Applied to Smart City Camera Systems
type	Journal Paper
journal volume	36
journal issue	4
journal title	Journal of Computing in Civil Engineering
identifier doi	10.1061/(ASCE)CP.1943-5487.0000998
journal fristpage	04022011
journal lastpage	04022011-21
page	21
tree	Journal of Computing in Civil Engineering:;2022:;Volume ( 036 ):;issue: 004
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive