contributor author | Xiaorui Xue | |
contributor author | Jiansong Zhang | |
date accessioned | 2022-01-30T21:32:19Z | |
date available | 2022-01-30T21:32:19Z | |
date issued | 9/1/2020 12:00:00 AM | |
identifier other | %28ASCE%29CP.1943-5487.0000917.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4268382 | |
description abstract | To enable full automation, automated code compliance checking systems need to extract regulatory information in building codes and convert it to computable representations. This conversion is a natural language processing (NLP) task that requires highly accurate part-of-speech (POS) tagging results on building codes. Existing POS taggers, however, do not provide such accuracy on building codes. To address this need, the authors propose to improve the performance of POS taggers by error-driven transformational rules that revise machine-tagged POS results. The proposed method utilizes a syntactic and semantic rule-based, NLP approach combined with a structure that is inspired by transfer learning. This method generates a group of transformational rulesets, from simple ones to complex ones, that will convert machine taggers’ tagging results to their corresponding human-labeled gold standard. The transformational rules utilize syntactic and semantic information of domain texts. All rules are constrained not to introduce any new errors when fixing existing errors of machine taggers. The last ruleset, which fixes most common remaining errors in textual data after all other rules are applied, is exempted from this constraint. An experimental test on part-of-speech tagged building code (PTBC) data shows this method reduced 82.7% of errors in POS tagging results of building codes, which increased the POS tagging accuracy on building codes from 89.13% to 98.12%. | |
publisher | ASCE | |
title | Building Codes Part-of-Speech Tagging Performance Improvement by Error-Driven Transformational Rules | |
type | Journal Paper | |
journal volume | 34 | |
journal issue | 5 | |
journal title | Journal of Computing in Civil Engineering | |
identifier doi | 10.1061/(ASCE)CP.1943-5487.0000917 | |
page | 10 | |
tree | Journal of Computing in Civil Engineering:;2020:;Volume ( 034 ):;issue: 005 | |
contenttype | Fulltext | |