contributor author | Haile Woldesellasse | |
contributor author | Solomon Tesfamariam | |
date accessioned | 2022-02-01T22:07:18Z | |
date available | 2022-02-01T22:07:18Z | |
date issued | 8/1/2021 | |
identifier other | %28ASCE%29PS.1949-1204.0000572.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4272657 | |
description abstract | Data scarcity and missing values are a prime challenge in developing a corrosion prediction model. In this paper, eight imputation techniques are explored using the National Bureau of Standards (NBS) corrosion database. The eight imputation techniques are mean, median, linear regression (LR), K-nearest neighbor (KNN), iterative robust model-based imputation (IRMI), multiple imputations of incomplete multivariate data (AMELIA), sequential imputation for missing values (IMPSEQ), and principal component analysis (PCA). The utility of imputation techniques is checked by training a neural network (NN) on the data sets imputed by the eight techniques. Among the techniques, KNN and IMPSEQ performed better by achieving a low error and high coefficient of determination R2. Results were compared with a baseline accuracy, where the NN model was trained on the original corrosion data set without the missing values. The NN performance increased from the baseline accuracy (81%) when it was trained by KNN (85%) and IMPSEQ (91%) imputed data sets. | |
publisher | ASCE | |
title | Handling Incomplete and Missing Data in Corrosion Pit Measurement Database Using Imputation Methods: Model Development Using Artificial Neural Network | |
type | Journal Paper | |
journal volume | 12 | |
journal issue | 3 | |
journal title | Journal of Pipeline Systems Engineering and Practice | |
identifier doi | 10.1061/(ASCE)PS.1949-1204.0000572 | |
journal fristpage | 04021033-1 | |
journal lastpage | 04021033-12 | |
page | 12 | |
tree | Journal of Pipeline Systems Engineering and Practice:;2021:;Volume ( 012 ):;issue: 003 | |
contenttype | Fulltext | |