Show simple item record

contributor authorHaile Woldesellasse
contributor authorSolomon Tesfamariam
date accessioned2022-02-01T22:07:18Z
date available2022-02-01T22:07:18Z
date issued8/1/2021
identifier other%28ASCE%29PS.1949-1204.0000572.pdf
identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4272657
description abstractData scarcity and missing values are a prime challenge in developing a corrosion prediction model. In this paper, eight imputation techniques are explored using the National Bureau of Standards (NBS) corrosion database. The eight imputation techniques are mean, median, linear regression (LR), K-nearest neighbor (KNN), iterative robust model-based imputation (IRMI), multiple imputations of incomplete multivariate data (AMELIA), sequential imputation for missing values (IMPSEQ), and principal component analysis (PCA). The utility of imputation techniques is checked by training a neural network (NN) on the data sets imputed by the eight techniques. Among the techniques, KNN and IMPSEQ performed better by achieving a low error and high coefficient of determination R2. Results were compared with a baseline accuracy, where the NN model was trained on the original corrosion data set without the missing values. The NN performance increased from the baseline accuracy (81%) when it was trained by KNN (85%) and IMPSEQ (91%) imputed data sets.
publisherASCE
titleHandling Incomplete and Missing Data in Corrosion Pit Measurement Database Using Imputation Methods: Model Development Using Artificial Neural Network
typeJournal Paper
journal volume12
journal issue3
journal titleJournal of Pipeline Systems Engineering and Practice
identifier doi10.1061/(ASCE)PS.1949-1204.0000572
journal fristpage04021033-1
journal lastpage04021033-12
page12
treeJournal of Pipeline Systems Engineering and Practice:;2021:;Volume ( 012 ):;issue: 003
contenttypeFulltext


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record