| contributor author | Haile Woldesellasse | |
| contributor author | Solomon Tesfamariam | |
| date accessioned | 2022-02-01T22:07:18Z | |
| date available | 2022-02-01T22:07:18Z | |
| date issued | 8/1/2021 | |
| identifier other | %28ASCE%29PS.1949-1204.0000572.pdf | |
| identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4272657 | |
| description abstract | Data scarcity and missing values are a prime challenge in developing a corrosion prediction model. In this paper, eight imputation techniques are explored using the National Bureau of Standards (NBS) corrosion database. The eight imputation techniques are mean, median, linear regression (LR), K-nearest neighbor (KNN), iterative robust model-based imputation (IRMI), multiple imputations of incomplete multivariate data (AMELIA), sequential imputation for missing values (IMPSEQ), and principal component analysis (PCA). The utility of imputation techniques is checked by training a neural network (NN) on the data sets imputed by the eight techniques. Among the techniques, KNN and IMPSEQ performed better by achieving a low error and high coefficient of determination R2. Results were compared with a baseline accuracy, where the NN model was trained on the original corrosion data set without the missing values. The NN performance increased from the baseline accuracy (81%) when it was trained by KNN (85%) and IMPSEQ (91%) imputed data sets. | |
| publisher | ASCE | |
| title | Handling Incomplete and Missing Data in Corrosion Pit Measurement Database Using Imputation Methods: Model Development Using Artificial Neural Network | |
| type | Journal Paper | |
| journal volume | 12 | |
| journal issue | 3 | |
| journal title | Journal of Pipeline Systems Engineering and Practice | |
| identifier doi | 10.1061/(ASCE)PS.1949-1204.0000572 | |
| journal fristpage | 04021033-1 | |
| journal lastpage | 04021033-12 | |
| page | 12 | |
| tree | Journal of Pipeline Systems Engineering and Practice:;2021:;Volume ( 012 ):;issue: 003 | |
| contenttype | Fulltext | |