Systematic Analysis of the Impact of Data Preprocessing Techniques on Machine-Learning Model Performance: A Case Study of a Compressive Strength Prediction Model for Geopolymer ConcreteSource: Journal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 004::page 04025051-1Author:Madushan Rathnayaka
,
Dulakshi Karunasingha
,
Chamila Gunasekara
,
Kushan Wijesundara
,
David W. Law
,
Weena Lokuge
DOI: 10.1061/JCCEE5.CPENG-6547Publisher: American Society of Civil Engineers
Abstract: Geopolymer concrete (GPC) is emerging as a sustainable alternative to ordinary Portland cement (OPC) concrete. However, developing mix designs for GPC presents unique challenges due to the variability in fly ash properties and the selection of appropriate alkaline activators. Traditional experimental and statistical methods often fall short in predicting the compressive strength of GPC. Recently, machine learning (ML) models have gained popularity in this area. This study evaluates the impact of various data preprocessing (DP) techniques namely missing data imputation, outlier detection, and normalization of the performance of an artificial neural network (ANN) model for predicting the compressive strength of GPC. The study also assessed the influence of feature selection using an ANN model considering common input variables identified in the literature. Results indicate that the chemical oxide content of fly ash, NaOH solid content, and total water content are critical to accurate strength prediction. Furthermore, curing temperature and time significantly enhances model accuracy. The study found that removing missing data and using median outlier detection improved model performance, while proper hyperparameter tuning markedly enhanced prediction accuracy. Validation through laboratory experiments confirmed the model’s reliability, demonstrating its potential for advancing GPC mix design.
|
Collections
Show full item record
contributor author | Madushan Rathnayaka | |
contributor author | Dulakshi Karunasingha | |
contributor author | Chamila Gunasekara | |
contributor author | Kushan Wijesundara | |
contributor author | David W. Law | |
contributor author | Weena Lokuge | |
date accessioned | 2025-08-17T22:36:30Z | |
date available | 2025-08-17T22:36:30Z | |
date copyright | 7/1/2025 12:00:00 AM | |
date issued | 2025 | |
identifier other | JCCEE5.CPENG-6547.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4307183 | |
description abstract | Geopolymer concrete (GPC) is emerging as a sustainable alternative to ordinary Portland cement (OPC) concrete. However, developing mix designs for GPC presents unique challenges due to the variability in fly ash properties and the selection of appropriate alkaline activators. Traditional experimental and statistical methods often fall short in predicting the compressive strength of GPC. Recently, machine learning (ML) models have gained popularity in this area. This study evaluates the impact of various data preprocessing (DP) techniques namely missing data imputation, outlier detection, and normalization of the performance of an artificial neural network (ANN) model for predicting the compressive strength of GPC. The study also assessed the influence of feature selection using an ANN model considering common input variables identified in the literature. Results indicate that the chemical oxide content of fly ash, NaOH solid content, and total water content are critical to accurate strength prediction. Furthermore, curing temperature and time significantly enhances model accuracy. The study found that removing missing data and using median outlier detection improved model performance, while proper hyperparameter tuning markedly enhanced prediction accuracy. Validation through laboratory experiments confirmed the model’s reliability, demonstrating its potential for advancing GPC mix design. | |
publisher | American Society of Civil Engineers | |
title | Systematic Analysis of the Impact of Data Preprocessing Techniques on Machine-Learning Model Performance: A Case Study of a Compressive Strength Prediction Model for Geopolymer Concrete | |
type | Journal Article | |
journal volume | 39 | |
journal issue | 4 | |
journal title | Journal of Computing in Civil Engineering | |
identifier doi | 10.1061/JCCEE5.CPENG-6547 | |
journal fristpage | 04025051-1 | |
journal lastpage | 04025051-13 | |
page | 13 | |
tree | Journal of Computing in Civil Engineering:;2025:;Volume ( 039 ):;issue: 004 | |
contenttype | Fulltext |