Supervised Stacking Ensemble Machine Learning Approach for Enhancing Prediction of Total Suspended Solids Concentration in Urban Watersheds

Mohammadreza Moeini; Ali Shojaeizadeh; Mengistu Geza

Source: Journal of Environmental Engineering:;2022:;Volume ( 148 ):;issue: 006::page 04022026

Author:

Mohammadreza Moeini

Ali Shojaeizadeh

Mengistu Geza

DOI: 10.1061/(ASCE)EE.1943-7870.0001998

Publisher: ASCE

Abstract: The potential for stacking ensemble modeling to enhance the performance and generalizability of machine learning (ML) models for the estimation of total suspended solids (TSS) concentration was assessed by comparing the results with ensemble boosting, bagging, and single ML models. Seven stacking ensemble models (M1 to M7) were created using combinations of basic learners, including single, bagging, and boosting models. Adaptive Boosting (AdB) was used as an aggregation method in M1 to M6. The six models showed coefficient of determination (R2) values ranging from 0.87 to 0.95, root mean square error (RMSE) values ranging from 50 to 90 mg/L, and mean absolute error (MAE) values ranging from 11 to 86 mg/L where the best R2, RMSE, and MAE values were 0.95, 50 mg/L, and 12 mg/L, respectively. To further improve the predictions, we tested aggregation methods, including AdB, Random Forest (RF), Variable Weighting kNN (VW-kNN), Regression Tree (RT), and Support Vector Regression (SVR) using the structure of the highest-performing M6 model. This led to a new best fit model (M7) with RF as an aggregation method with R2, RMSE, and MAE values of 0.98, 32 mg/L, and 11 mg/L, respectively.

Download: (797.8Kb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Supervised Stacking Ensemble Machine Learning Approach for Enhancing Prediction of Total Suspended Solids Concentration in Urban Watersheds

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4283192

Collections

Journal of Environmental Engineering

Show full item record

contributor author	Mohammadreza Moeini
contributor author	Ali Shojaeizadeh
contributor author	Mengistu Geza
date accessioned	2022-05-07T21:00:46Z
date available	2022-05-07T21:00:46Z
date issued	2022-03-31
identifier other	(ASCE)EE.1943-7870.0001998.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4283192
description abstract	The potential for stacking ensemble modeling to enhance the performance and generalizability of machine learning (ML) models for the estimation of total suspended solids (TSS) concentration was assessed by comparing the results with ensemble boosting, bagging, and single ML models. Seven stacking ensemble models (M1 to M7) were created using combinations of basic learners, including single, bagging, and boosting models. Adaptive Boosting (AdB) was used as an aggregation method in M1 to M6. The six models showed coefficient of determination (R2) values ranging from 0.87 to 0.95, root mean square error (RMSE) values ranging from 50 to 90 mg/L, and mean absolute error (MAE) values ranging from 11 to 86 mg/L where the best R2, RMSE, and MAE values were 0.95, 50 mg/L, and 12 mg/L, respectively. To further improve the predictions, we tested aggregation methods, including AdB, Random Forest (RF), Variable Weighting kNN (VW-kNN), Regression Tree (RT), and Support Vector Regression (SVR) using the structure of the highest-performing M6 model. This led to a new best fit model (M7) with RF as an aggregation method with R2, RMSE, and MAE values of 0.98, 32 mg/L, and 11 mg/L, respectively.
publisher	ASCE
title	Supervised Stacking Ensemble Machine Learning Approach for Enhancing Prediction of Total Suspended Solids Concentration in Urban Watersheds
type	Journal Paper
journal volume	148
journal issue	6
journal title	Journal of Environmental Engineering
identifier doi	10.1061/(ASCE)EE.1943-7870.0001998
journal fristpage	04022026
journal lastpage	04022026-12
page	12
tree	Journal of Environmental Engineering:;2022:;Volume ( 148 ):;issue: 006
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive