Supervised Stacking Ensemble Machine Learning Approach for Enhancing Prediction of Total Suspended Solids Concentration in Urban Watersheds
Source: Journal of Environmental Engineering, 2022, Volume 148, Issue 6, page 04022026
DOI: 10.1061/(ASCE)EE.1943-7870.0001998
Publisher: ASCE
Abstract: The potential for stacking ensemble modeling to enhance the performance and generalizability of machine learning (ML) models for the estimation of total suspended solids (TSS) concentration was assessed by comparing the results with ensemble boosting, bagging, and single ML models. Seven stacking ensemble models (M1 to M7) were created using combinations of basic learners, including single, bagging, and boosting models. Adaptive Boosting (AdB) was used as an aggregation method in M1 to M6. The six models showed coefficient of determination (R²) values ranging from 0.87 to 0.95, root mean square error (RMSE) values ranging from 50 to 90 mg/L, and mean absolute error (MAE) values ranging from 11 to 86 mg/L, where the best R², RMSE, and MAE values were 0.95, 50 mg/L, and 12 mg/L, respectively. To further improve the predictions, we tested aggregation methods, including AdB, Random Forest (RF), Variable Weighting kNN (VW-kNN), Regression Tree (RT), and Support Vector Regression (SVR), using the structure of the highest-performing M6 model. This led to a new best-fit model (M7) with RF as an aggregation method, with R², RMSE, and MAE values of 0.98, 32 mg/L, and 11 mg/L, respectively.
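The three scores quoted in the abstract (R², RMSE, and MAE) can be illustrated with a minimal sketch. The abstract does not give the formulas, so this assumes the common textbook definitions, with R² taken as one minus the ratio of the residual to the total sum of squares; the paper may define R² differently (for example, as a squared correlation). The function name `regression_metrics` and the sample TSS values are hypothetical and are not taken from the study.

```python
import math


def regression_metrics(observed, predicted):
    """Return (R2, RMSE, MAE) for paired observed and predicted values.

    Assumes the standard definitions; not necessarily those used in the paper.
    """
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    residuals = [o - p for o, p in zip(observed, predicted)]
    mean_obs = sum(observed) / n
    ss_res = sum(r * r for r in residuals)               # residual sum of squares
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)  # total sum of squares
    if ss_tot == 0:
        raise ValueError("R2 is undefined when all observations are identical")
    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)                         # same units as the data
    mae = sum(abs(r) for r in residuals) / n             # same units as the data
    return r2, rmse, mae


# Hypothetical TSS concentrations in mg/L, made up purely for illustration.
observed_tss = [100.0, 200.0, 300.0, 400.0]
predicted_tss = [110.0, 190.0, 310.0, 390.0]
r2, rmse, mae = regression_metrics(observed_tss, predicted_tss)
print(f"R2={r2:.3f}, RMSE={rmse:.1f} mg/L, MAE={mae:.1f} mg/L")
```

RMSE and MAE carry the units of the target (mg/L here), while R² is dimensionless, which is why the abstract reports the first two in mg/L and R² as a bare number.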
| contributor author | Mohammadreza Moeini | |
| contributor author | Ali Shojaeizadeh | |
| contributor author | Mengistu Geza | |
| date accessioned | 2022-05-07T21:00:46Z | |
| date available | 2022-05-07T21:00:46Z | |
| date issued | 2022-03-31 | |
| identifier other | (ASCE)EE.1943-7870.0001998.pdf | |
| identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4283192 | |
| publisher | ASCE | |
| title | Supervised Stacking Ensemble Machine Learning Approach for Enhancing Prediction of Total Suspended Solids Concentration in Urban Watersheds | |
| type | Journal Paper | |
| journal volume | 148 | |
| journal issue | 6 | |
| journal title | Journal of Environmental Engineering | |
| identifier doi | 10.1061/(ASCE)EE.1943-7870.0001998 | |
| journal firstpage | 04022026 | |
| journal lastpage | 04022026-12 | |
| page | 12 | |
| contenttype | Fulltext |