Evaluation of Data-Driven and Empirical Models for the Estimation of Sediment Removal Efficiency in Settling BasinsSource: Journal of Irrigation and Drainage Engineering:;2025:;Volume ( 151 ):;issue: 004::page 04025012-1DOI: 10.1061/JIDEDH.IRENG-10430Publisher: American Society of Civil Engineers
Abstract: Recent studies have demonstrated the ability of data-driven models to predict sediment removal efficiency for settling basins, hydropower schemes, canals, and water treatment units. Empirical models, such as those cited herein, have also been employed for the estimation of sediment removal efficiency. This study evaluated the effectiveness of these empirical models as well as of data-driven models, such as the adaptive boost regressor (ADBR), extreme gradient boost regressor (XGBR), random forest (RF), and random tree (RT). A set of 328 observations on removal efficiency was used to determine the effectiveness of these models. Data analysis found that there was a high degree of variability in the performance of empirical models when measured by statistical indicators, such as the coefficient of determination (R2), root-mean-square error (RMSE), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE), and index of agreement (IA). The data-driven models outperformed empirical models in the prediction of sediment removal efficiency. The values of R2, RMSE, NSE, MAE, and IA ranged from 0.320 to 0.528, 0.319 to 0.251, 0.121 to 0.457, 0.249 to 0.195, and 0.662 to 0.827, respectively, for empirical models, and from 0.931 to 0.974, 0.092 to 0.055, 0.927 to 0.974, 0.064 to 0.038, and 0.981 to 0.993 for data-driven models. Among the empirical models, the Salmasi model exhibited a higher degree of accuracy, followed by the USBR, Garde, Raju, Camp-Dobbins, and Sumer models. The data-driven models were ranked from best to worst as ADBR, XGBR, RF, and RT, respectively. Additionally, the most suitable range of input parameters was assessed using the best-performing data-driven model ADBR. Data-driven models effectively captured the relationships between variables and the dynamic nature of the system under consideration, resulting in significantly improved predictive accuracy.
|
Collections
Show full item record
contributor author | Sanjeev Kumar | |
contributor author | C. S. P. Ojha | |
contributor author | Vijay P. Singh | |
date accessioned | 2025-08-17T22:49:09Z | |
date available | 2025-08-17T22:49:09Z | |
date copyright | 8/1/2025 12:00:00 AM | |
date issued | 2025 | |
identifier other | JIDEDH.IRENG-10430.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4307495 | |
description abstract | Recent studies have demonstrated the ability of data-driven models to predict sediment removal efficiency for settling basins, hydropower schemes, canals, and water treatment units. Empirical models, such as those cited herein, have also been employed for the estimation of sediment removal efficiency. This study evaluated the effectiveness of these empirical models as well as of data-driven models, such as the adaptive boost regressor (ADBR), extreme gradient boost regressor (XGBR), random forest (RF), and random tree (RT). A set of 328 observations on removal efficiency was used to determine the effectiveness of these models. Data analysis found that there was a high degree of variability in the performance of empirical models when measured by statistical indicators, such as the coefficient of determination (R2), root-mean-square error (RMSE), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE), and index of agreement (IA). The data-driven models outperformed empirical models in the prediction of sediment removal efficiency. The values of R2, RMSE, NSE, MAE, and IA ranged from 0.320 to 0.528, 0.319 to 0.251, 0.121 to 0.457, 0.249 to 0.195, and 0.662 to 0.827, respectively, for empirical models, and from 0.931 to 0.974, 0.092 to 0.055, 0.927 to 0.974, 0.064 to 0.038, and 0.981 to 0.993 for data-driven models. Among the empirical models, the Salmasi model exhibited a higher degree of accuracy, followed by the USBR, Garde, Raju, Camp-Dobbins, and Sumer models. The data-driven models were ranked from best to worst as ADBR, XGBR, RF, and RT, respectively. Additionally, the most suitable range of input parameters was assessed using the best-performing data-driven model ADBR. Data-driven models effectively captured the relationships between variables and the dynamic nature of the system under consideration, resulting in significantly improved predictive accuracy. | |
publisher | American Society of Civil Engineers | |
title | Evaluation of Data-Driven and Empirical Models for the Estimation of Sediment Removal Efficiency in Settling Basins | |
type | Journal Article | |
journal volume | 151 | |
journal issue | 4 | |
journal title | Journal of Irrigation and Drainage Engineering | |
identifier doi | 10.1061/JIDEDH.IRENG-10430 | |
journal fristpage | 04025012-1 | |
journal lastpage | 04025012-16 | |
page | 16 | |
tree | Journal of Irrigation and Drainage Engineering:;2025:;Volume ( 151 ):;issue: 004 | |
contenttype | Fulltext |