Evaluation of Data-Driven and Empirical Models for the Estimation of Sediment Removal Efficiency in Settling Basins

Sanjeev Kumar; C. S. P. Ojha; Vijay P. Singh

Source: Journal of Irrigation and Drainage Engineering:;2025:;Volume ( 151 ):;issue: 004::page 04025012-1

Author:

Sanjeev Kumar

C. S. P. Ojha

Vijay P. Singh

DOI: 10.1061/JIDEDH.IRENG-10430

Publisher: American Society of Civil Engineers

Abstract: Recent studies have demonstrated the ability of data-driven models to predict sediment removal efficiency for settling basins, hydropower schemes, canals, and water treatment units. Empirical models, such as those cited herein, have also been employed for the estimation of sediment removal efficiency. This study evaluated the effectiveness of these empirical models as well as of data-driven models, such as the adaptive boost regressor (ADBR), extreme gradient boost regressor (XGBR), random forest (RF), and random tree (RT). A set of 328 observations on removal efficiency was used to determine the effectiveness of these models. Data analysis found that there was a high degree of variability in the performance of empirical models when measured by statistical indicators, such as the coefficient of determination (R2), root-mean-square error (RMSE), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE), and index of agreement (IA). The data-driven models outperformed empirical models in the prediction of sediment removal efficiency. The values of R2, RMSE, NSE, MAE, and IA ranged from 0.320 to 0.528, 0.319 to 0.251, 0.121 to 0.457, 0.249 to 0.195, and 0.662 to 0.827, respectively, for empirical models, and from 0.931 to 0.974, 0.092 to 0.055, 0.927 to 0.974, 0.064 to 0.038, and 0.981 to 0.993 for data-driven models. Among the empirical models, the Salmasi model exhibited a higher degree of accuracy, followed by the USBR, Garde, Raju, Camp-Dobbins, and Sumer models. The data-driven models were ranked from best to worst as ADBR, XGBR, RF, and RT, respectively. Additionally, the most suitable range of input parameters was assessed using the best-performing data-driven model ADBR. Data-driven models effectively captured the relationships between variables and the dynamic nature of the system under consideration, resulting in significantly improved predictive accuracy.

Download: (5.978Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Evaluation of Data-Driven and Empirical Models for the Estimation of Sediment Removal Efficiency in Settling Basins

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4307495

Collections

Journal of Irrigation and Drainage Engineering

Show full item record

contributor author	Sanjeev Kumar
contributor author	C. S. P. Ojha
contributor author	Vijay P. Singh
date accessioned	2025-08-17T22:49:09Z
date available	2025-08-17T22:49:09Z
date copyright	8/1/2025 12:00:00 AM
date issued	2025
identifier other	JIDEDH.IRENG-10430.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4307495
description abstract	Recent studies have demonstrated the ability of data-driven models to predict sediment removal efficiency for settling basins, hydropower schemes, canals, and water treatment units. Empirical models, such as those cited herein, have also been employed for the estimation of sediment removal efficiency. This study evaluated the effectiveness of these empirical models as well as of data-driven models, such as the adaptive boost regressor (ADBR), extreme gradient boost regressor (XGBR), random forest (RF), and random tree (RT). A set of 328 observations on removal efficiency was used to determine the effectiveness of these models. Data analysis found that there was a high degree of variability in the performance of empirical models when measured by statistical indicators, such as the coefficient of determination (R2), root-mean-square error (RMSE), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE), and index of agreement (IA). The data-driven models outperformed empirical models in the prediction of sediment removal efficiency. The values of R2, RMSE, NSE, MAE, and IA ranged from 0.320 to 0.528, 0.319 to 0.251, 0.121 to 0.457, 0.249 to 0.195, and 0.662 to 0.827, respectively, for empirical models, and from 0.931 to 0.974, 0.092 to 0.055, 0.927 to 0.974, 0.064 to 0.038, and 0.981 to 0.993 for data-driven models. Among the empirical models, the Salmasi model exhibited a higher degree of accuracy, followed by the USBR, Garde, Raju, Camp-Dobbins, and Sumer models. The data-driven models were ranked from best to worst as ADBR, XGBR, RF, and RT, respectively. Additionally, the most suitable range of input parameters was assessed using the best-performing data-driven model ADBR. Data-driven models effectively captured the relationships between variables and the dynamic nature of the system under consideration, resulting in significantly improved predictive accuracy.
publisher	American Society of Civil Engineers
title	Evaluation of Data-Driven and Empirical Models for the Estimation of Sediment Removal Efficiency in Settling Basins
type	Journal Article
journal volume	151
journal issue	4
journal title	Journal of Irrigation and Drainage Engineering
identifier doi	10.1061/JIDEDH.IRENG-10430
journal fristpage	04025012-1
journal lastpage	04025012-16
page	16
tree	Journal of Irrigation and Drainage Engineering:;2025:;Volume ( 151 ):;issue: 004
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive