Enhancing Coagulation Prediction in Water Treatment Using a Similarity Score–Based Piecewise Machine Learning ModelSource: Journal of Environmental Engineering:;2025:;Volume ( 151 ):;issue: 006::page 04025023-1DOI: 10.1061/JOEEDU.EEENG-7969Publisher: American Society of Civil Engineers
Abstract: Coagulation is vital in water treatment, but predicting coagulant doses using mechanistic models is challenging due to complex processes. Traditionally, jar testing is used, but it struggles with varying water quality. This study explores machine learning (ML) techniques to predict effluent quality and coagulant requirements. It found that random data splits yield higher similarity scores and better performance than time-based splits. Random forest regressor (RFR) and linear support vector regressor (Linear SVR) performed well in high and low similarity scenarios, respectively, while multilayer perceptron (MLP) showed consistent performance across all conditions. A hybrid ML model combining RFR, Linear SVR, and MLP outperformed individual models, reducing errors by 1%–45%. Converted to ONNX format, the model was integrated into the digital twin of a water purification plant in Texas, improving real-time predictions. This study demonstrates the potential of ML in optimizing water treatment and reducing the need for traditional testing methods. This study demonstrates how machine learning (ML) can improve the coagulation process in water treatment, offering a more efficient alternative to traditional jar testing. By using ML models, water treatment plants can predict the necessary coagulant doses more accurately and quickly, without relying on time-consuming manual tests. The hybrid ML model developed in this study combines multiple techniques to handle varying water quality conditions, improving the precision of coagulant predictions. The model has practical applications in real-time operations. For example, it has been successfully integrated into the digital twin of a water purification plant in Texas, where it provides real-time predictions to optimize treatment processes. This results in more efficient water treatment, reduced chemical usage, and lower operational costs. By reducing the need for traditional testing methods, this approach allows water treatment plants to respond faster to changes in water quality, ensuring more consistent and reliable water supply. This research shows that machine learning can be a valuable tool for improving water treatment processes, offering both operational and economic benefits for plants worldwide.
|
Collections
Show full item record
contributor author | Jie Zhang | |
contributor author | Noah Tylor | |
contributor author | Wen Zhao | |
contributor author | Ran Rui | |
contributor author | Charlie He | |
date accessioned | 2025-08-17T23:01:33Z | |
date available | 2025-08-17T23:01:33Z | |
date copyright | 6/1/2025 12:00:00 AM | |
date issued | 2025 | |
identifier other | JOEEDU.EEENG-7969.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4307794 | |
description abstract | Coagulation is vital in water treatment, but predicting coagulant doses using mechanistic models is challenging due to complex processes. Traditionally, jar testing is used, but it struggles with varying water quality. This study explores machine learning (ML) techniques to predict effluent quality and coagulant requirements. It found that random data splits yield higher similarity scores and better performance than time-based splits. Random forest regressor (RFR) and linear support vector regressor (Linear SVR) performed well in high and low similarity scenarios, respectively, while multilayer perceptron (MLP) showed consistent performance across all conditions. A hybrid ML model combining RFR, Linear SVR, and MLP outperformed individual models, reducing errors by 1%–45%. Converted to ONNX format, the model was integrated into the digital twin of a water purification plant in Texas, improving real-time predictions. This study demonstrates the potential of ML in optimizing water treatment and reducing the need for traditional testing methods. This study demonstrates how machine learning (ML) can improve the coagulation process in water treatment, offering a more efficient alternative to traditional jar testing. By using ML models, water treatment plants can predict the necessary coagulant doses more accurately and quickly, without relying on time-consuming manual tests. The hybrid ML model developed in this study combines multiple techniques to handle varying water quality conditions, improving the precision of coagulant predictions. The model has practical applications in real-time operations. For example, it has been successfully integrated into the digital twin of a water purification plant in Texas, where it provides real-time predictions to optimize treatment processes. This results in more efficient water treatment, reduced chemical usage, and lower operational costs. By reducing the need for traditional testing methods, this approach allows water treatment plants to respond faster to changes in water quality, ensuring more consistent and reliable water supply. This research shows that machine learning can be a valuable tool for improving water treatment processes, offering both operational and economic benefits for plants worldwide. | |
publisher | American Society of Civil Engineers | |
title | Enhancing Coagulation Prediction in Water Treatment Using a Similarity Score–Based Piecewise Machine Learning Model | |
type | Journal Article | |
journal volume | 151 | |
journal issue | 6 | |
journal title | Journal of Environmental Engineering | |
identifier doi | 10.1061/JOEEDU.EEENG-7969 | |
journal fristpage | 04025023-1 | |
journal lastpage | 04025023-11 | |
page | 11 | |
tree | Journal of Environmental Engineering:;2025:;Volume ( 151 ):;issue: 006 | |
contenttype | Fulltext |