Enhancing Coagulation Prediction in Water Treatment Using a Similarity Score–Based Piecewise Machine Learning Model

Jie Zhang; Noah Tylor; Wen Zhao; Ran Rui; Charlie He

Source: Journal of Environmental Engineering:;2025:;Volume ( 151 ):;issue: 006::page 04025023-1

Author:

DOI: 10.1061/JOEEDU.EEENG-7969

Publisher: American Society of Civil Engineers

Abstract: Coagulation is vital in water treatment, but predicting coagulant doses using mechanistic models is challenging due to complex processes. Traditionally, jar testing is used, but it struggles with varying water quality. This study explores machine learning (ML) techniques to predict effluent quality and coagulant requirements. It found that random data splits yield higher similarity scores and better performance than time-based splits. Random forest regressor (RFR) and linear support vector regressor (Linear SVR) performed well in high and low similarity scenarios, respectively, while multilayer perceptron (MLP) showed consistent performance across all conditions. A hybrid ML model combining RFR, Linear SVR, and MLP outperformed individual models, reducing errors by 1%–45%. Converted to ONNX format, the model was integrated into the digital twin of a water purification plant in Texas, improving real-time predictions. This study demonstrates the potential of ML in optimizing water treatment and reducing the need for traditional testing methods. This study demonstrates how machine learning (ML) can improve the coagulation process in water treatment, offering a more efficient alternative to traditional jar testing. By using ML models, water treatment plants can predict the necessary coagulant doses more accurately and quickly, without relying on time-consuming manual tests. The hybrid ML model developed in this study combines multiple techniques to handle varying water quality conditions, improving the precision of coagulant predictions. The model has practical applications in real-time operations. For example, it has been successfully integrated into the digital twin of a water purification plant in Texas, where it provides real-time predictions to optimize treatment processes. This results in more efficient water treatment, reduced chemical usage, and lower operational costs. By reducing the need for traditional testing methods, this approach allows water treatment plants to respond faster to changes in water quality, ensuring more consistent and reliable water supply. This research shows that machine learning can be a valuable tool for improving water treatment processes, offering both operational and economic benefits for plants worldwide.

Download: (1.691Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Enhancing Coagulation Prediction in Water Treatment Using a Similarity Score–Based Piecewise Machine Learning Model

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4307794

Collections

Journal of Environmental Engineering

Show full item record

contributor author	Jie Zhang
contributor author	Noah Tylor
contributor author	Wen Zhao
contributor author	Ran Rui
contributor author	Charlie He
date accessioned	2025-08-17T23:01:33Z
date available	2025-08-17T23:01:33Z
date copyright	6/1/2025 12:00:00 AM
date issued	2025
identifier other	JOEEDU.EEENG-7969.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4307794
description abstract	Coagulation is vital in water treatment, but predicting coagulant doses using mechanistic models is challenging due to complex processes. Traditionally, jar testing is used, but it struggles with varying water quality. This study explores machine learning (ML) techniques to predict effluent quality and coagulant requirements. It found that random data splits yield higher similarity scores and better performance than time-based splits. Random forest regressor (RFR) and linear support vector regressor (Linear SVR) performed well in high and low similarity scenarios, respectively, while multilayer perceptron (MLP) showed consistent performance across all conditions. A hybrid ML model combining RFR, Linear SVR, and MLP outperformed individual models, reducing errors by 1%–45%. Converted to ONNX format, the model was integrated into the digital twin of a water purification plant in Texas, improving real-time predictions. This study demonstrates the potential of ML in optimizing water treatment and reducing the need for traditional testing methods. This study demonstrates how machine learning (ML) can improve the coagulation process in water treatment, offering a more efficient alternative to traditional jar testing. By using ML models, water treatment plants can predict the necessary coagulant doses more accurately and quickly, without relying on time-consuming manual tests. The hybrid ML model developed in this study combines multiple techniques to handle varying water quality conditions, improving the precision of coagulant predictions. The model has practical applications in real-time operations. For example, it has been successfully integrated into the digital twin of a water purification plant in Texas, where it provides real-time predictions to optimize treatment processes. This results in more efficient water treatment, reduced chemical usage, and lower operational costs. By reducing the need for traditional testing methods, this approach allows water treatment plants to respond faster to changes in water quality, ensuring more consistent and reliable water supply. This research shows that machine learning can be a valuable tool for improving water treatment processes, offering both operational and economic benefits for plants worldwide.
publisher	American Society of Civil Engineers
title	Enhancing Coagulation Prediction in Water Treatment Using a Similarity Score–Based Piecewise Machine Learning Model
type	Journal Article
journal volume	151
journal issue	6
journal title	Journal of Environmental Engineering
identifier doi	10.1061/JOEEDU.EEENG-7969
journal fristpage	04025023-1
journal lastpage	04025023-11
page	11
tree	Journal of Environmental Engineering:;2025:;Volume ( 151 ):;issue: 006
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive