Multivariate Analysis and Anomaly Detection of a US Reservoir Sedimentation Data Set

Alejandra Botero-Acosta; Amanda L. Cox; Vasit Sagan; Ibrahim Demir; Marian Muste; Paul Boyd; Chandra Pathak

Source: Journal of Hydrologic Engineering:;2024:;Volume ( 029 ):;issue: 005::page 04024031-1

Author:

Alejandra Botero-Acosta

DOI: 10.1061/JHYEFF.HEENG-6206

Publisher: American Society of Civil Engineers

Abstract: Sedimentation processes in reservoirs can jeopardize their functionality and compromise dam safety. Climate change and associated hydrologic uncertainty are introducing additional stressors to US reservoirs, and data-driven indicators of climate impacts on upstream soil erosion and reservoir sedimentation processes are crucial to evaluate their aggradation and life expectancy. The US Army Corps of Engineers developed the Enhancing Reservoir Sedimentation Information for Climate Preparedness and Resilience (RSI) system to consolidate historical information of elevation-capacity surveys. However, the multiple surveying technologies, protocols, and computational analysis methods used over the service life of reservoirs can impact the quality of reservoir survey data in the RSI system. The objective of this study was to develop a methodology to detect anomalous records and identify multivariate relationships between historical sedimentation data for 184 US reservoirs and associated watershed variables. For this purpose, unsupervised machine learning techniques including principal component analysis (PCA), autonomous anomaly detection, and Kolmogorov–Smirnov and Efron anomaly detection were assembled in an anomaly detection protocol that led to the detection of 20 reservoirs with anomalous records. The variables contributing most to anomaly detection were related to elevation characteristics (watershed and channel slopes, and minimum elevation), precipitation (maximum and cumulative monthly precipitation), dam properties (time since dam completion and initial trap efficiency), and curve number (CN). PCA results indicated that reservoirs in the Mediterranean California ecoregion, although experiencing substantial extreme precipitation events, had small basin areas and CN values that reflected in small capacity losses, contrasting with larger capacity losses found at reservoirs in the Great Plains and Eastern Temperate Forests ecoregions. The developed anomaly detection protocol represents a powerful tool for the analysis and monitoring of this large and heterogeneous data set with the potential of providing reliable information on the impacts of historical climate and watershed properties on erosion and sedimentation processes in US reservoirs. The US Army Corps of Engineers created the Reservoir Sedimentation Information (RSI) system to compile historical reservoir elevation-capacity data collected using various measurement protocols, instruments, and analysis methods. These differences in data collection and analysis methods, in addition to any human error, can result in anomalies that require detection and correction before the dissemination of the data set for further usage. Data anomalies are values that deviate from normal or expected patterns. Apparent erroneous data, related to duplicate records or increases in reservoir capacities, can be flagged through a preliminary analysis. However, the detection of anomalies in an automated and fully data-driven way represents a powerful tool for the maintenance and monitoring of this large and heterogeneous data set. A depurated RSI data set is a potential major data source for large-scale and long-term studies related to sedimentation rates and suspended solid loads in freshwater systems due to the spatial and temporal scale of its records. This kind of data set will allow the development of effective management plans for reservoir operation, maintenance, and upstream erosion control as well as enabling the indirect monitoring of suspended sediment loads in freshwater systems at a nationwide scale.

Download: (3.105Mb)
Show Full MetaData Hide Full MetaData
Get RIS
Item Order
Go To Publisher
Price: 5000 Rial
Statistics

Multivariate Analysis and Anomaly Detection of a US Reservoir Sedimentation Data Set

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4299057

Collections

Journal of Hydrologic Engineering

Show full item record

contributor author	Alejandra Botero-Acosta
contributor author	Amanda L. Cox
contributor author	Vasit Sagan
contributor author	Ibrahim Demir
contributor author	Marian Muste
contributor author	Paul Boyd
contributor author	Chandra Pathak
date accessioned	2024-12-24T10:30:43Z
date available	2024-12-24T10:30:43Z
date copyright	10/1/2024 12:00:00 AM
date issued	2024
identifier other	JHYEFF.HEENG-6206.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4299057
description abstract	Sedimentation processes in reservoirs can jeopardize their functionality and compromise dam safety. Climate change and associated hydrologic uncertainty are introducing additional stressors to US reservoirs, and data-driven indicators of climate impacts on upstream soil erosion and reservoir sedimentation processes are crucial to evaluate their aggradation and life expectancy. The US Army Corps of Engineers developed the Enhancing Reservoir Sedimentation Information for Climate Preparedness and Resilience (RSI) system to consolidate historical information of elevation-capacity surveys. However, the multiple surveying technologies, protocols, and computational analysis methods used over the service life of reservoirs can impact the quality of reservoir survey data in the RSI system. The objective of this study was to develop a methodology to detect anomalous records and identify multivariate relationships between historical sedimentation data for 184 US reservoirs and associated watershed variables. For this purpose, unsupervised machine learning techniques including principal component analysis (PCA), autonomous anomaly detection, and Kolmogorov–Smirnov and Efron anomaly detection were assembled in an anomaly detection protocol that led to the detection of 20 reservoirs with anomalous records. The variables contributing most to anomaly detection were related to elevation characteristics (watershed and channel slopes, and minimum elevation), precipitation (maximum and cumulative monthly precipitation), dam properties (time since dam completion and initial trap efficiency), and curve number (CN). PCA results indicated that reservoirs in the Mediterranean California ecoregion, although experiencing substantial extreme precipitation events, had small basin areas and CN values that reflected in small capacity losses, contrasting with larger capacity losses found at reservoirs in the Great Plains and Eastern Temperate Forests ecoregions. The developed anomaly detection protocol represents a powerful tool for the analysis and monitoring of this large and heterogeneous data set with the potential of providing reliable information on the impacts of historical climate and watershed properties on erosion and sedimentation processes in US reservoirs. The US Army Corps of Engineers created the Reservoir Sedimentation Information (RSI) system to compile historical reservoir elevation-capacity data collected using various measurement protocols, instruments, and analysis methods. These differences in data collection and analysis methods, in addition to any human error, can result in anomalies that require detection and correction before the dissemination of the data set for further usage. Data anomalies are values that deviate from normal or expected patterns. Apparent erroneous data, related to duplicate records or increases in reservoir capacities, can be flagged through a preliminary analysis. However, the detection of anomalies in an automated and fully data-driven way represents a powerful tool for the maintenance and monitoring of this large and heterogeneous data set. A depurated RSI data set is a potential major data source for large-scale and long-term studies related to sedimentation rates and suspended solid loads in freshwater systems due to the spatial and temporal scale of its records. This kind of data set will allow the development of effective management plans for reservoir operation, maintenance, and upstream erosion control as well as enabling the indirect monitoring of suspended sediment loads in freshwater systems at a nationwide scale.
publisher	American Society of Civil Engineers
title	Multivariate Analysis and Anomaly Detection of a US Reservoir Sedimentation Data Set
type	Journal Article
journal volume	29
journal issue	5
journal title	Journal of Hydrologic Engineering
identifier doi	10.1061/JHYEFF.HEENG-6206
journal fristpage	04024031-1
journal lastpage	04024031-14
page	14
tree	Journal of Hydrologic Engineering:;2024:;Volume ( 029 ):;issue: 005
contenttype	Fulltext

YaBeSH Engineering and Technology Library

Archive