Multivariate Analysis and Anomaly Detection of a US Reservoir Sedimentation Data SetSource: Journal of Hydrologic Engineering:;2024:;Volume ( 029 ):;issue: 005::page 04024031-1Author:Alejandra Botero-Acosta
,
Amanda L. Cox
,
Vasit Sagan
,
Ibrahim Demir
,
Marian Muste
,
Paul Boyd
,
Chandra Pathak
DOI: 10.1061/JHYEFF.HEENG-6206Publisher: American Society of Civil Engineers
Abstract: Sedimentation processes in reservoirs can jeopardize their functionality and compromise dam safety. Climate change and associated hydrologic uncertainty are introducing additional stressors to US reservoirs, and data-driven indicators of climate impacts on upstream soil erosion and reservoir sedimentation processes are crucial to evaluate their aggradation and life expectancy. The US Army Corps of Engineers developed the Enhancing Reservoir Sedimentation Information for Climate Preparedness and Resilience (RSI) system to consolidate historical information of elevation-capacity surveys. However, the multiple surveying technologies, protocols, and computational analysis methods used over the service life of reservoirs can impact the quality of reservoir survey data in the RSI system. The objective of this study was to develop a methodology to detect anomalous records and identify multivariate relationships between historical sedimentation data for 184 US reservoirs and associated watershed variables. For this purpose, unsupervised machine learning techniques including principal component analysis (PCA), autonomous anomaly detection, and Kolmogorov–Smirnov and Efron anomaly detection were assembled in an anomaly detection protocol that led to the detection of 20 reservoirs with anomalous records. The variables contributing most to anomaly detection were related to elevation characteristics (watershed and channel slopes, and minimum elevation), precipitation (maximum and cumulative monthly precipitation), dam properties (time since dam completion and initial trap efficiency), and curve number (CN). PCA results indicated that reservoirs in the Mediterranean California ecoregion, although experiencing substantial extreme precipitation events, had small basin areas and CN values that reflected in small capacity losses, contrasting with larger capacity losses found at reservoirs in the Great Plains and Eastern Temperate Forests ecoregions. The developed anomaly detection protocol represents a powerful tool for the analysis and monitoring of this large and heterogeneous data set with the potential of providing reliable information on the impacts of historical climate and watershed properties on erosion and sedimentation processes in US reservoirs. The US Army Corps of Engineers created the Reservoir Sedimentation Information (RSI) system to compile historical reservoir elevation-capacity data collected using various measurement protocols, instruments, and analysis methods. These differences in data collection and analysis methods, in addition to any human error, can result in anomalies that require detection and correction before the dissemination of the data set for further usage. Data anomalies are values that deviate from normal or expected patterns. Apparent erroneous data, related to duplicate records or increases in reservoir capacities, can be flagged through a preliminary analysis. However, the detection of anomalies in an automated and fully data-driven way represents a powerful tool for the maintenance and monitoring of this large and heterogeneous data set. A depurated RSI data set is a potential major data source for large-scale and long-term studies related to sedimentation rates and suspended solid loads in freshwater systems due to the spatial and temporal scale of its records. This kind of data set will allow the development of effective management plans for reservoir operation, maintenance, and upstream erosion control as well as enabling the indirect monitoring of suspended sediment loads in freshwater systems at a nationwide scale.
|
Collections
Show full item record
contributor author | Alejandra Botero-Acosta | |
contributor author | Amanda L. Cox | |
contributor author | Vasit Sagan | |
contributor author | Ibrahim Demir | |
contributor author | Marian Muste | |
contributor author | Paul Boyd | |
contributor author | Chandra Pathak | |
date accessioned | 2024-12-24T10:30:43Z | |
date available | 2024-12-24T10:30:43Z | |
date copyright | 10/1/2024 12:00:00 AM | |
date issued | 2024 | |
identifier other | JHYEFF.HEENG-6206.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4299057 | |
description abstract | Sedimentation processes in reservoirs can jeopardize their functionality and compromise dam safety. Climate change and associated hydrologic uncertainty are introducing additional stressors to US reservoirs, and data-driven indicators of climate impacts on upstream soil erosion and reservoir sedimentation processes are crucial to evaluate their aggradation and life expectancy. The US Army Corps of Engineers developed the Enhancing Reservoir Sedimentation Information for Climate Preparedness and Resilience (RSI) system to consolidate historical information of elevation-capacity surveys. However, the multiple surveying technologies, protocols, and computational analysis methods used over the service life of reservoirs can impact the quality of reservoir survey data in the RSI system. The objective of this study was to develop a methodology to detect anomalous records and identify multivariate relationships between historical sedimentation data for 184 US reservoirs and associated watershed variables. For this purpose, unsupervised machine learning techniques including principal component analysis (PCA), autonomous anomaly detection, and Kolmogorov–Smirnov and Efron anomaly detection were assembled in an anomaly detection protocol that led to the detection of 20 reservoirs with anomalous records. The variables contributing most to anomaly detection were related to elevation characteristics (watershed and channel slopes, and minimum elevation), precipitation (maximum and cumulative monthly precipitation), dam properties (time since dam completion and initial trap efficiency), and curve number (CN). PCA results indicated that reservoirs in the Mediterranean California ecoregion, although experiencing substantial extreme precipitation events, had small basin areas and CN values that reflected in small capacity losses, contrasting with larger capacity losses found at reservoirs in the Great Plains and Eastern Temperate Forests ecoregions. The developed anomaly detection protocol represents a powerful tool for the analysis and monitoring of this large and heterogeneous data set with the potential of providing reliable information on the impacts of historical climate and watershed properties on erosion and sedimentation processes in US reservoirs. The US Army Corps of Engineers created the Reservoir Sedimentation Information (RSI) system to compile historical reservoir elevation-capacity data collected using various measurement protocols, instruments, and analysis methods. These differences in data collection and analysis methods, in addition to any human error, can result in anomalies that require detection and correction before the dissemination of the data set for further usage. Data anomalies are values that deviate from normal or expected patterns. Apparent erroneous data, related to duplicate records or increases in reservoir capacities, can be flagged through a preliminary analysis. However, the detection of anomalies in an automated and fully data-driven way represents a powerful tool for the maintenance and monitoring of this large and heterogeneous data set. A depurated RSI data set is a potential major data source for large-scale and long-term studies related to sedimentation rates and suspended solid loads in freshwater systems due to the spatial and temporal scale of its records. This kind of data set will allow the development of effective management plans for reservoir operation, maintenance, and upstream erosion control as well as enabling the indirect monitoring of suspended sediment loads in freshwater systems at a nationwide scale. | |
publisher | American Society of Civil Engineers | |
title | Multivariate Analysis and Anomaly Detection of a US Reservoir Sedimentation Data Set | |
type | Journal Article | |
journal volume | 29 | |
journal issue | 5 | |
journal title | Journal of Hydrologic Engineering | |
identifier doi | 10.1061/JHYEFF.HEENG-6206 | |
journal fristpage | 04024031-1 | |
journal lastpage | 04024031-14 | |
page | 14 | |
tree | Journal of Hydrologic Engineering:;2024:;Volume ( 029 ):;issue: 005 | |
contenttype | Fulltext |