Evaluating different strategies for machine learning training applied to flow forecasting based on clustering of flood events

ABSTRACT This paper presents a new hydrological modeling approach for discharge prediction based on flood clustering. Combined with Machine Learning techniques, river flow simulation is optimized through increased data similarity within clusters. Using daily mean discharge from 1964 to 2015 in União...

Full description

Saved in:
Bibliographic Details
Main Authors: Patrícia Cristina Steffen, Júlio Gomes, Eloy Kaviski, Daniel Henrique Marco Detzel
Format: Article
Language:English
Published: Associação Brasileira de Recursos Hídricos 2025-05-01
Series:Revista Brasileira de Recursos Hídricos
Subjects:
Online Access:http://www.scielo.br/scielo.php?script=sci_arttext&pid=S2318-03312025000100218&lng=en&tlng=en
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:ABSTRACT This paper presents a new hydrological modeling approach for discharge prediction based on flood clustering. Combined with Machine Learning techniques, river flow simulation is optimized through increased data similarity within clusters. Using daily mean discharge from 1964 to 2015 in União da Vitória (Iguaçu River basin, Paraná State, Brazil), the Fuzzy C-Means algorithm clustered flood events into three groups. So, five models were trained: one for the complete series, one for all flood events, and one for each cluster. The Support Vector Regression algorithm was used to develop Artificial Intelligence (AI) models, that had better performance in predicting discharge for each group they were trained and showed similar efficiency to the model trained for the entire series for a 1-day forecast time. The present paper discusses only the results from the training and testing phases. A future paper (in elaboration) will present the development and evaluation of the flow forecast models based on the proposed methodology.
ISSN:2318-0331