Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting

The hydrological process has a dynamic nature characterised by randomness and complex phenomena. The application of machine learning (ML) models in forecasting river flow has grown rapidly. This is owing to their capacity to simulate the complex phenomena associated with hydrological and environment...

Full description

Saved in:
Bibliographic Details
Main Authors: Hai Tao, Ali Omran Al-Sulttani, Ameen Mohammed Salih Ameen, Zainab Hasan Ali, Nadhir Al-Ansari, Sinan Q. Salih, Reham R. Mostafa
Format: Article
Language:English
Published: Wiley 2020-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2020/8844367
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832550884751441920
author Hai Tao
Ali Omran Al-Sulttani
Ameen Mohammed Salih Ameen
Zainab Hasan Ali
Nadhir Al-Ansari
Sinan Q. Salih
Reham R. Mostafa
author_facet Hai Tao
Ali Omran Al-Sulttani
Ameen Mohammed Salih Ameen
Zainab Hasan Ali
Nadhir Al-Ansari
Sinan Q. Salih
Reham R. Mostafa
author_sort Hai Tao
collection DOAJ
description The hydrological process has a dynamic nature characterised by randomness and complex phenomena. The application of machine learning (ML) models in forecasting river flow has grown rapidly. This is owing to their capacity to simulate the complex phenomena associated with hydrological and environmental processes. Four different ML models were developed for river flow forecasting located in semiarid region, Iraq. The effectiveness of data division influence on the ML models process was investigated. Three data division modeling scenarios were inspected including 70%–30%, 80%–20, and 90%–10%. Several statistical indicators are computed to verify the performance of the models. The results revealed the potential of the hybridized support vector regression model with a genetic algorithm (SVR-GA) over the other ML forecasting models for monthly river flow forecasting using 90%–10% data division. In addition, it was found to improve the accuracy in forecasting high flow events. The unique architecture of developed SVR-GA due to the ability of the GA optimizer to tune the internal parameters of the SVR model provides a robust learning process. This has made it more efficient in forecasting stochastic river flow behaviour compared to the other developed hybrid models.
format Article
id doaj-art-18274ab1aa554f9e9c66b9647cc10b03
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2020-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-18274ab1aa554f9e9c66b9647cc10b032025-02-03T06:05:28ZengWileyComplexity1076-27871099-05262020-01-01202010.1155/2020/88443678844367Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow ForecastingHai Tao0Ali Omran Al-Sulttani1Ameen Mohammed Salih Ameen2Zainab Hasan Ali3Nadhir Al-Ansari4Sinan Q. Salih5Reham R. Mostafa6Computer Science Department, Baoji University of Arts and Sciences, Baoji, Shaanxi, ChinaDepartment of Water Resources Engineering, College of Engineering, University of Baghdad, Baghdad, IraqDepartment of Water Resources Engineering, College of Engineering, University of Baghdad, Baghdad, IraqCivil Engineering Department, College of Engineering, University of Diyala, Baquba, IraqCivil, Environmental and Natural Resources Engineering, Lulea University of Technology, 97187 Lulea, SwedenInstitute of Research and Development, Duy Tan University, Da Nang 550000, VietnamInformation Systems Department, Faculty of Computers and Information Sciences, Mansoura University, Mansoura 35516, EgyptThe hydrological process has a dynamic nature characterised by randomness and complex phenomena. The application of machine learning (ML) models in forecasting river flow has grown rapidly. This is owing to their capacity to simulate the complex phenomena associated with hydrological and environmental processes. Four different ML models were developed for river flow forecasting located in semiarid region, Iraq. The effectiveness of data division influence on the ML models process was investigated. Three data division modeling scenarios were inspected including 70%–30%, 80%–20, and 90%–10%. Several statistical indicators are computed to verify the performance of the models. The results revealed the potential of the hybridized support vector regression model with a genetic algorithm (SVR-GA) over the other ML forecasting models for monthly river flow forecasting using 90%–10% data division. In addition, it was found to improve the accuracy in forecasting high flow events. The unique architecture of developed SVR-GA due to the ability of the GA optimizer to tune the internal parameters of the SVR model provides a robust learning process. This has made it more efficient in forecasting stochastic river flow behaviour compared to the other developed hybrid models.http://dx.doi.org/10.1155/2020/8844367
spellingShingle Hai Tao
Ali Omran Al-Sulttani
Ameen Mohammed Salih Ameen
Zainab Hasan Ali
Nadhir Al-Ansari
Sinan Q. Salih
Reham R. Mostafa
Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
Complexity
title Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
title_full Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
title_fullStr Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
title_full_unstemmed Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
title_short Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting
title_sort training and testing data division influence on hybrid machine learning model process application of river flow forecasting
url http://dx.doi.org/10.1155/2020/8844367
work_keys_str_mv AT haitao trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT aliomranalsulttani trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT ameenmohammedsalihameen trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT zainabhasanali trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT nadhiralansari trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT sinanqsalih trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting
AT rehamrmostafa trainingandtestingdatadivisioninfluenceonhybridmachinelearningmodelprocessapplicationofriverflowforecasting