Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.

Too low a concentration of dissolved oxygen (DO) in a river can disrupt the ecological balance, while too high a concentration may lead to eutrophication of the water body and threaten the health of the aquatic environment. Therefore, accurate prediction of DO concentration is crucial for water reso...

Full description

Saved in:
Bibliographic Details
Main Authors: Yubo Zhao, Mo Chen
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0319256
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849313284596957184
author Yubo Zhao
Mo Chen
author_facet Yubo Zhao
Mo Chen
author_sort Yubo Zhao
collection DOAJ
description Too low a concentration of dissolved oxygen (DO) in a river can disrupt the ecological balance, while too high a concentration may lead to eutrophication of the water body and threaten the health of the aquatic environment. Therefore, accurate prediction of DO concentration is crucial for water resource protection. In this study, a hybrid machine learning model for river DO prediction, called DWT-KPCA-GWO-XGBoost, is proposed, which combines the discrete wavelet transform (DWT), kernel principal component analysis (KPCA), gray wolf optimization algorithm (GWO), and extreme gradient boosting (XGBoost). Firstly, DWT-db4 was used to denoise the noisy water quality feature data; secondly, the meteorological data were simplified into four principal components by KPCA; finally, the water quality features and meteorological principal components were inputted into the GWO-optimized XGBoost model as features for training and prediction. The prediction performance of the model was comprehensively assessed by comparison with other machine learning models using MAE, MSE, MAPE, NSE, KGE and WI evaluation metrics. The model was tested at three different locations and the results showed that the model outperformed the other models, performing as follows: 0.5925, 0.6482, 6.3322, 0.8523, 0.8902, 0.9403; 0.4933, 0.4325, 6.2351, 0.8952, 0.7928, 0.8632; 0.2912, 0.2001, 4.0523, 0.7823, 0.8425, 0.8463 and the PICP values exceed 95%. The hybrid model demonstrated significant results in predicting dissolved oxygen concentrations for the next 15 days. Compared with other studies, we innovatively improved the prediction accuracy of the model significantly through noise removal and the introduction of multi-source features.
format Article
id doaj-art-5c5b63f1105f43f79f5b9ecc3f6e2de7
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-5c5b63f1105f43f79f5b9ecc3f6e2de72025-08-20T03:52:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01203e031925610.1371/journal.pone.0319256Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.Yubo ZhaoMo ChenToo low a concentration of dissolved oxygen (DO) in a river can disrupt the ecological balance, while too high a concentration may lead to eutrophication of the water body and threaten the health of the aquatic environment. Therefore, accurate prediction of DO concentration is crucial for water resource protection. In this study, a hybrid machine learning model for river DO prediction, called DWT-KPCA-GWO-XGBoost, is proposed, which combines the discrete wavelet transform (DWT), kernel principal component analysis (KPCA), gray wolf optimization algorithm (GWO), and extreme gradient boosting (XGBoost). Firstly, DWT-db4 was used to denoise the noisy water quality feature data; secondly, the meteorological data were simplified into four principal components by KPCA; finally, the water quality features and meteorological principal components were inputted into the GWO-optimized XGBoost model as features for training and prediction. The prediction performance of the model was comprehensively assessed by comparison with other machine learning models using MAE, MSE, MAPE, NSE, KGE and WI evaluation metrics. The model was tested at three different locations and the results showed that the model outperformed the other models, performing as follows: 0.5925, 0.6482, 6.3322, 0.8523, 0.8902, 0.9403; 0.4933, 0.4325, 6.2351, 0.8952, 0.7928, 0.8632; 0.2912, 0.2001, 4.0523, 0.7823, 0.8425, 0.8463 and the PICP values exceed 95%. The hybrid model demonstrated significant results in predicting dissolved oxygen concentrations for the next 15 days. Compared with other studies, we innovatively improved the prediction accuracy of the model significantly through noise removal and the introduction of multi-source features.https://doi.org/10.1371/journal.pone.0319256
spellingShingle Yubo Zhao
Mo Chen
Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
PLoS ONE
title Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
title_full Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
title_fullStr Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
title_full_unstemmed Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
title_short Prediction of river dissolved oxygen (DO) based on multi-source data and various machine learning coupling models.
title_sort prediction of river dissolved oxygen do based on multi source data and various machine learning coupling models
url https://doi.org/10.1371/journal.pone.0319256
work_keys_str_mv AT yubozhao predictionofriverdissolvedoxygendobasedonmultisourcedataandvariousmachinelearningcouplingmodels
AT mochen predictionofriverdissolvedoxygendobasedonmultisourcedataandvariousmachinelearningcouplingmodels