Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration

Changes in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fi...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohammad Rasool Dehghani, Moein Kafi, Hamed Nikravesh, Maryam Aghel, Erfan Mohammadian, Yousef Kazemzadeh, Reza Azin
Format: Article
Language:English
Published: Elsevier 2024-12-01
Series:Results in Engineering
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590123024011447
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846115896328716288
author Mohammad Rasool Dehghani
Moein Kafi
Hamed Nikravesh
Maryam Aghel
Erfan Mohammadian
Yousef Kazemzadeh
Reza Azin
author_facet Mohammad Rasool Dehghani
Moein Kafi
Hamed Nikravesh
Maryam Aghel
Erfan Mohammadian
Yousef Kazemzadeh
Reza Azin
author_sort Mohammad Rasool Dehghani
collection DOAJ
description Changes in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fill this research gap, this study developed 15 models comprising five machine learning methods: regression trees, support vector regression, Gaussian process regression, bagged trees, and boosted trees, and three optimization algorithms: random search, grid search, and Bayesian optimization. A total of 170 data points were used to develop these models. After data preprocessing and model development, it was determined that the boosted trees model optimized with grid search, with an R2 = 0.9964 and RMSE = 0.0156, performed the best, while the support vector regression model optimized with random search, with an R2 = 0.8426 and RMSE = 0.1030, had the lowest accuracy. The boosted trees model optimized with grid search was found to estimate all data points with a residual error of less than 0.15 and an absolute relative error of 4.62 %. The 95 % confidence interval for the RMSE was calculated based on the best model, showing that the error lies between 0.009060 and 0.023256 with 95 % confidence. Pearson correlation analysis was used for sensitivity analysis. The results showed that in all models, temperature, solubility, and pressure have a negative correlation with pH, while salinity has a positive correlation. Additionally, solubility and salinity exhibited the highest and lowest correlations, with average values of −0.9225 and 0.0594, respectively. Due to the accuracy of the developed models, these models can help to optimize the operation of injecting CO2 into oceans to reduce harmful environmental effects.
format Article
id doaj-art-9a290c67c5904a2b8f580639051a56ee
institution Kabale University
issn 2590-1230
language English
publishDate 2024-12-01
publisher Elsevier
record_format Article
series Results in Engineering
spelling doaj-art-9a290c67c5904a2b8f580639051a56ee2024-12-19T10:57:16ZengElsevierResults in Engineering2590-12302024-12-0124102889Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestrationMohammad Rasool Dehghani0Moein Kafi1Hamed Nikravesh2Maryam Aghel3Erfan Mohammadian4Yousef Kazemzadeh5Reza Azin6Department of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Environmental Health Engineering, Faculty of Health and Nutrition, Bushehr University of Medical Sciences, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, Iran; Corresponding author.Department of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranChanges in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fill this research gap, this study developed 15 models comprising five machine learning methods: regression trees, support vector regression, Gaussian process regression, bagged trees, and boosted trees, and three optimization algorithms: random search, grid search, and Bayesian optimization. A total of 170 data points were used to develop these models. After data preprocessing and model development, it was determined that the boosted trees model optimized with grid search, with an R2 = 0.9964 and RMSE = 0.0156, performed the best, while the support vector regression model optimized with random search, with an R2 = 0.8426 and RMSE = 0.1030, had the lowest accuracy. The boosted trees model optimized with grid search was found to estimate all data points with a residual error of less than 0.15 and an absolute relative error of 4.62 %. The 95 % confidence interval for the RMSE was calculated based on the best model, showing that the error lies between 0.009060 and 0.023256 with 95 % confidence. Pearson correlation analysis was used for sensitivity analysis. The results showed that in all models, temperature, solubility, and pressure have a negative correlation with pH, while salinity has a positive correlation. Additionally, solubility and salinity exhibited the highest and lowest correlations, with average values of −0.9225 and 0.0594, respectively. Due to the accuracy of the developed models, these models can help to optimize the operation of injecting CO2 into oceans to reduce harmful environmental effects.http://www.sciencedirect.com/science/article/pii/S2590123024011447CO2 sequestrationpH predictionOceanMachine learningOptimization
spellingShingle Mohammad Rasool Dehghani
Moein Kafi
Hamed Nikravesh
Maryam Aghel
Erfan Mohammadian
Yousef Kazemzadeh
Reza Azin
Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
Results in Engineering
CO2 sequestration
pH prediction
Ocean
Machine learning
Optimization
title Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
title_full Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
title_fullStr Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
title_full_unstemmed Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
title_short Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
title_sort data driven models for predicting ph of co2 in aqueous solutions implications for co2 sequestration
topic CO2 sequestration
pH prediction
Ocean
Machine learning
Optimization
url http://www.sciencedirect.com/science/article/pii/S2590123024011447
work_keys_str_mv AT mohammadrasooldehghani datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT moeinkafi datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT hamednikravesh datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT maryamaghel datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT erfanmohammadian datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT yousefkazemzadeh datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration
AT rezaazin datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration