Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration
Changes in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fi...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2024-12-01
|
Series: | Results in Engineering |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2590123024011447 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1846115896328716288 |
---|---|
author | Mohammad Rasool Dehghani Moein Kafi Hamed Nikravesh Maryam Aghel Erfan Mohammadian Yousef Kazemzadeh Reza Azin |
author_facet | Mohammad Rasool Dehghani Moein Kafi Hamed Nikravesh Maryam Aghel Erfan Mohammadian Yousef Kazemzadeh Reza Azin |
author_sort | Mohammad Rasool Dehghani |
collection | DOAJ |
description | Changes in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fill this research gap, this study developed 15 models comprising five machine learning methods: regression trees, support vector regression, Gaussian process regression, bagged trees, and boosted trees, and three optimization algorithms: random search, grid search, and Bayesian optimization. A total of 170 data points were used to develop these models. After data preprocessing and model development, it was determined that the boosted trees model optimized with grid search, with an R2 = 0.9964 and RMSE = 0.0156, performed the best, while the support vector regression model optimized with random search, with an R2 = 0.8426 and RMSE = 0.1030, had the lowest accuracy. The boosted trees model optimized with grid search was found to estimate all data points with a residual error of less than 0.15 and an absolute relative error of 4.62 %. The 95 % confidence interval for the RMSE was calculated based on the best model, showing that the error lies between 0.009060 and 0.023256 with 95 % confidence. Pearson correlation analysis was used for sensitivity analysis. The results showed that in all models, temperature, solubility, and pressure have a negative correlation with pH, while salinity has a positive correlation. Additionally, solubility and salinity exhibited the highest and lowest correlations, with average values of −0.9225 and 0.0594, respectively. Due to the accuracy of the developed models, these models can help to optimize the operation of injecting CO2 into oceans to reduce harmful environmental effects. |
format | Article |
id | doaj-art-9a290c67c5904a2b8f580639051a56ee |
institution | Kabale University |
issn | 2590-1230 |
language | English |
publishDate | 2024-12-01 |
publisher | Elsevier |
record_format | Article |
series | Results in Engineering |
spelling | doaj-art-9a290c67c5904a2b8f580639051a56ee2024-12-19T10:57:16ZengElsevierResults in Engineering2590-12302024-12-0124102889Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestrationMohammad Rasool Dehghani0Moein Kafi1Hamed Nikravesh2Maryam Aghel3Erfan Mohammadian4Yousef Kazemzadeh5Reza Azin6Department of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Environmental Health Engineering, Faculty of Health and Nutrition, Bushehr University of Medical Sciences, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranDepartment of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, Iran; Corresponding author.Department of Petroleum Engineering, Faculty of Petroleum, Gas and Petrochemical Engineering, Persian Gulf University, Bushehr, IranChanges in pH during CO2 injection into oceans can lead to significant negative environmental impacts, making it particularly important to track these changes. However, previous studies have not comprehensively investigated the development of machine learning models to estimate this parameter. To fill this research gap, this study developed 15 models comprising five machine learning methods: regression trees, support vector regression, Gaussian process regression, bagged trees, and boosted trees, and three optimization algorithms: random search, grid search, and Bayesian optimization. A total of 170 data points were used to develop these models. After data preprocessing and model development, it was determined that the boosted trees model optimized with grid search, with an R2 = 0.9964 and RMSE = 0.0156, performed the best, while the support vector regression model optimized with random search, with an R2 = 0.8426 and RMSE = 0.1030, had the lowest accuracy. The boosted trees model optimized with grid search was found to estimate all data points with a residual error of less than 0.15 and an absolute relative error of 4.62 %. The 95 % confidence interval for the RMSE was calculated based on the best model, showing that the error lies between 0.009060 and 0.023256 with 95 % confidence. Pearson correlation analysis was used for sensitivity analysis. The results showed that in all models, temperature, solubility, and pressure have a negative correlation with pH, while salinity has a positive correlation. Additionally, solubility and salinity exhibited the highest and lowest correlations, with average values of −0.9225 and 0.0594, respectively. Due to the accuracy of the developed models, these models can help to optimize the operation of injecting CO2 into oceans to reduce harmful environmental effects.http://www.sciencedirect.com/science/article/pii/S2590123024011447CO2 sequestrationpH predictionOceanMachine learningOptimization |
spellingShingle | Mohammad Rasool Dehghani Moein Kafi Hamed Nikravesh Maryam Aghel Erfan Mohammadian Yousef Kazemzadeh Reza Azin Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration Results in Engineering CO2 sequestration pH prediction Ocean Machine learning Optimization |
title | Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration |
title_full | Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration |
title_fullStr | Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration |
title_full_unstemmed | Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration |
title_short | Data driven models for predicting pH of CO2 in aqueous solutions: Implications for CO2 sequestration |
title_sort | data driven models for predicting ph of co2 in aqueous solutions implications for co2 sequestration |
topic | CO2 sequestration pH prediction Ocean Machine learning Optimization |
url | http://www.sciencedirect.com/science/article/pii/S2590123024011447 |
work_keys_str_mv | AT mohammadrasooldehghani datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT moeinkafi datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT hamednikravesh datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT maryamaghel datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT erfanmohammadian datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT yousefkazemzadeh datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration AT rezaazin datadrivenmodelsforpredictingphofco2inaqueoussolutionsimplicationsforco2sequestration |