An improved long-term high-resolution surface pCO2 data product for the Indian Ocean using machine learning

Abstract Accurate estimation of surface ocean pCO2 is crucial for understanding the ocean’s role in the global carbon cycle and its response to climate change. In this study, we employ a machine learning algorithm to correct the deviations in high-resolution (1/12°) model simulations of surface pCO2...

Full description

Saved in:
Bibliographic Details
Main Authors: Prasanna Kanti Ghoshal, A.P. Joshi, Kunal Chakraborty
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04914-z
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Accurate estimation of surface ocean pCO2 is crucial for understanding the ocean’s role in the global carbon cycle and its response to climate change. In this study, we employ a machine learning algorithm to correct the deviations in high-resolution (1/12°) model simulations of surface pCO2 from the INCOIS-BIO-ROMS model (pCO2 model) for the period 1980–2019, using available observations (pCO2 obs). We train the XGBoost model to generate spatio-temporal deviations (pCO2 obs − pCO2 model) of pCO2 model. The interannually and climatologically varying deviations are then added back to the original model separately, which results in an improved surface pCO2 data product. A comparison of our surface pCO2 data product with moored observations, gridded SOCAT, CMEMS-LSCE-FFNN, and OceanSODA demonstrates an improvement by approximately 40% ± 3.31% in RMSE. Further analysis reveals that adding climatological deviations to pCO2 model results in greater improvements than adding interannual deviations. This analysis underscores the ability of machine learning algorithms to enhance the accuracy of model-simulated surface pCO2 outputs.
ISSN:2052-4463