Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data

Spaceborne LiDAR satellites, including GEDI and ICESat-2, have shown significant potential in estimating aboveground biomass (AGB) using machine learning (ML) methods. In contrast to advances focused on the refinement of ML algorithms, this study aims to enhance AGB estimation accuracy by integratin...

Full description

Saved in:
Bibliographic Details
Main Authors: Biao Zhang, Zhichao Wang, Tiantian Ma, Zhihao Wang, Hao Li, Wenxu Ji, Mingyang He, Ao Jiao, Zhongke Feng
Format: Article
Language:English
Published: Elsevier 2025-05-01
Series:Ecological Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1574954125000548
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832573230446018560
author Biao Zhang
Zhichao Wang
Tiantian Ma
Zhihao Wang
Hao Li
Wenxu Ji
Mingyang He
Ao Jiao
Zhongke Feng
author_facet Biao Zhang
Zhichao Wang
Tiantian Ma
Zhihao Wang
Hao Li
Wenxu Ji
Mingyang He
Ao Jiao
Zhongke Feng
author_sort Biao Zhang
collection DOAJ
description Spaceborne LiDAR satellites, including GEDI and ICESat-2, have shown significant potential in estimating aboveground biomass (AGB) using machine learning (ML) methods. In contrast to advances focused on the refinement of ML algorithms, this study aims to enhance AGB estimation accuracy by integrating an additional Canopy Height (CH) information. To obtain CH data, this study utilized three spaceborne LiDAR datasets: ICESat-2 ATL08, ICESat-2 ATL03/ATL08 fusion data, and GEDI-L2A. Random Forest (RF) and Monte Carlo-based uncertainty analysis were employed to evaluate the most suitable spaceborne LiDAR dataset for CH estimation. The accuracy of CH features in AGB estimation was then compared using both Linear Regression (LR) and RF models. The spectral saturation point was computed using a semi-variance function, and the contribution of CH features to AGB estimates was quantified across different gradients, especially when AGB neared or surpassed the saturation point. The findings demonstrate that the ATL03/08 fusion dataset surpasses the other datasets in terms of CH estimation accuracy and uncertainty, delivering enhanced precision and stability. Incorporating CH features notably improved AGB model performance, as evidenced by R2 increases of 13.89 % and 10.34 % in the LR and RF models, respectively. The correction of AGB estimates across various gradients with CH features demonstrated a nonlinear pattern, initially increasing, then decreasing, and subsequently rebounding. Notable inflection points were identified at 26 Mg/ha and 123 Mg/ha, marking significant transitions in the correction trend. Both positive and negative bias corrections were observed during the correction process, with their proportions varying according to AGB values. When AGB approached or exceeded the spectral saturation point, the ability of CH features to improve positive bias correction was markedly enhanced, resulting in a greater proportion of positively corrected pixels and more significant correction values. The results of this study provide new insights into the role of CH features in AGB estimation, offering important implications for enhancing biomass mapping accuracy in forest ecosystems.
format Article
id doaj-art-932255f5df7d446a8e1cb68ce61faa4f
institution Kabale University
issn 1574-9541
language English
publishDate 2025-05-01
publisher Elsevier
record_format Article
series Ecological Informatics
spelling doaj-art-932255f5df7d446a8e1cb68ce61faa4f2025-02-02T05:26:59ZengElsevierEcological Informatics1574-95412025-05-0186103045Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 dataBiao Zhang0Zhichao Wang1Tiantian Ma2Zhihao Wang3Hao Li4Wenxu Ji5Mingyang He6Ao Jiao7Zhongke Feng8Precision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, China; Corresponding authors.Precision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaDepartment of Land, Air, and Water Resource, University of California at Davis, California 95616, USAPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, ChinaPrecision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, China; Corresponding authors.Spaceborne LiDAR satellites, including GEDI and ICESat-2, have shown significant potential in estimating aboveground biomass (AGB) using machine learning (ML) methods. In contrast to advances focused on the refinement of ML algorithms, this study aims to enhance AGB estimation accuracy by integrating an additional Canopy Height (CH) information. To obtain CH data, this study utilized three spaceborne LiDAR datasets: ICESat-2 ATL08, ICESat-2 ATL03/ATL08 fusion data, and GEDI-L2A. Random Forest (RF) and Monte Carlo-based uncertainty analysis were employed to evaluate the most suitable spaceborne LiDAR dataset for CH estimation. The accuracy of CH features in AGB estimation was then compared using both Linear Regression (LR) and RF models. The spectral saturation point was computed using a semi-variance function, and the contribution of CH features to AGB estimates was quantified across different gradients, especially when AGB neared or surpassed the saturation point. The findings demonstrate that the ATL03/08 fusion dataset surpasses the other datasets in terms of CH estimation accuracy and uncertainty, delivering enhanced precision and stability. Incorporating CH features notably improved AGB model performance, as evidenced by R2 increases of 13.89 % and 10.34 % in the LR and RF models, respectively. The correction of AGB estimates across various gradients with CH features demonstrated a nonlinear pattern, initially increasing, then decreasing, and subsequently rebounding. Notable inflection points were identified at 26 Mg/ha and 123 Mg/ha, marking significant transitions in the correction trend. Both positive and negative bias corrections were observed during the correction process, with their proportions varying according to AGB values. When AGB approached or exceeded the spectral saturation point, the ability of CH features to improve positive bias correction was markedly enhanced, resulting in a greater proportion of positively corrected pixels and more significant correction values. The results of this study provide new insights into the role of CH features in AGB estimation, offering important implications for enhancing biomass mapping accuracy in forest ecosystems.http://www.sciencedirect.com/science/article/pii/S1574954125000548Aboveground biomassCanopy heightBias correctionUncertainty assessmentSpectral saturationLiDAR
spellingShingle Biao Zhang
Zhichao Wang
Tiantian Ma
Zhihao Wang
Hao Li
Wenxu Ji
Mingyang He
Ao Jiao
Zhongke Feng
Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
Ecological Informatics
Aboveground biomass
Canopy height
Bias correction
Uncertainty assessment
Spectral saturation
LiDAR
title Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
title_full Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
title_fullStr Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
title_full_unstemmed Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
title_short Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data
title_sort correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using gedi and icesat 2 data
topic Aboveground biomass
Canopy height
Bias correction
Uncertainty assessment
Spectral saturation
LiDAR
url http://www.sciencedirect.com/science/article/pii/S1574954125000548
work_keys_str_mv AT biaozhang correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT zhichaowang correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT tiantianma correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT zhihaowang correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT haoli correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT wenxuji correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT mingyanghe correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT aojiao correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data
AT zhongkefeng correctingforestabovegroundbiomassbiasesbyincorporatingindependentcanopyheightretrievalwithconventionalmachinelearningmodelsusinggediandicesat2data