Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques

Accurate modeling and simulation of forest land cover change resulting from epidemic insect outbreaks play a crucial role in equipping scientists and forest managers with essential insights. These insights enable proactive planning and the formulation of effective strategies to mitigate the impact o...

Full description

Saved in:
Bibliographic Details
Main Authors: Roberto Molowny-Horas, Saeed Harati-Asl, Liliana Perez
Format: Article
Language:English
Published: Taylor & Francis Group 2025-07-01
Series:Geo-spatial Information Science
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/10095020.2025.2529992
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849421995246092288
author Roberto Molowny-Horas
Saeed Harati-Asl
Liliana Perez
author_facet Roberto Molowny-Horas
Saeed Harati-Asl
Liliana Perez
author_sort Roberto Molowny-Horas
collection DOAJ
description Accurate modeling and simulation of forest land cover change resulting from epidemic insect outbreaks play a crucial role in equipping scientists and forest managers with essential insights. These insights enable proactive planning and the formulation of effective strategies to mitigate the impact of such disturbances. By employing advanced modeling techniques, researchers and managers can anticipate the evolving dynamics of forest ecosystems, thereby facilitating timely interventions and sustainable management practices. In this study, we applied sixteen machine-learning models, plus two ensemble averaging procedures, to Mountain Pine Beetle (Dendroctonus ponderosae) infestation data in British Columbia, to calculate projections of insect-induced deforestation. Model drivers included topographic, climatic and adjacency variables. We verified the results of the simulations by randomly splitting datasets between training and test subsets (aka Validation assessment), as well as by comparing future projections with observations (aka Prediction assessment). All calculations were carried out for different mountain pine beetle map sets and time differences, and we employed up to seven performance metrics (six threshold-dependent and one threshold-independent) and four error metrics to assess goodness of prediction. ANCOVA tests were then run on metric results to test differences between Validation and Prediction assessments. In addition, we computed Friedman rankings for all simulation and metrics. Our results showed that validation assessments were, most of the time, significantly more optimistic than prediction assessments. We also noted that different conclusions could be reached for different performance metrics. We conclude that, for prediction purposes, error metrics and components of the confusion table were most helpful in understanding the ability and limitations of Mountain Pine Beetle predictive maps. These results also suggest that, in general, care must be taken in assessing prediction performance of machine-learning models based solely on validation tests.
format Article
id doaj-art-4d843e2eaa4546e19efc09adb006739d
institution Kabale University
issn 1009-5020
1993-5153
language English
publishDate 2025-07-01
publisher Taylor & Francis Group
record_format Article
series Geo-spatial Information Science
spelling doaj-art-4d843e2eaa4546e19efc09adb006739d2025-08-20T03:31:16ZengTaylor & Francis GroupGeo-spatial Information Science1009-50201993-51532025-07-0111810.1080/10095020.2025.2529992Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniquesRoberto Molowny-Horas0Saeed Harati-Asl1Liliana Perez2The Ecosystem Modelling Facility, CREAF, Cerdanyola del Vallès, SpainRoger Tomlinson laboratory, Department of Geography, McGill University, Montreal, CanadaLaboratoire de Géosimulation Environnementale (LEDGE), Département de Géographie, Université de Montréal, Montréal, CanadaAccurate modeling and simulation of forest land cover change resulting from epidemic insect outbreaks play a crucial role in equipping scientists and forest managers with essential insights. These insights enable proactive planning and the formulation of effective strategies to mitigate the impact of such disturbances. By employing advanced modeling techniques, researchers and managers can anticipate the evolving dynamics of forest ecosystems, thereby facilitating timely interventions and sustainable management practices. In this study, we applied sixteen machine-learning models, plus two ensemble averaging procedures, to Mountain Pine Beetle (Dendroctonus ponderosae) infestation data in British Columbia, to calculate projections of insect-induced deforestation. Model drivers included topographic, climatic and adjacency variables. We verified the results of the simulations by randomly splitting datasets between training and test subsets (aka Validation assessment), as well as by comparing future projections with observations (aka Prediction assessment). All calculations were carried out for different mountain pine beetle map sets and time differences, and we employed up to seven performance metrics (six threshold-dependent and one threshold-independent) and four error metrics to assess goodness of prediction. ANCOVA tests were then run on metric results to test differences between Validation and Prediction assessments. In addition, we computed Friedman rankings for all simulation and metrics. Our results showed that validation assessments were, most of the time, significantly more optimistic than prediction assessments. We also noted that different conclusions could be reached for different performance metrics. We conclude that, for prediction purposes, error metrics and components of the confusion table were most helpful in understanding the ability and limitations of Mountain Pine Beetle predictive maps. These results also suggest that, in general, care must be taken in assessing prediction performance of machine-learning models based solely on validation tests.https://www.tandfonline.com/doi/10.1080/10095020.2025.2529992Land cover changesforest disturbancesmodel calibrationmachine-learningcross-validationmap comparison
spellingShingle Roberto Molowny-Horas
Saeed Harati-Asl
Liliana Perez
Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
Geo-spatial Information Science
Land cover changes
forest disturbances
model calibration
machine-learning
cross-validation
map comparison
title Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
title_full Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
title_fullStr Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
title_full_unstemmed Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
title_short Understanding forest insect outbreak dynamics: a comparative analysis of machine learning techniques
title_sort understanding forest insect outbreak dynamics a comparative analysis of machine learning techniques
topic Land cover changes
forest disturbances
model calibration
machine-learning
cross-validation
map comparison
url https://www.tandfonline.com/doi/10.1080/10095020.2025.2529992
work_keys_str_mv AT robertomolownyhoras understandingforestinsectoutbreakdynamicsacomparativeanalysisofmachinelearningtechniques
AT saeedharatiasl understandingforestinsectoutbreakdynamicsacomparativeanalysisofmachinelearningtechniques
AT lilianaperez understandingforestinsectoutbreakdynamicsacomparativeanalysisofmachinelearningtechniques