A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model

This study addresses the critical challenge of the limited understanding of environmental factors influencing cocoa cultivation in Colombia, a region with significant production potential but diverse agroecological conditions. The fragmented nature of the existing agricultural data and the lack of t...

Full description

Saved in:
Bibliographic Details
Main Authors: Leonardo Talero-Sarmiento, Sebastian Roa-Prada, Luz Caicedo-Chacon, Oscar Gavanzo-Cardenas
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:AgriEngineering
Subjects:
Online Access:https://www.mdpi.com/2624-7402/7/1/6
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589489149575168
author Leonardo Talero-Sarmiento
Sebastian Roa-Prada
Luz Caicedo-Chacon
Oscar Gavanzo-Cardenas
author_facet Leonardo Talero-Sarmiento
Sebastian Roa-Prada
Luz Caicedo-Chacon
Oscar Gavanzo-Cardenas
author_sort Leonardo Talero-Sarmiento
collection DOAJ
description This study addresses the critical challenge of the limited understanding of environmental factors influencing cocoa cultivation in Colombia, a region with significant production potential but diverse agroecological conditions. The fragmented nature of the existing agricultural data and the lack of targeted research hinder efforts to optimize productivity and sustainability. To bridge this gap, this research employs a data-driven approach, using advanced machine learning techniques such as supervised, unsupervised, and ensemble models, to analyze environmental datasets and provide actionable recommendations. By integrating data from official Colombian sources, as well as the NASA POWER database, and geographical APIs, the present study proposes a methodology to systematically assess environmental conditions and classify regions for optimal cocoa cultivation. The use of an assembled model, combining clustering with targeted machine learning for each cluster, offers a more precise and scalable understanding of cocoa establishment under diverse conditions. Despite challenges such as limited dataset resolution and localized climate variability, this research provides valuable insights for a more comprehensive understanding of the environmental conditions impacting cocoa plantation establishment in a given location. The key findings reveal that temperature, humidity, and wind speed are crucial determinants of cocoa growth, with complex interactions affecting regional suitability. The results offer valuable guidance for the implementation of adaptive agricultural practices and resilience strategies, enabling sustainable cocoa production systems. By implementing better practices, countries such as Colombia can achieve higher market shares under growing global cocoa demand conditions.
format Article
id doaj-art-a06d1f24cf4a425286f0056fd9e4e990
institution Kabale University
issn 2624-7402
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series AgriEngineering
spelling doaj-art-a06d1f24cf4a425286f0056fd9e4e9902025-01-24T13:16:12ZengMDPI AGAgriEngineering2624-74022024-12-0171610.3390/agriengineering7010006A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning ModelLeonardo Talero-Sarmiento0Sebastian Roa-Prada1Luz Caicedo-Chacon2Oscar Gavanzo-Cardenas3Facultad de Ingenieria, Ingenieria Industrial, Universidad Autonoma de Bucaramanga (UNAB), Bucaramanga 680003, ColombiaFacultad de Ingenieria, Ingenieria Mecatrónica, Universidad Autonoma de Bucaramanga (UNAB), Bucaramanga 680003, ColombiaIngenieria de Sistemas, Fundacion Universitaria de San Gil (Unisangil), San Gil 684031, ColombiaFEDECACAO, Bucaramanga 680006, ColombiaThis study addresses the critical challenge of the limited understanding of environmental factors influencing cocoa cultivation in Colombia, a region with significant production potential but diverse agroecological conditions. The fragmented nature of the existing agricultural data and the lack of targeted research hinder efforts to optimize productivity and sustainability. To bridge this gap, this research employs a data-driven approach, using advanced machine learning techniques such as supervised, unsupervised, and ensemble models, to analyze environmental datasets and provide actionable recommendations. By integrating data from official Colombian sources, as well as the NASA POWER database, and geographical APIs, the present study proposes a methodology to systematically assess environmental conditions and classify regions for optimal cocoa cultivation. The use of an assembled model, combining clustering with targeted machine learning for each cluster, offers a more precise and scalable understanding of cocoa establishment under diverse conditions. Despite challenges such as limited dataset resolution and localized climate variability, this research provides valuable insights for a more comprehensive understanding of the environmental conditions impacting cocoa plantation establishment in a given location. The key findings reveal that temperature, humidity, and wind speed are crucial determinants of cocoa growth, with complex interactions affecting regional suitability. The results offer valuable guidance for the implementation of adaptive agricultural practices and resilience strategies, enabling sustainable cocoa production systems. By implementing better practices, countries such as Colombia can achieve higher market shares under growing global cocoa demand conditions.https://www.mdpi.com/2624-7402/7/1/6cocoa productionColombiaclimate changedata-driven agricultureensemble modelmachine learning
spellingShingle Leonardo Talero-Sarmiento
Sebastian Roa-Prada
Luz Caicedo-Chacon
Oscar Gavanzo-Cardenas
A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
AgriEngineering
cocoa production
Colombia
climate change
data-driven agriculture
ensemble model
machine learning
title A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
title_full A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
title_fullStr A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
title_full_unstemmed A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
title_short A Data-Driven Approach to Improve Cocoa Crop Establishment in Colombia: Insights and Agricultural Practice Recommendations from an Ensemble Machine Learning Model
title_sort data driven approach to improve cocoa crop establishment in colombia insights and agricultural practice recommendations from an ensemble machine learning model
topic cocoa production
Colombia
climate change
data-driven agriculture
ensemble model
machine learning
url https://www.mdpi.com/2624-7402/7/1/6
work_keys_str_mv AT leonardotalerosarmiento adatadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT sebastianroaprada adatadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT luzcaicedochacon adatadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT oscargavanzocardenas adatadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT leonardotalerosarmiento datadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT sebastianroaprada datadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT luzcaicedochacon datadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel
AT oscargavanzocardenas datadrivenapproachtoimprovecocoacropestablishmentincolombiainsightsandagriculturalpracticerecommendationsfromanensemblemachinelearningmodel