Stacked ensemble model for accurate crop yield prediction using machine learning techniques

Predicting crop yields remains a crucial challenge in agriculture, as these forecasts influence decision-making at global, regional, and individual crop levels. Historically, such predictions have utilized diverse data sources, including agricultural, land, climatic, atmospheric, and other pertinent...

Full description

Saved in:
Bibliographic Details
Main Authors: Ramesh V, Kumaresan P
Format: Article
Language:English
Published: IOP Publishing 2025-01-01
Series:Environmental Research Communications
Subjects:
Online Access:https://doi.org/10.1088/2515-7620/adb9c0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849706030937669632
author Ramesh V
Kumaresan P
author_facet Ramesh V
Kumaresan P
author_sort Ramesh V
collection DOAJ
description Predicting crop yields remains a crucial challenge in agriculture, as these forecasts influence decision-making at global, regional, and individual crop levels. Historically, such predictions have utilized diverse data sources, including agricultural, land, climatic, atmospheric, and other pertinent information. Over the past several years, the application of machine learning techniques has emerged as a valuable analytical approach for estimating agricultural productivity, thereby informing decisions regarding crop selection and management strategies throughout the entire growing cycle. Various kinds of machine learning models have been utilized in research to forecast crop yields. Our work proposes a stacked ensemble model designed for the purpose of predicting crop yield. The proposed model employs a stacked ensemble learning approach, with a Decision Tree Regressor functioning as the meta-model to amalgamate predictions from six distinct base learner models: Linear Regression (LR), Elastic Net, XGBoost Regressor, K-Neighbors Regressor (KNR), AdaBoost Regressor, and Random Forest Regressor (RFR). The proposed stacked ensemble model achieves superior crop yield prediction performance, evidenced by a notable enhancement in accuracy and a significant decrease in RMSE, surpassing the predictive capabilities of traditional machine learning models. The ensemble model’s performance was assessed using several metrics, including a Mean Absolute Error of 7.20 tons/hectare, Mean Square Error of 15570.32 tons ^2 /hectare ^2 , Root Mean Square Error of 124.78 tons/hectare, and Coefficient of Determination (R ^2 Score) of 0.98. The performance results demonstrate that stacked ensemble model outperforms other conventional machine learning approaches, achieving a high R-squared score of 98%.
format Article
id doaj-art-7c17f6b48ded41e1b2d5a5352e0e0ff9
institution DOAJ
issn 2515-7620
language English
publishDate 2025-01-01
publisher IOP Publishing
record_format Article
series Environmental Research Communications
spelling doaj-art-7c17f6b48ded41e1b2d5a5352e0e0ff92025-08-20T03:16:18ZengIOP PublishingEnvironmental Research Communications2515-76202025-01-017303500610.1088/2515-7620/adb9c0Stacked ensemble model for accurate crop yield prediction using machine learning techniquesRamesh V0https://orcid.org/0009-0000-4971-1602Kumaresan P1https://orcid.org/0000-0001-5563-8325School of Computer Science Engineering and Information Systems, Vellore Institute of Technology , Vellore - 632014, IndiaSchool of Computer Science Engineering and Information Systems, Vellore Institute of Technology , Vellore - 632014, IndiaPredicting crop yields remains a crucial challenge in agriculture, as these forecasts influence decision-making at global, regional, and individual crop levels. Historically, such predictions have utilized diverse data sources, including agricultural, land, climatic, atmospheric, and other pertinent information. Over the past several years, the application of machine learning techniques has emerged as a valuable analytical approach for estimating agricultural productivity, thereby informing decisions regarding crop selection and management strategies throughout the entire growing cycle. Various kinds of machine learning models have been utilized in research to forecast crop yields. Our work proposes a stacked ensemble model designed for the purpose of predicting crop yield. The proposed model employs a stacked ensemble learning approach, with a Decision Tree Regressor functioning as the meta-model to amalgamate predictions from six distinct base learner models: Linear Regression (LR), Elastic Net, XGBoost Regressor, K-Neighbors Regressor (KNR), AdaBoost Regressor, and Random Forest Regressor (RFR). The proposed stacked ensemble model achieves superior crop yield prediction performance, evidenced by a notable enhancement in accuracy and a significant decrease in RMSE, surpassing the predictive capabilities of traditional machine learning models. The ensemble model’s performance was assessed using several metrics, including a Mean Absolute Error of 7.20 tons/hectare, Mean Square Error of 15570.32 tons ^2 /hectare ^2 , Root Mean Square Error of 124.78 tons/hectare, and Coefficient of Determination (R ^2 Score) of 0.98. The performance results demonstrate that stacked ensemble model outperforms other conventional machine learning approaches, achieving a high R-squared score of 98%.https://doi.org/10.1088/2515-7620/adb9c0crop yield predictionlinear regressionrandom forestdecision tree regressorAdaBoost regressorelastic Net
spellingShingle Ramesh V
Kumaresan P
Stacked ensemble model for accurate crop yield prediction using machine learning techniques
Environmental Research Communications
crop yield prediction
linear regression
random forest
decision tree regressor
AdaBoost regressor
elastic Net
title Stacked ensemble model for accurate crop yield prediction using machine learning techniques
title_full Stacked ensemble model for accurate crop yield prediction using machine learning techniques
title_fullStr Stacked ensemble model for accurate crop yield prediction using machine learning techniques
title_full_unstemmed Stacked ensemble model for accurate crop yield prediction using machine learning techniques
title_short Stacked ensemble model for accurate crop yield prediction using machine learning techniques
title_sort stacked ensemble model for accurate crop yield prediction using machine learning techniques
topic crop yield prediction
linear regression
random forest
decision tree regressor
AdaBoost regressor
elastic Net
url https://doi.org/10.1088/2515-7620/adb9c0
work_keys_str_mv AT rameshv stackedensemblemodelforaccuratecropyieldpredictionusingmachinelearningtechniques
AT kumaresanp stackedensemblemodelforaccuratecropyieldpredictionusingmachinelearningtechniques