Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data

Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, ha...

Full description

Saved in:
Bibliographic Details
Main Authors: Nour Hamad Abu Afouna, Majid Khan Majahar Ali
Format: Article
Language:English
Published: Nigerian Society of Physical Sciences 2025-02-01
Series:Journal of Nigerian Society of Physical Sciences
Subjects:
Online Access:https://journal.nsps.org.ng/index.php/jnsps/article/view/2314
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841525204525776896
author Nour Hamad Abu Afouna
Majid Khan Majahar Ali
author_facet Nour Hamad Abu Afouna
Majid Khan Majahar Ali
author_sort Nour Hamad Abu Afouna
collection DOAJ
description Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, handling outliers, ensuring model robustness, and enhancing accuracy, particularly with small to medium-sized datasets. To overcome these obstacles, reducing retraining time and resolving the complexity issue is essential for improving the machine learning algorithm’s performance, scalability, and efficiency, especially when dealing with large or high-dimensional datasets. In a recent study involving 435 drying parameters and 1,914 observations, two machine learning algorithms - Ridge and Lasso - were employed to analyze and compare the impact of two variable selection techniques, specifically the regularization methods Ridge and Lasso, before and after addressing heterogeneity in highly ranked variables (50, 100, 150, 200, 250, 300). Additionally, robust regression methods such as S, M, MM, M-Hampel, M-Huber, M-Tukey, MM-bisquare, MM-Hampel, and MM-Huber were applied. The results demonstrated that the robust methods, when applied to Ridge and Lasso, achieved the highest efficiency, with the smallest values for MAPE, MSE, SSE, and the highest R2 values, both before and after accounting for heterogeneity. As a result of the study, the best models are the Ridge model with the MM bisquares before heterogeneity, the Ridge model with the MM method after heterogeneity, and the Lasso model with the MM method before heterogeneity and the Lasso model with MM Hampel after heterogeneity.
format Article
id doaj-art-3530b57c940449ce88a00bb613d0ca97
institution Kabale University
issn 2714-2817
2714-4704
language English
publishDate 2025-02-01
publisher Nigerian Society of Physical Sciences
record_format Article
series Journal of Nigerian Society of Physical Sciences
spelling doaj-art-3530b57c940449ce88a00bb613d0ca972025-01-17T18:52:26ZengNigerian Society of Physical SciencesJournal of Nigerian Society of Physical Sciences2714-28172714-47042025-02-017110.46481/jnsps.2025.2314Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional dataNour Hamad Abu Afouna0Majid Khan Majahar Ali1School of Mathematical Sciences, Universiti Sains Malaysia 11800 USM, Penang, MalaysiaSchool of Mathematical Sciences, Universiti Sains Malaysia 11800 USM, Penang, Malaysia Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, handling outliers, ensuring model robustness, and enhancing accuracy, particularly with small to medium-sized datasets. To overcome these obstacles, reducing retraining time and resolving the complexity issue is essential for improving the machine learning algorithm’s performance, scalability, and efficiency, especially when dealing with large or high-dimensional datasets. In a recent study involving 435 drying parameters and 1,914 observations, two machine learning algorithms - Ridge and Lasso - were employed to analyze and compare the impact of two variable selection techniques, specifically the regularization methods Ridge and Lasso, before and after addressing heterogeneity in highly ranked variables (50, 100, 150, 200, 250, 300). Additionally, robust regression methods such as S, M, MM, M-Hampel, M-Huber, M-Tukey, MM-bisquare, MM-Hampel, and MM-Huber were applied. The results demonstrated that the robust methods, when applied to Ridge and Lasso, achieved the highest efficiency, with the smallest values for MAPE, MSE, SSE, and the highest R2 values, both before and after accounting for heterogeneity. As a result of the study, the best models are the Ridge model with the MM bisquares before heterogeneity, the Ridge model with the MM method after heterogeneity, and the Lasso model with the MM method before heterogeneity and the Lasso model with MM Hampel after heterogeneity. https://journal.nsps.org.ng/index.php/jnsps/article/view/2314lassoRidgeM-estimationMM-estimationRobust Regression
spellingShingle Nour Hamad Abu Afouna
Majid Khan Majahar Ali
Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
Journal of Nigerian Society of Physical Sciences
lasso
Ridge
M-estimation
MM-estimation
Robust Regression
title Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
title_full Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
title_fullStr Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
title_full_unstemmed Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
title_short Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
title_sort optimizing precision farming enhancing machine learning efficiency with robust regression techniques in high dimensional data
topic lasso
Ridge
M-estimation
MM-estimation
Robust Regression
url https://journal.nsps.org.ng/index.php/jnsps/article/view/2314
work_keys_str_mv AT nourhamadabuafouna optimizingprecisionfarmingenhancingmachinelearningefficiencywithrobustregressiontechniquesinhighdimensionaldata
AT majidkhanmajaharali optimizingprecisionfarmingenhancingmachinelearningefficiencywithrobustregressiontechniquesinhighdimensionaldata