Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data
Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, ha...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nigerian Society of Physical Sciences
2025-02-01
|
Series: | Journal of Nigerian Society of Physical Sciences |
Subjects: | |
Online Access: | https://journal.nsps.org.ng/index.php/jnsps/article/view/2314 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841525204525776896 |
---|---|
author | Nour Hamad Abu Afouna Majid Khan Majahar Ali |
author_facet | Nour Hamad Abu Afouna Majid Khan Majahar Ali |
author_sort | Nour Hamad Abu Afouna |
collection | DOAJ |
description |
Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, handling outliers, ensuring model robustness, and enhancing accuracy, particularly with small to medium-sized datasets. To overcome these obstacles, reducing retraining time and resolving the complexity issue is essential for improving the machine learning algorithm’s performance, scalability, and efficiency, especially when dealing with large or high-dimensional datasets. In a recent study involving 435 drying parameters and 1,914 observations, two machine learning algorithms - Ridge and Lasso - were employed to analyze and compare the impact of two variable selection techniques, specifically the regularization methods Ridge and Lasso, before and after addressing heterogeneity in highly ranked variables (50, 100, 150, 200, 250, 300). Additionally, robust regression methods such as S, M, MM, M-Hampel, M-Huber, M-Tukey, MM-bisquare, MM-Hampel, and MM-Huber were applied. The results demonstrated that the robust methods, when applied to Ridge and Lasso, achieved the highest efficiency, with the smallest values for MAPE, MSE, SSE, and the highest R2 values, both before and after accounting for heterogeneity. As a result of the study, the best models are the Ridge model with the MM bisquares before heterogeneity, the Ridge model with the MM method after heterogeneity, and the Lasso model with the MM method before heterogeneity and the Lasso model with MM Hampel after heterogeneity.
|
format | Article |
id | doaj-art-3530b57c940449ce88a00bb613d0ca97 |
institution | Kabale University |
issn | 2714-2817 2714-4704 |
language | English |
publishDate | 2025-02-01 |
publisher | Nigerian Society of Physical Sciences |
record_format | Article |
series | Journal of Nigerian Society of Physical Sciences |
spelling | doaj-art-3530b57c940449ce88a00bb613d0ca972025-01-17T18:52:26ZengNigerian Society of Physical SciencesJournal of Nigerian Society of Physical Sciences2714-28172714-47042025-02-017110.46481/jnsps.2025.2314Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional dataNour Hamad Abu Afouna0Majid Khan Majahar Ali1School of Mathematical Sciences, Universiti Sains Malaysia 11800 USM, Penang, MalaysiaSchool of Mathematical Sciences, Universiti Sains Malaysia 11800 USM, Penang, Malaysia Smart precision farming leverages IoT, cloud computing, and big data to optimize agricultural productivity, lower costs, and promote sustainability through digitalization and intelligent methodologies. However, it faces challenges such as managing complex variables, addressing multicollinearity, handling outliers, ensuring model robustness, and enhancing accuracy, particularly with small to medium-sized datasets. To overcome these obstacles, reducing retraining time and resolving the complexity issue is essential for improving the machine learning algorithm’s performance, scalability, and efficiency, especially when dealing with large or high-dimensional datasets. In a recent study involving 435 drying parameters and 1,914 observations, two machine learning algorithms - Ridge and Lasso - were employed to analyze and compare the impact of two variable selection techniques, specifically the regularization methods Ridge and Lasso, before and after addressing heterogeneity in highly ranked variables (50, 100, 150, 200, 250, 300). Additionally, robust regression methods such as S, M, MM, M-Hampel, M-Huber, M-Tukey, MM-bisquare, MM-Hampel, and MM-Huber were applied. The results demonstrated that the robust methods, when applied to Ridge and Lasso, achieved the highest efficiency, with the smallest values for MAPE, MSE, SSE, and the highest R2 values, both before and after accounting for heterogeneity. As a result of the study, the best models are the Ridge model with the MM bisquares before heterogeneity, the Ridge model with the MM method after heterogeneity, and the Lasso model with the MM method before heterogeneity and the Lasso model with MM Hampel after heterogeneity. https://journal.nsps.org.ng/index.php/jnsps/article/view/2314lassoRidgeM-estimationMM-estimationRobust Regression |
spellingShingle | Nour Hamad Abu Afouna Majid Khan Majahar Ali Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data Journal of Nigerian Society of Physical Sciences lasso Ridge M-estimation MM-estimation Robust Regression |
title | Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data |
title_full | Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data |
title_fullStr | Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data |
title_full_unstemmed | Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data |
title_short | Optimizing precision farming: enhancing machine learning efficiency with robust regression techniques in high-dimensional data |
title_sort | optimizing precision farming enhancing machine learning efficiency with robust regression techniques in high dimensional data |
topic | lasso Ridge M-estimation MM-estimation Robust Regression |
url | https://journal.nsps.org.ng/index.php/jnsps/article/view/2314 |
work_keys_str_mv | AT nourhamadabuafouna optimizingprecisionfarmingenhancingmachinelearningefficiencywithrobustregressiontechniquesinhighdimensionaldata AT majidkhanmajaharali optimizingprecisionfarmingenhancingmachinelearningefficiencywithrobustregressiontechniquesinhighdimensionaldata |