Machine learning in environmental sustainability factor analysis in the agricultural sector

The study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by d...

Full description

Saved in:
Bibliographic Details
Main Authors: Kukartsev Vladislav, Kozlova Anastasia, Kukarceva Svetlana
Format: Article
Language:English
Published: EDP Sciences 2024-01-01
Series:BIO Web of Conferences
Online Access:https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850175923383435264
author Kukartsev Vladislav
Kozlova Anastasia
Kukarceva Svetlana
author_facet Kukartsev Vladislav
Kozlova Anastasia
Kukarceva Svetlana
author_sort Kukartsev Vladislav
collection DOAJ
description The study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by determining how changes in one variable relate to changes in another. This established a foundation for further in-depth data analysis. For a deeper understanding and simplified interpretation of the data, factor analysis was utilized. This method helped to identify latent factors that explain the relationships between observed variables and to reduce the number of variables by grouping them. This made the analysis easier and facilitated the identification of key components affecting the data. Logistic regression was applied to build data models. This method is used to model the probability of a specific event occurring based on independent variables, allowing for the classification and prediction of categorical outcomes. The logistic function was used to estimate probabilities and the relationship between the dependent variable and predictors. To enhance the performance of the logistic regression model, a Weight of Evidence (WoE) analysis was conducted. This method converts categorical and continuous variables into numerical formats, simplifying data interpretation and improving the model’s predictive capabilities. WoE analysis helps to identify significant factors, improve the linear relationship between predictors and the dependent variable, and reduce the impact of outliers, which is particularly important in areas such as credit scoring. The results of applying these methods showed that the model based on correlation and factor analysis explained 27.51% of the information on the training set and 76.04% on the test set.
format Article
id doaj-art-384bea86f6174bb7983ca413e106a222
institution OA Journals
issn 2117-4458
language English
publishDate 2024-01-01
publisher EDP Sciences
record_format Article
series BIO Web of Conferences
spelling doaj-art-384bea86f6174bb7983ca413e106a2222025-08-20T02:19:22ZengEDP SciencesBIO Web of Conferences2117-44582024-01-011410405010.1051/bioconf/202414104050bioconf_AgriculturalScience2024_04050Machine learning in environmental sustainability factor analysis in the agricultural sectorKukartsev Vladislav0Kozlova Anastasia1Kukarceva Svetlana2Moscow Timiryazev Agricultural Academy, Russian State Agrarian UniversityReshetnev Siberian State University of Science and TechnologyMoscow Timiryazev Agricultural Academy, Russian State Agrarian UniversityThe study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by determining how changes in one variable relate to changes in another. This established a foundation for further in-depth data analysis. For a deeper understanding and simplified interpretation of the data, factor analysis was utilized. This method helped to identify latent factors that explain the relationships between observed variables and to reduce the number of variables by grouping them. This made the analysis easier and facilitated the identification of key components affecting the data. Logistic regression was applied to build data models. This method is used to model the probability of a specific event occurring based on independent variables, allowing for the classification and prediction of categorical outcomes. The logistic function was used to estimate probabilities and the relationship between the dependent variable and predictors. To enhance the performance of the logistic regression model, a Weight of Evidence (WoE) analysis was conducted. This method converts categorical and continuous variables into numerical formats, simplifying data interpretation and improving the model’s predictive capabilities. WoE analysis helps to identify significant factors, improve the linear relationship between predictors and the dependent variable, and reduce the impact of outliers, which is particularly important in areas such as credit scoring. The results of applying these methods showed that the model based on correlation and factor analysis explained 27.51% of the information on the training set and 76.04% on the test set.https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf
spellingShingle Kukartsev Vladislav
Kozlova Anastasia
Kukarceva Svetlana
Machine learning in environmental sustainability factor analysis in the agricultural sector
BIO Web of Conferences
title Machine learning in environmental sustainability factor analysis in the agricultural sector
title_full Machine learning in environmental sustainability factor analysis in the agricultural sector
title_fullStr Machine learning in environmental sustainability factor analysis in the agricultural sector
title_full_unstemmed Machine learning in environmental sustainability factor analysis in the agricultural sector
title_short Machine learning in environmental sustainability factor analysis in the agricultural sector
title_sort machine learning in environmental sustainability factor analysis in the agricultural sector
url https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf
work_keys_str_mv AT kukartsevvladislav machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector
AT kozlovaanastasia machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector
AT kukarcevasvetlana machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector