Machine learning in environmental sustainability factor analysis in the agricultural sector
The study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by d...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
EDP Sciences
2024-01-01
|
| Series: | BIO Web of Conferences |
| Online Access: | https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850175923383435264 |
|---|---|
| author | Kukartsev Vladislav Kozlova Anastasia Kukarceva Svetlana |
| author_facet | Kukartsev Vladislav Kozlova Anastasia Kukarceva Svetlana |
| author_sort | Kukartsev Vladislav |
| collection | DOAJ |
| description | The study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by determining how changes in one variable relate to changes in another. This established a foundation for further in-depth data analysis. For a deeper understanding and simplified interpretation of the data, factor analysis was utilized. This method helped to identify latent factors that explain the relationships between observed variables and to reduce the number of variables by grouping them. This made the analysis easier and facilitated the identification of key components affecting the data. Logistic regression was applied to build data models. This method is used to model the probability of a specific event occurring based on independent variables, allowing for the classification and prediction of categorical outcomes. The logistic function was used to estimate probabilities and the relationship between the dependent variable and predictors. To enhance the performance of the logistic regression model, a Weight of Evidence (WoE) analysis was conducted. This method converts categorical and continuous variables into numerical formats, simplifying data interpretation and improving the model’s predictive capabilities. WoE analysis helps to identify significant factors, improve the linear relationship between predictors and the dependent variable, and reduce the impact of outliers, which is particularly important in areas such as credit scoring. The results of applying these methods showed that the model based on correlation and factor analysis explained 27.51% of the information on the training set and 76.04% on the test set. |
| format | Article |
| id | doaj-art-384bea86f6174bb7983ca413e106a222 |
| institution | OA Journals |
| issn | 2117-4458 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | EDP Sciences |
| record_format | Article |
| series | BIO Web of Conferences |
| spelling | doaj-art-384bea86f6174bb7983ca413e106a2222025-08-20T02:19:22ZengEDP SciencesBIO Web of Conferences2117-44582024-01-011410405010.1051/bioconf/202414104050bioconf_AgriculturalScience2024_04050Machine learning in environmental sustainability factor analysis in the agricultural sectorKukartsev Vladislav0Kozlova Anastasia1Kukarceva Svetlana2Moscow Timiryazev Agricultural Academy, Russian State Agrarian UniversityReshetnev Siberian State University of Science and TechnologyMoscow Timiryazev Agricultural Academy, Russian State Agrarian UniversityThe study employed several key data analysis methods aimed at enhancing the understanding of relationships between variables and improving prediction accuracy. The primary tool used was correlation analysis, which allowed for the identification of the degree of association between two variables by determining how changes in one variable relate to changes in another. This established a foundation for further in-depth data analysis. For a deeper understanding and simplified interpretation of the data, factor analysis was utilized. This method helped to identify latent factors that explain the relationships between observed variables and to reduce the number of variables by grouping them. This made the analysis easier and facilitated the identification of key components affecting the data. Logistic regression was applied to build data models. This method is used to model the probability of a specific event occurring based on independent variables, allowing for the classification and prediction of categorical outcomes. The logistic function was used to estimate probabilities and the relationship between the dependent variable and predictors. To enhance the performance of the logistic regression model, a Weight of Evidence (WoE) analysis was conducted. This method converts categorical and continuous variables into numerical formats, simplifying data interpretation and improving the model’s predictive capabilities. WoE analysis helps to identify significant factors, improve the linear relationship between predictors and the dependent variable, and reduce the impact of outliers, which is particularly important in areas such as credit scoring. The results of applying these methods showed that the model based on correlation and factor analysis explained 27.51% of the information on the training set and 76.04% on the test set.https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf |
| spellingShingle | Kukartsev Vladislav Kozlova Anastasia Kukarceva Svetlana Machine learning in environmental sustainability factor analysis in the agricultural sector BIO Web of Conferences |
| title | Machine learning in environmental sustainability factor analysis in the agricultural sector |
| title_full | Machine learning in environmental sustainability factor analysis in the agricultural sector |
| title_fullStr | Machine learning in environmental sustainability factor analysis in the agricultural sector |
| title_full_unstemmed | Machine learning in environmental sustainability factor analysis in the agricultural sector |
| title_short | Machine learning in environmental sustainability factor analysis in the agricultural sector |
| title_sort | machine learning in environmental sustainability factor analysis in the agricultural sector |
| url | https://www.bio-conferences.org/articles/bioconf/pdf/2024/60/bioconf_AgriculturalScience2024_04050.pdf |
| work_keys_str_mv | AT kukartsevvladislav machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector AT kozlovaanastasia machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector AT kukarcevasvetlana machinelearninginenvironmentalsustainabilityfactoranalysisintheagriculturalsector |