Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm

Crop breeders often face challenges due to limited data availability when making crucial decisions, such as selecting top-performing varieties/hybrids for further experiments, registration, and commercialization. Evaluating all varieties/hybrids across all fields is impractical due to high experimen...

Full description

Saved in:
Bibliographic Details
Main Authors: Milica Brkić, Stefan Hačko, Miloš Radovanović, Vladimir Crnojević, Sanja Brdar
Format: Article
Language:English
Published: Taylor & Francis Group 2024-12-01
Series:Applied Artificial Intelligence
Online Access:https://www.tandfonline.com/doi/10.1080/08839514.2024.2421687
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850064136253210624
author Milica Brkić
Stefan Hačko
Miloš Radovanović
Vladimir Crnojević
Sanja Brdar
author_facet Milica Brkić
Stefan Hačko
Miloš Radovanović
Vladimir Crnojević
Sanja Brdar
author_sort Milica Brkić
collection DOAJ
description Crop breeders often face challenges due to limited data availability when making crucial decisions, such as selecting top-performing varieties/hybrids for further experiments, registration, and commercialization. Evaluating all varieties/hybrids across all fields is impractical due to high experimental and time costs, as well as the limited number of locations for planting. This article aims to evaluate the performance of various maize hybrids in untested locations using historical data. The problem is approached through a matrix framework, where hybrids and fields correspond to rows and columns, respectively, with entries representing the yield of a specific hybrid at a given location. As this matrix is typically sparse, the task is to fill in missing data. Agronomists are primarily interested in the performance of top hybrids at specific locations for smart seed selection. To address this, we introduce a novel application of the Data Fusion by Matrix Factorization (DFMF) algorithm for predicting crop yields using maize data from the 2019 Syngenta Crop Challenge. The DFMF results are compared with the Random Forest (RF) algorithm as a benchmark, focusing on model performance for smart seed selection. Our analysis highlights the advantages of the DFMF approach over the traditional RF method in this context.
format Article
id doaj-art-da48dccd98e645819a3a22b3affbc6a7
institution DOAJ
issn 0883-9514
1087-6545
language English
publishDate 2024-12-01
publisher Taylor & Francis Group
record_format Article
series Applied Artificial Intelligence
spelling doaj-art-da48dccd98e645819a3a22b3affbc6a72025-08-20T02:49:22ZengTaylor & Francis GroupApplied Artificial Intelligence0883-95141087-65452024-12-0138110.1080/08839514.2024.2421687Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization AlgorithmMilica Brkić0Stefan Hačko1Miloš Radovanović2Vladimir Crnojević3Sanja BrdarCenter of Information Technologies, BioSense Institute, Novi Sad, SerbiaCenter of Information Technologies, BioSense Institute, Novi Sad, SerbiaDepartment of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Novi Sad, SerbiaCenter of Information Technologies, BioSense Institute, Novi Sad, SerbiaCrop breeders often face challenges due to limited data availability when making crucial decisions, such as selecting top-performing varieties/hybrids for further experiments, registration, and commercialization. Evaluating all varieties/hybrids across all fields is impractical due to high experimental and time costs, as well as the limited number of locations for planting. This article aims to evaluate the performance of various maize hybrids in untested locations using historical data. The problem is approached through a matrix framework, where hybrids and fields correspond to rows and columns, respectively, with entries representing the yield of a specific hybrid at a given location. As this matrix is typically sparse, the task is to fill in missing data. Agronomists are primarily interested in the performance of top hybrids at specific locations for smart seed selection. To address this, we introduce a novel application of the Data Fusion by Matrix Factorization (DFMF) algorithm for predicting crop yields using maize data from the 2019 Syngenta Crop Challenge. The DFMF results are compared with the Random Forest (RF) algorithm as a benchmark, focusing on model performance for smart seed selection. Our analysis highlights the advantages of the DFMF approach over the traditional RF method in this context.https://www.tandfonline.com/doi/10.1080/08839514.2024.2421687
spellingShingle Milica Brkić
Stefan Hačko
Miloš Radovanović
Vladimir Crnojević
Sanja Brdar
Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
Applied Artificial Intelligence
title Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
title_full Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
title_fullStr Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
title_full_unstemmed Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
title_short Maize Hybrids Performance Evaluation with Data Fusion by Matrix Factorization Algorithm
title_sort maize hybrids performance evaluation with data fusion by matrix factorization algorithm
url https://www.tandfonline.com/doi/10.1080/08839514.2024.2421687
work_keys_str_mv AT milicabrkic maizehybridsperformanceevaluationwithdatafusionbymatrixfactorizationalgorithm
AT stefanhacko maizehybridsperformanceevaluationwithdatafusionbymatrixfactorizationalgorithm
AT milosradovanovic maizehybridsperformanceevaluationwithdatafusionbymatrixfactorizationalgorithm
AT vladimircrnojevic maizehybridsperformanceevaluationwithdatafusionbymatrixfactorizationalgorithm
AT sanjabrdar maizehybridsperformanceevaluationwithdatafusionbymatrixfactorizationalgorithm