Genotype-Driven Phenotype Prediction in Onion Breeding: Machine Learning Models for Enhanced Bulb Weight Selection

Bibliographic Details
Main Authors: Junhwa Choi, Sunghyun Cho, Subin Choi, Myunghee Jung, Yu-jin Lim, Eunchae Lee, Jaewon Lim, Han Yong Park, Younhee Shin
Format: Article
Language: English
Published: MDPI AG 2024-12-01
Series: Agriculture
Online Access: https://www.mdpi.com/2077-0472/14/12/2239
Description
Summary: Onions (Allium cepa L.) are a globally significant horticultural crop, ranking second only to tomatoes in terms of cultivation and consumption. However, due to the crop’s complex genome structure, lengthy growth cycle, self-incompatibility, and susceptibility to disease, onion breeding is challenging. To address these issues, we implemented digital breeding techniques utilizing genomic data from 98 elite onion lines. We identified 51,499 high-quality variants and employed these data to construct a genomic estimated breeding value (GEBV) model and apply machine learning methods for bulb weight prediction. Validation with 260 new individuals revealed that the machine learning model achieved an accuracy of 83.2% and required only 39 SNPs. Subsequent in silico crossbreeding simulations indicated that offspring from the top 5% of elite lines exhibited the highest bulb weights, aligning with traditional phenotypic selection methods. This approach demonstrates that early-stage selection based on genotypic information followed by crossbreeding can achieve economically viable breeding results. This methodology is not restricted to bulb weight and can be applied to various horticultural traits, significantly improving the efficiency of onion breeding through advanced digital technologies. The integration of genomic data, machine learning, and computer simulations provides a powerful framework for data-driven breeding strategies, accelerating the development of superior onion varieties to meet global demand.
ISSN: 2077-0472
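
The summary mentions a GEBV model and in silico crossbreeding but gives no implementation details. The following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes purely additive SNP effects, genotypes coded as alternate-allele dosage (0, 1, 2), and unlinked loci that segregate independently. All function names, effect sizes, and genotypes here are hypothetical toy values.

```python
import random


def gebv(genotype, effects, intercept=0.0):
    """Additive breeding value: intercept + sum of (dosage * SNP effect)."""
    return intercept + sum(g * b for g, b in zip(genotype, effects))


def gamete(genotype, rng):
    """Sample one allele per SNP from a parent coded as 0/1/2 dosage.

    Assumption: loci are unlinked, so each SNP is sampled independently.
    """
    alleles = []
    for g in genotype:
        if g == 0:
            alleles.append(0)          # homozygous reference
        elif g == 2:
            alleles.append(1)          # homozygous alternate
        else:
            alleles.append(rng.randint(0, 1))  # heterozygote: 50/50
    return alleles


def cross(parent_a, parent_b, rng):
    """Simulate one offspring genotype by combining one gamete per parent."""
    return [a + b for a, b in zip(gamete(parent_a, rng), gamete(parent_b, rng))]


def mean_offspring_gebv(parent_a, parent_b, effects, n_offspring=200, seed=0):
    """Average predicted value over simulated offspring of one cross."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_offspring):
        total += gebv(cross(parent_a, parent_b, rng), effects)
    return total / n_offspring


# Toy example: three SNPs with made-up effects on bulb weight.
effects = [1.5, -0.5, 2.0]
line_a = [2, 0, 1]
line_b = [2, 2, 1]
print(mean_offspring_gebv(line_a, line_b, effects))
```

In a sketch like this, candidate crosses among the top-ranked lines could be compared by their mean simulated offspring value. A realistic simulation would also need linkage and recombination, which are ignored here.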