Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods

Agronomic traits and quality traits of alfalfa are of great importance to the feed industry. Genomic selection (GS) based on genotyping-by-sequencing (GBS) data, if it achieves moderate to high accuracy, has the potential to significantly shorten breeding cycles for complex traits and accelerate gen...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaoyue Zhu, Ruixin Zhang, Tianxiang Zhang, Changhong Guo, Yongjun Shu
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Agronomy
Subjects:
Online Access:https://www.mdpi.com/2073-4395/14/12/2768
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850240010031202304
author Xiaoyue Zhu
Ruixin Zhang
Tianxiang Zhang
Changhong Guo
Yongjun Shu
author_facet Xiaoyue Zhu
Ruixin Zhang
Tianxiang Zhang
Changhong Guo
Yongjun Shu
author_sort Xiaoyue Zhu
collection DOAJ
description Agronomic traits and quality traits of alfalfa are of great importance to the feed industry. Genomic selection (GS) based on genotyping-by-sequencing (GBS) data, if it achieves moderate to high accuracy, has the potential to significantly shorten breeding cycles for complex traits and accelerate genetic progress. This study aims to investigate the effect of different reference genomes on the prediction accuracy of genomic selection. A total of 11 Bayesian and machine learning models and nine different reference genomes were used to conduct genomic selection on five traits in 385 alfalfa accessions. The accuracy of GS was evaluated using five-fold cross-validation, based on the correlation between genomic estimated breeding values (GEBVs) and estimated breeding values (EBVs). For the five traits, it was found that traits with high heritability exhibited significantly higher prediction accuracy. The prediction accuracy fluctuated minimally across different reference genomes, with the diploid genome showing relatively higher accuracy. For two high-heritability traits, fall dormancy and plant height, predictions were made after SNP density reduction, and it was observed that density had little effect on prediction accuracy. However, for the fall dormancy trait in the diploid genome, more than half of the models showed regular fluctuations, with prediction accuracy increasing as SNP density increased. In conclusion, this study provides a theoretical basis for precision breeding of alfalfa and other polyploid crops by combining different reference genomes and models, and offers important guidance for optimizing future genomic selection strategies.
format Article
id doaj-art-678d7f0a46114a39b8ba772215a2fb9d
institution OA Journals
issn 2073-4395
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Agronomy
spelling doaj-art-678d7f0a46114a39b8ba772215a2fb9d2025-08-20T02:00:59ZengMDPI AGAgronomy2073-43952024-11-011412276810.3390/agronomy14122768Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian MethodsXiaoyue Zhu0Ruixin Zhang1Tianxiang Zhang2Changhong Guo3Yongjun Shu4Key Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, College of Life Science and Technology, Harbin Normal University, Harbin 150025, ChinaKey Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, College of Life Science and Technology, Harbin Normal University, Harbin 150025, ChinaKey Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, College of Life Science and Technology, Harbin Normal University, Harbin 150025, ChinaKey Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, College of Life Science and Technology, Harbin Normal University, Harbin 150025, ChinaKey Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, College of Life Science and Technology, Harbin Normal University, Harbin 150025, ChinaAgronomic traits and quality traits of alfalfa are of great importance to the feed industry. Genomic selection (GS) based on genotyping-by-sequencing (GBS) data, if it achieves moderate to high accuracy, has the potential to significantly shorten breeding cycles for complex traits and accelerate genetic progress. This study aims to investigate the effect of different reference genomes on the prediction accuracy of genomic selection. A total of 11 Bayesian and machine learning models and nine different reference genomes were used to conduct genomic selection on five traits in 385 alfalfa accessions. The accuracy of GS was evaluated using five-fold cross-validation, based on the correlation between genomic estimated breeding values (GEBVs) and estimated breeding values (EBVs). For the five traits, it was found that traits with high heritability exhibited significantly higher prediction accuracy. The prediction accuracy fluctuated minimally across different reference genomes, with the diploid genome showing relatively higher accuracy. For two high-heritability traits, fall dormancy and plant height, predictions were made after SNP density reduction, and it was observed that density had little effect on prediction accuracy. However, for the fall dormancy trait in the diploid genome, more than half of the models showed regular fluctuations, with prediction accuracy increasing as SNP density increased. In conclusion, this study provides a theoretical basis for precision breeding of alfalfa and other polyploid crops by combining different reference genomes and models, and offers important guidance for optimizing future genomic selection strategies.https://www.mdpi.com/2073-4395/14/12/2768genomic selectionalfalfaBayesian modelmachine learningSNP densityheritability
spellingShingle Xiaoyue Zhu
Ruixin Zhang
Tianxiang Zhang
Changhong Guo
Yongjun Shu
Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
Agronomy
genomic selection
alfalfa
Bayesian model
machine learning
SNP density
heritability
title Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
title_full Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
title_fullStr Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
title_full_unstemmed Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
title_short Genomic Selection in Alfalfa Across Multiple Ploidy Levels: A Comparative Study Using Machine Learning and Bayesian Methods
title_sort genomic selection in alfalfa across multiple ploidy levels a comparative study using machine learning and bayesian methods
topic genomic selection
alfalfa
Bayesian model
machine learning
SNP density
heritability
url https://www.mdpi.com/2073-4395/14/12/2768
work_keys_str_mv AT xiaoyuezhu genomicselectioninalfalfaacrossmultipleploidylevelsacomparativestudyusingmachinelearningandbayesianmethods
AT ruixinzhang genomicselectioninalfalfaacrossmultipleploidylevelsacomparativestudyusingmachinelearningandbayesianmethods
AT tianxiangzhang genomicselectioninalfalfaacrossmultipleploidylevelsacomparativestudyusingmachinelearningandbayesianmethods
AT changhongguo genomicselectioninalfalfaacrossmultipleploidylevelsacomparativestudyusingmachinelearningandbayesianmethods
AT yongjunshu genomicselectioninalfalfaacrossmultipleploidylevelsacomparativestudyusingmachinelearningandbayesianmethods