Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle

Abstract Background Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging. Methods We generated 11 feature sets f...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhida Zhao, Qunhao Niu, Jiayuan Wu, Tianyi Wu, Xueyuan Xie, Zezhao Wang, Lupei Zhang, Huijiang Gao, Xue Gao, Lingyang Xu, Bo Zhu, Junya Li
Format: Article
Language:English
Published: BMC 2024-12-01
Series:Biology Direct
Online Access:https://doi.org/10.1186/s13062-024-00574-y
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841559845193383936
author Zhida Zhao
Qunhao Niu
Jiayuan Wu
Tianyi Wu
Xueyuan Xie
Zezhao Wang
Lupei Zhang
Huijiang Gao
Xue Gao
Lingyang Xu
Bo Zhu
Junya Li
author_facet Zhida Zhao
Qunhao Niu
Jiayuan Wu
Tianyi Wu
Xueyuan Xie
Zezhao Wang
Lupei Zhang
Huijiang Gao
Xue Gao
Lingyang Xu
Bo Zhu
Junya Li
author_sort Zhida Zhao
collection DOAJ
description Abstract Background Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging. Methods We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis. Results We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database. Conclusions Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.
format Article
id doaj-art-d7a984ac7b494153bdae66fef7bd10e5
institution Kabale University
issn 1745-6150
language English
publishDate 2024-12-01
publisher BMC
record_format Article
series Biology Direct
spelling doaj-art-d7a984ac7b494153bdae66fef7bd10e52025-01-05T12:11:00ZengBMCBiology Direct1745-61502024-12-0119111610.1186/s13062-024-00574-yIntegrating multi-layered biological priors to improve genomic prediction accuracy in beef cattleZhida Zhao0Qunhao Niu1Jiayuan Wu2Tianyi Wu3Xueyuan Xie4Zezhao Wang5Lupei Zhang6Huijiang Gao7Xue Gao8Lingyang Xu9Bo Zhu10Junya Li11Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesKey Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural SciencesAbstract Background Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging. Methods We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis. Results We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database. Conclusions Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.https://doi.org/10.1186/s13062-024-00574-y
spellingShingle Zhida Zhao
Qunhao Niu
Jiayuan Wu
Tianyi Wu
Xueyuan Xie
Zezhao Wang
Lupei Zhang
Huijiang Gao
Xue Gao
Lingyang Xu
Bo Zhu
Junya Li
Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
Biology Direct
title Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
title_full Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
title_fullStr Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
title_full_unstemmed Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
title_short Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle
title_sort integrating multi layered biological priors to improve genomic prediction accuracy in beef cattle
url https://doi.org/10.1186/s13062-024-00574-y
work_keys_str_mv AT zhidazhao integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT qunhaoniu integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT jiayuanwu integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT tianyiwu integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT xueyuanxie integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT zezhaowang integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT lupeizhang integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT huijianggao integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT xuegao integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT lingyangxu integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT bozhu integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle
AT junyali integratingmultilayeredbiologicalpriorstoimprovegenomicpredictionaccuracyinbeefcattle