Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits

Abstract Background Milk production traits are complex and influenced by many genetic and environmental factors. Although extensive research has been performed for these traits, with many associations unveiled thus far, due to their crucial economic importance, complex genetic architecture, and the...

Full description

Saved in:
Bibliographic Details
Main Authors: Ana-Marija Križanac, Christian Reimer, Johannes Heise, Zengting Liu, Jennie E. Pryce, Jörn Bennewitz, Georg Thaller, Clemens Falker-Gieske, Jens Tetens
Format: Article
Language:deu
Published: BMC 2025-02-01
Series:Genetics Selection Evolution
Online Access:https://doi.org/10.1186/s12711-025-00951-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823863595775033344
author Ana-Marija Križanac
Christian Reimer
Johannes Heise
Zengting Liu
Jennie E. Pryce
Jörn Bennewitz
Georg Thaller
Clemens Falker-Gieske
Jens Tetens
author_facet Ana-Marija Križanac
Christian Reimer
Johannes Heise
Zengting Liu
Jennie E. Pryce
Jörn Bennewitz
Georg Thaller
Clemens Falker-Gieske
Jens Tetens
author_sort Ana-Marija Križanac
collection DOAJ
description Abstract Background Milk production traits are complex and influenced by many genetic and environmental factors. Although extensive research has been performed for these traits, with many associations unveiled thus far, due to their crucial economic importance, complex genetic architecture, and the fact that causal variants in cattle are still scarce, there is a need for a better understanding of their genetic background. In this study, we aimed to identify new candidate loci associated with milk production traits in German Holstein cattle, the most important dairy breed in Germany and worldwide. For that purpose, 180,217 cattle were imputed to the sequence level and large-scale genome-wide association study (GWAS) followed by fine-mapping and evolutionary and functional annotation were carried out to identify and prioritize new association signals. Results Using the imputed sequence data of a large cattle dataset, we identified 50,876 significant variants, confirming many known and identifying previously unreported candidate variants for milk (MY), fat (FY), and protein yield (PY). Genome-wide significant signals were fine-mapped with the Bayesian approach that determines the credible variant sets and generates the probability of causality for each signal. The variants with the highest probabilities of being causal were further classified using external information about the function and evolution, making the prioritization for subsequent validation experiments easier. The top potential causal variants determined with fine-mapping explained a large percentage of genetic variance compared to random ones; 178 variants explained 11.5%, 104 explained 7.7%, and 68 variants explained 3.9% of the variance for MY, FY, and PY, respectively, demonstrating the potential for causality. Conclusions Our findings proved the power of large samples and sequence-based GWAS in detecting new association signals. In order to fully exploit the power of GWAS, one should aim at very large samples combined with whole-genome sequence data. These can also come with both computational and time burdens, as presented in our study. Although milk production traits in cattle are comprehensively investigated, the genetic background of these traits is still not fully understood, with the potential for many new associations to be revealed, as shown. With constantly growing sample sizes, we expect more insights into the genetic architecture of milk production traits in the future.
format Article
id doaj-art-4d950d2b0ecf470f8214888608a8a6bb
institution Kabale University
issn 1297-9686
language deu
publishDate 2025-02-01
publisher BMC
record_format Article
series Genetics Selection Evolution
spelling doaj-art-4d950d2b0ecf470f8214888608a8a6bb2025-02-09T12:04:32ZdeuBMCGenetics Selection Evolution1297-96862025-02-0157112010.1186/s12711-025-00951-9Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traitsAna-Marija Križanac0Christian Reimer1Johannes Heise2Zengting Liu3Jennie E. Pryce4Jörn Bennewitz5Georg Thaller6Clemens Falker-Gieske7Jens Tetens8Department of Animal Sciences, University of GoettingenCenter for Integrated Breeding Research, Department of Animal Sciences, University of GoettingenVereinigte Informationssysteme Tierhaltung w.V. (VIT)Vereinigte Informationssysteme Tierhaltung w.V. (VIT)Agriculture Victoria Research, AgriBio, Centre for AgriBioscienceInstitute of Animal Science, University of HohenheimInstitute of Animal Breeding and Husbandry, Christian-Albrechts-UniversityDepartment of Animal Sciences, University of GoettingenDepartment of Animal Sciences, University of GoettingenAbstract Background Milk production traits are complex and influenced by many genetic and environmental factors. Although extensive research has been performed for these traits, with many associations unveiled thus far, due to their crucial economic importance, complex genetic architecture, and the fact that causal variants in cattle are still scarce, there is a need for a better understanding of their genetic background. In this study, we aimed to identify new candidate loci associated with milk production traits in German Holstein cattle, the most important dairy breed in Germany and worldwide. For that purpose, 180,217 cattle were imputed to the sequence level and large-scale genome-wide association study (GWAS) followed by fine-mapping and evolutionary and functional annotation were carried out to identify and prioritize new association signals. Results Using the imputed sequence data of a large cattle dataset, we identified 50,876 significant variants, confirming many known and identifying previously unreported candidate variants for milk (MY), fat (FY), and protein yield (PY). Genome-wide significant signals were fine-mapped with the Bayesian approach that determines the credible variant sets and generates the probability of causality for each signal. The variants with the highest probabilities of being causal were further classified using external information about the function and evolution, making the prioritization for subsequent validation experiments easier. The top potential causal variants determined with fine-mapping explained a large percentage of genetic variance compared to random ones; 178 variants explained 11.5%, 104 explained 7.7%, and 68 variants explained 3.9% of the variance for MY, FY, and PY, respectively, demonstrating the potential for causality. Conclusions Our findings proved the power of large samples and sequence-based GWAS in detecting new association signals. In order to fully exploit the power of GWAS, one should aim at very large samples combined with whole-genome sequence data. These can also come with both computational and time burdens, as presented in our study. Although milk production traits in cattle are comprehensively investigated, the genetic background of these traits is still not fully understood, with the potential for many new associations to be revealed, as shown. With constantly growing sample sizes, we expect more insights into the genetic architecture of milk production traits in the future.https://doi.org/10.1186/s12711-025-00951-9
spellingShingle Ana-Marija Križanac
Christian Reimer
Johannes Heise
Zengting Liu
Jennie E. Pryce
Jörn Bennewitz
Georg Thaller
Clemens Falker-Gieske
Jens Tetens
Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
Genetics Selection Evolution
title Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
title_full Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
title_fullStr Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
title_full_unstemmed Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
title_short Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits
title_sort sequence based gwas in 180 000 german holstein cattle reveals new candidate variants for milk production traits
url https://doi.org/10.1186/s12711-025-00951-9
work_keys_str_mv AT anamarijakrizanac sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT christianreimer sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT johannesheise sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT zengtingliu sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT jennieepryce sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT jornbennewitz sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT georgthaller sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT clemensfalkergieske sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits
AT jenstetens sequencebasedgwasin180000germanholsteincattlerevealsnewcandidatevariantsformilkproductiontraits