TY - JOUR
T1 - Assessment of the genomic variation in a cattle population by re-sequencing of key animals at low to medium coverage
AU - Jansen, Sandra
AU - Aigner, Bernhard
AU - Pausch, Hubert
AU - Wysocki, Michal
AU - Eck, Sebastian
AU - Benet-Pagès, Anna
AU - Graf, Elisabeth
AU - Wieland, Thomas
AU - Strom, Tim M.
AU - Meitinger, Thomas
AU - Fries, Ruedi
N1 - Funding Information:
We thank the Arbeitsgemeinschaft Süddeutscher Rinderzüchter und Besamungsorganisation e.V (ASR) and the Förderverein Biotechnologieforschung e.V. (FBF) for sharing genotype data. Swissgenetics is acknowledged for providing DNA material of a swiss Simmental bull and Dr. Annette Pfitzner (Clinic for Ruminants, Faculty of Veterinary Science, University of Munich) for providing DNA material of six thrombopathic animals. The study was supported by the German Federal Ministry of Education and Research (BMBF) within the AgroClustEr “Synbreed – Synergistic plant and animal breeding” (FKZ: 0315528A).
PY - 2013/7/4
Y1 - 2013/7/4
N2 - Background: Genome- and population-wide re-sequencing would allow for most efficient detection of causal trait variants. However, despite a strong decrease of costs for next-generation sequencing in the last few years, re-sequencing of large numbers of individuals is not yet affordable. We therefore resorted to re-sequencing of a limited number of bovine animals selected to explain a major proportion of the population's genomic variation, so called key animals, in order to provide a catalogue of functional variants and a substrate for population- and genome-wide imputation of variable sites.Results: Forty-three animals accounting for about 69 percent of the genetic diversity of the Fleckvieh population, a cattle breed of Southern Germany and Austria, were sequenced with coverages ranging from 4.17 to 24.98 and averaging 7.46. After alignment to the reference genome (UMD3.1) and multi-sample variant calling, more than 17 million variant positions were identified, about 90 percent biallelic single nucleotide variants (SNVs) and 10 percent short insertions and deletions (InDels). The comparison with high-density chip data revealed a sensitivity of at least 92 percent and a specificity of 81 percent for sequencing based genotyping, and 97 percent and 93 percent when a imputation step was included. There are 91,733 variants in coding regions of 18,444 genes, 46 percent being non-synonymous exchanges, of which 575 variants are predicted to cause premature stop codons. Three variants are listed in the OMIA database as causal for specific phenotypes.Conclusions: Low- to medium-coverage re-sequencing of individuals explaining a major fraction of a population's genomic variation allows for the efficient and reliable detection of most variants. Imputation strongly improves genotype quality of lowly covered samples and thus enables maximum density genotyping by sequencing. The functional annotation of variants provides the basis for exhaustive genotype imputation in the population, e.g., for highest-resolution genome-wide association studies.
AB - Background: Genome- and population-wide re-sequencing would allow for most efficient detection of causal trait variants. However, despite a strong decrease of costs for next-generation sequencing in the last few years, re-sequencing of large numbers of individuals is not yet affordable. We therefore resorted to re-sequencing of a limited number of bovine animals selected to explain a major proportion of the population's genomic variation, so called key animals, in order to provide a catalogue of functional variants and a substrate for population- and genome-wide imputation of variable sites.Results: Forty-three animals accounting for about 69 percent of the genetic diversity of the Fleckvieh population, a cattle breed of Southern Germany and Austria, were sequenced with coverages ranging from 4.17 to 24.98 and averaging 7.46. After alignment to the reference genome (UMD3.1) and multi-sample variant calling, more than 17 million variant positions were identified, about 90 percent biallelic single nucleotide variants (SNVs) and 10 percent short insertions and deletions (InDels). The comparison with high-density chip data revealed a sensitivity of at least 92 percent and a specificity of 81 percent for sequencing based genotyping, and 97 percent and 93 percent when a imputation step was included. There are 91,733 variants in coding regions of 18,444 genes, 46 percent being non-synonymous exchanges, of which 575 variants are predicted to cause premature stop codons. Three variants are listed in the OMIA database as causal for specific phenotypes.Conclusions: Low- to medium-coverage re-sequencing of individuals explaining a major fraction of a population's genomic variation allows for the efficient and reliable detection of most variants. Imputation strongly improves genotype quality of lowly covered samples and thus enables maximum density genotyping by sequencing. The functional annotation of variants provides the basis for exhaustive genotype imputation in the population, e.g., for highest-resolution genome-wide association studies.
KW - Genotyping by sequencing
KW - Low-coverage
KW - Next-generation sequencing
KW - Variant annotation
UR - http://www.scopus.com/inward/record.url?scp=84879815835&partnerID=8YFLogxK
U2 - 10.1186/1471-2164-14-446
DO - 10.1186/1471-2164-14-446
M3 - Article
C2 - 23826801
AN - SCOPUS:84879815835
SN - 1471-2164
VL - 14
JO - BMC Genomics
JF - BMC Genomics
IS - 1
M1 - 446
ER -