TY - JOUR
T1 - Whole genome sequencing and imputation in isolated populations identify genetic associations with medically-relevant complex traits
AU - Southam, Lorraine
AU - Gilly, Arthur
AU - Süveges, Daniel
AU - Farmaki, Aliki Eleni
AU - Schwartzentruber, Jeremy
AU - Tachmazidou, Ioanna
AU - Matchan, Angela
AU - Rayner, Nigel W.
AU - Tsafantakis, Emmanouil
AU - Karaleftheri, Maria
AU - Xue, Yali
AU - Dedoussis, George
AU - Zeggini, Eleftheria
N1 - Publisher Copyright:
© 2017 Japan Antibiotics Research Association All rights reserved.
PY - 2017/5/26
Y1 - 2017/5/26
N2 - Next-generation association studies can be empowered by sequence-based imputation and by studying founder populations. Here we report ∼9.5 million variants from whole-genome sequencing (WGS) of a Cretan-isolated population, and show enrichment of rare and low-frequency variants with predicted functional consequences. We use a WGS-based imputation approach utilizing 10,422 reference haplotypes to perform genome-wide association analyses and observe 17 genome-wide significant, independent signals, including replicating evidence for association at eight novel low-frequency variant signals. Two novel cardiometabolic associations are at lead variants unique to the founder population sequences: Chr16:70790626 (high-density lipoprotein levels beta -1.71 (SE 0.25), P=1.57 × 10 -11, effect allele frequency (EAF) 0.006); and rs145556679 (triglycerides levels beta -1.13 (SE 0.17), P=2.53 × 10 -11, EAF 0.013). Our findings add empirical support to the contribution of low-frequency variants in complex traits, demonstrate the advantage of including population-specific sequences in imputation panels and exemplify the power gains afforded by population isolates.
AB - Next-generation association studies can be empowered by sequence-based imputation and by studying founder populations. Here we report ∼9.5 million variants from whole-genome sequencing (WGS) of a Cretan-isolated population, and show enrichment of rare and low-frequency variants with predicted functional consequences. We use a WGS-based imputation approach utilizing 10,422 reference haplotypes to perform genome-wide association analyses and observe 17 genome-wide significant, independent signals, including replicating evidence for association at eight novel low-frequency variant signals. Two novel cardiometabolic associations are at lead variants unique to the founder population sequences: Chr16:70790626 (high-density lipoprotein levels beta -1.71 (SE 0.25), P=1.57 × 10 -11, effect allele frequency (EAF) 0.006); and rs145556679 (triglycerides levels beta -1.13 (SE 0.17), P=2.53 × 10 -11, EAF 0.013). Our findings add empirical support to the contribution of low-frequency variants in complex traits, demonstrate the advantage of including population-specific sequences in imputation panels and exemplify the power gains afforded by population isolates.
UR - http://www.scopus.com/inward/record.url?scp=85020050811&partnerID=8YFLogxK
U2 - 10.1038/ncomms15606
DO - 10.1038/ncomms15606
M3 - Article
C2 - 28548082
AN - SCOPUS:85020050811
SN - 2041-1723
VL - 8
JO - Nature Communications
JF - Nature Communications
M1 - 15606
ER -