TY - JOUR
T1 - DeepWAS
T2 - Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning
AU - Arloth, Janine
AU - Eraslan, Gökcen
AU - Andlauer, Till F.M.
AU - Martins, Jade
AU - Iurato, Stella
AU - Kühnel, Brigitte
AU - Waldenberger, Melanie
AU - Frank, Josef
AU - Gold, Ralf
AU - Hemmer, Bernhard
AU - Luessi, Felix
AU - Nischwitz, Sandra
AU - Paul, Friedemann
AU - Wiendl, Heinz
AU - Gieger, Christian
AU - Heilmann-Heimbach, Stefanie
AU - Kacprowski, Tim
AU - Laudes, Matthias
AU - Meitinger, Thomas
AU - Peters, Annette
AU - Rawal, Rajesh
AU - Strauch, Konstantin
AU - Lucae, Susanne
AU - Müller-Myhsok, Bertram
AU - Rietschel, Marcella
AU - Theis, Fabian J.
AU - Binder, Elisabeth B.
AU - Mueller, Nikola S.
N1 - Publisher Copyright:
© 2020 Arloth et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2020/2
Y1 - 2020/2
N2 - Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe “DeepWAS”, a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. DSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/ cellmapslab/DeepWAS.
AB - Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe “DeepWAS”, a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. DSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/ cellmapslab/DeepWAS.
UR - http://www.scopus.com/inward/record.url?scp=85079346794&partnerID=8YFLogxK
U2 - 10.1371/JOURNAL.PCBI.1007616
DO - 10.1371/JOURNAL.PCBI.1007616
M3 - Article
C2 - 32012148
AN - SCOPUS:85079346794
SN - 1553-734X
VL - 16
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 2
M1 - e1007616
ER -