TY - JOUR
T1 - RNA2DNAlign
T2 - Nucleotide resolution allele asymmetries through quantitative assessment of RNA and DNA paired sequencing data
AU - Movassagh, Mercedeh
AU - Alomran, Nawaf
AU - Mudvari, Prakriti
AU - Dede, Merve
AU - Dede, Cem
AU - Kowsari, Kamran
AU - Restrepo, Paula
AU - Cauley, Edmund
AU - Bahl, Sonali
AU - Li, Muzi
AU - Waterhouse, Wesley
AU - Tsaneva-Atanasova, Krasimira
AU - Edwards, Nathan
AU - Horvath, Anelia
N1 - Publisher Copyright:
© The Author(s) 2016.
PY - 2016/12
Y1 - 2016/12
N2 - We introduce RNA2DNAlign, a computational framework for quantitative assessment of allele counts across paired RNA and DNA sequencing datasets. RNA2DNAlign is based on quantitation of the relative abundance of variant and reference read counts, followed by binomial tests for genotype and allelic status at SNV positions between compatible sequences. RNA2DNAlign detects positions with differential allele distribution, suggesting asymmetries due to regulatory/structural events. Based on the type of asymmetry, RNA2DNAlign outlines positions likely to be implicated in RNA editing, allele-specific expression or loss, somatic mutagenesis or loss-ofheterozygosity (the first three also in a tumor-specific setting). We applied RNA2DNAlign on 360 matching normal and tumor exomes and transcriptomes from 90 breast cancer patients from TCGA. Under highconfidence settings, RNA2DNAlign identified 2038 distinct SNV sites associated with one of the aforementioned asymetries, the majority of which have not been linked to functionality before. The performance assessment shows very high specificity and sensitivity, due to the corroboration of signals across multiple matching datasets. RNA2DNAlign is freely available from http://github.com/HorvathLab/NGS as a self-contained binary package for 64-bit Linux systems.
AB - We introduce RNA2DNAlign, a computational framework for quantitative assessment of allele counts across paired RNA and DNA sequencing datasets. RNA2DNAlign is based on quantitation of the relative abundance of variant and reference read counts, followed by binomial tests for genotype and allelic status at SNV positions between compatible sequences. RNA2DNAlign detects positions with differential allele distribution, suggesting asymmetries due to regulatory/structural events. Based on the type of asymmetry, RNA2DNAlign outlines positions likely to be implicated in RNA editing, allele-specific expression or loss, somatic mutagenesis or loss-ofheterozygosity (the first three also in a tumor-specific setting). We applied RNA2DNAlign on 360 matching normal and tumor exomes and transcriptomes from 90 breast cancer patients from TCGA. Under highconfidence settings, RNA2DNAlign identified 2038 distinct SNV sites associated with one of the aforementioned asymetries, the majority of which have not been linked to functionality before. The performance assessment shows very high specificity and sensitivity, due to the corroboration of signals across multiple matching datasets. RNA2DNAlign is freely available from http://github.com/HorvathLab/NGS as a self-contained binary package for 64-bit Linux systems.
UR - http://www.scopus.com/inward/record.url?scp=85016136193&partnerID=8YFLogxK
U2 - 10.1093/nar/gkw757
DO - 10.1093/nar/gkw757
M3 - Article
C2 - 27576531
AN - SCOPUS:85016136193
SN - 0305-1048
VL - 44
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 22
M1 - e161
ER -