TY - JOUR
T1 - Transmicron
T2 - Accurate prediction of insertion probabilities improves detection of cancer driver genes from transposon mutagenesis screens
AU - Bredthauer, Carl
AU - Fischer, Anja
AU - Ahari, Ata Jadid
AU - Cao, Xueqi
AU - Weber, Julia
AU - Rad, Lena
AU - Rad, Roland
AU - Wachutka, Leonhard
AU - Gagneur, Julien
N1 - Publisher Copyright:
© 2023 The Author(s). Published by Oxford University Press on behalf of Nucleic Acids Research.
PY - 2023/2/28
Y1 - 2023/2/28
N2 - Transposon screens are powerful in vivo assays used to identify loci driving carcinogenesis. These loci are identified as Common Insertion Sites (CISs), i.e. regions with more transposon insertions than expected by chance. However, the identification of CISs is affected by biases in the insertion behaviour of transposon systems. Here, we introduce Transmicron, a novel method that differs from previous methods by (i) modelling neutral insertion rates based on chromatin accessibility, transcriptional activity and sequence context and (ii) estimating oncogenic selection for each genomic region using Poisson regression to model insertion counts while controlling for neutral insertion rates. To assess the benefits of our approach, we generated a dataset applying two different transposon systems under comparable conditions. Benchmarking for enrichment of known cancer genes showed improved performance of Transmicron against state-of-The-Art methods. Modelling neutral insertion rates allowed for better control of false positives and stronger agreement of the results between transposon systems. Moreover, using Poisson regression to consider intra-sample and inter-sample information proved beneficial in small and moderately-sized datasets. Transmicron is open-source and freely available. Overall, this study contributes to the understanding of transposon biology and introduces a novel approach to use this knowledge for discovering cancer driver genes.
AB - Transposon screens are powerful in vivo assays used to identify loci driving carcinogenesis. These loci are identified as Common Insertion Sites (CISs), i.e. regions with more transposon insertions than expected by chance. However, the identification of CISs is affected by biases in the insertion behaviour of transposon systems. Here, we introduce Transmicron, a novel method that differs from previous methods by (i) modelling neutral insertion rates based on chromatin accessibility, transcriptional activity and sequence context and (ii) estimating oncogenic selection for each genomic region using Poisson regression to model insertion counts while controlling for neutral insertion rates. To assess the benefits of our approach, we generated a dataset applying two different transposon systems under comparable conditions. Benchmarking for enrichment of known cancer genes showed improved performance of Transmicron against state-of-The-Art methods. Modelling neutral insertion rates allowed for better control of false positives and stronger agreement of the results between transposon systems. Moreover, using Poisson regression to consider intra-sample and inter-sample information proved beneficial in small and moderately-sized datasets. Transmicron is open-source and freely available. Overall, this study contributes to the understanding of transposon biology and introduces a novel approach to use this knowledge for discovering cancer driver genes.
UR - http://www.scopus.com/inward/record.url?scp=85149187072&partnerID=8YFLogxK
U2 - 10.1093/nar/gkac1215
DO - 10.1093/nar/gkac1215
M3 - Article
C2 - 36617985
AN - SCOPUS:85149187072
SN - 0305-1048
VL - 51
SP - E21-E21
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 4
ER -