TY - GEN
T1 - TarDis
T2 - 29th International Conference on Research in Computational Molecular Biology, RECOMB 2025
AU - Inecik, Kemal
AU - Kara, Aleyna
AU - Rose, Antony
AU - Haniffa, Muzlifah
AU - Theis, Fabian J.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - Addressing challenges in domain invariance within single-cell genomics necessitates innovative strategies to manage the heterogeneity of multi-source datasets while maintaining the integrity of biological signals. We introduce TarDis, a novel deep generative model designed to disentangle intricate covariate structures across diverse biological datasets, distinguishing technical artifacts from true biological variations. By employing tailored covariate-specific loss components and a self-supervised approach, TarDis effectively generates multiple latent space representations that capture each continuous and categorical target covariate separately, along with unexplained variation. Our extensive evaluations demonstrate that TarDis outperforms existing methods in data integration, covariate disentanglement, and robust out-of-distribution predictions. The model’s capacity to produce interpretable and structured latent spaces, including its pioneering work in ordered latent representations for continuous covariates, markedly enhances its utility in hypothesis-driven research. Consequently, TarDis offers a promising analytical platform for advancing scientific discovery, providing insights into cellular dynamics, and enabling targeted therapeutic interventions (The full paper can be accessed at: https://doi.org/10.1101/2024.06.20.599903).
AB - Addressing challenges in domain invariance within single-cell genomics necessitates innovative strategies to manage the heterogeneity of multi-source datasets while maintaining the integrity of biological signals. We introduce TarDis, a novel deep generative model designed to disentangle intricate covariate structures across diverse biological datasets, distinguishing technical artifacts from true biological variations. By employing tailored covariate-specific loss components and a self-supervised approach, TarDis effectively generates multiple latent space representations that capture each continuous and categorical target covariate separately, along with unexplained variation. Our extensive evaluations demonstrate that TarDis outperforms existing methods in data integration, covariate disentanglement, and robust out-of-distribution predictions. The model’s capacity to produce interpretable and structured latent spaces, including its pioneering work in ordered latent representations for continuous covariates, markedly enhances its utility in hypothesis-driven research. Consequently, TarDis offers a promising analytical platform for advancing scientific discovery, providing insights into cellular dynamics, and enabling targeted therapeutic interventions (The full paper can be accessed at: https://doi.org/10.1101/2024.06.20.599903).
KW - disentanglement
KW - generative models
KW - representation learning
KW - self-supervised learning
KW - single-cell genomics
UR - http://www.scopus.com/inward/record.url?scp=105004254705&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-90252-9_23
DO - 10.1007/978-3-031-90252-9_23
M3 - Conference contribution
AN - SCOPUS:105004254705
SN - 9783031902512
T3 - Lecture Notes in Computer Science
SP - 285
EP - 289
BT - Research in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings
A2 - Sankararaman, Sriram
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 26 April 2025 through 29 April 2025
ER -