TY - JOUR
T1 - Multimodal single cell data integration challenge
T2 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
AU - NeurIPS 2021 Multimodal data integration competition participants
AU - Lance, Christopher
AU - Luecken, Malte D.
AU - Burkhardt, Daniel B.
AU - Cannoodt, Robrecht
AU - Rautenstrauch, Pia
AU - Laddach, Anna
AU - Ubingazhibov, Aidyn
AU - Cao, Zhi Jie
AU - Deng, Kaiwen
AU - Khan, Sumeer
AU - Liu, Qiao
AU - Russkikh, Nikolay
AU - Ryazantsev, Gleb
AU - Ohler, Uwe
AU - Pisco, Angela Oliveira
AU - Bloom, Jonathan
AU - Krishnaswamy, Smita
AU - Theis, Fabian J.
N1 - Publisher Copyright:
© 2022 C. Lance et al.
PY - 2022
Y1 - 2022
N2 - Biology has become a data-intensive science. Recent technological advances in single-cell genomics have enabled the measurement of multiple facets of cellular state, producing datasets with millions of single-cell observations. While these data hold great promise for understanding molecular mechanisms in health and disease, analysis challenges arising from sparsity, technical and biological variability, and high dimensionality of the data hinder the derivation of such mechanistic insights. To promote the innovation of algorithms for analysis of multimodal single-cell data, we organized a competition at NeurIPS 2021 applying the Common Task Framework to multimodal single-cell data integration. For this competition we generated the first multimodal benchmarking dataset for single-cell biology and defined three tasks in this domain: prediction of missing modalities, aligning modalities, and learning a joint representation across modalities. We further specified evaluation metrics and developed a cloud-based algorithm evaluation pipeline. Using this setup, 280 competitors submitted over 2600 proposed solutions within a 3 month period, showcasing substantial innovation especially in the modality alignment task. Here, we present the results, describe trends of well performing approaches, and discuss challenges associated with running the competition.
AB - Biology has become a data-intensive science. Recent technological advances in single-cell genomics have enabled the measurement of multiple facets of cellular state, producing datasets with millions of single-cell observations. While these data hold great promise for understanding molecular mechanisms in health and disease, analysis challenges arising from sparsity, technical and biological variability, and high dimensionality of the data hinder the derivation of such mechanistic insights. To promote the innovation of algorithms for analysis of multimodal single-cell data, we organized a competition at NeurIPS 2021 applying the Common Task Framework to multimodal single-cell data integration. For this competition we generated the first multimodal benchmarking dataset for single-cell biology and defined three tasks in this domain: prediction of missing modalities, aligning modalities, and learning a joint representation across modalities. We further specified evaluation metrics and developed a cloud-based algorithm evaluation pipeline. Using this setup, 280 competitors submitted over 2600 proposed solutions within a 3 month period, showcasing substantial innovation especially in the modality alignment task. Here, we present the results, describe trends of well performing approaches, and discuss challenges associated with running the competition.
KW - benchmarking datasets
KW - big data integration
KW - computational biology
KW - multimodal
KW - multiomics
KW - single-cell genomics
UR - http://www.scopus.com/inward/record.url?scp=85152379730&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85152379730
SN - 2640-3498
VL - 176
SP - 162
EP - 176
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 6 December 2021 through 14 December 2021
ER -