TY - JOUR
T1 - CIARA
T2 - a cluster-independent algorithm for identifying markers of rare cell types from single-cell sequencing data
AU - Lubatti, Gabriele
AU - Stock, Marco
AU - Iturbide, Ane
AU - Ruiz Tejada Segura, Mayra L.
AU - Riepl, Melina
AU - Tyser, Richard C.V.
AU - Danese, Anna
AU - Colomé-Tatché, Maria
AU - Theis, Fabian J.
AU - Srinivas, Shankar
AU - Torres-Padilla, Maria Elena
AU - Scialdone, Antonio
N1 - Publisher Copyright:
© 2023. Published by The Company of Biologists Ltd.
PY - 2023/6
Y1 - 2023/6
N2 - A powerful feature of single-cell genomics is the possibility of identifying cell types from their molecular profiles. In particular, identifying novel rare cell types and their marker genes is a key potential of single-cell RNA sequencing. Standard clustering approaches perform well in identifying relatively abundant cell types, but tend to miss rarer cell types. Here, we have developed CIARA (Cluster Independent Algorithm for the identification of markers of RAre cell types), a cluster-independent computational tool designed to select genes that are likely to be markers of rare cell types. Genes selected by CIARA are subsequently integrated with common clustering algorithms to single out groups of rare cell types. CIARA outperforms existing methods for rare cell type detection, and we use it to find previously uncharacterized rare populations of cells in a human gastrula and among mouse embryonic stem cells treated with retinoic acid. Moreover, CIARA can be applied more generally to any type of single-cell omic data, thus allowing the identification of rare cells across multiple data modalities. We provide implementations of CIARA in user-friendly packages available in R and Python.
AB - A powerful feature of single-cell genomics is the possibility of identifying cell types from their molecular profiles. In particular, identifying novel rare cell types and their marker genes is a key potential of single-cell RNA sequencing. Standard clustering approaches perform well in identifying relatively abundant cell types, but tend to miss rarer cell types. Here, we have developed CIARA (Cluster Independent Algorithm for the identification of markers of RAre cell types), a cluster-independent computational tool designed to select genes that are likely to be markers of rare cell types. Genes selected by CIARA are subsequently integrated with common clustering algorithms to single out groups of rare cell types. CIARA outperforms existing methods for rare cell type detection, and we use it to find previously uncharacterized rare populations of cells in a human gastrula and among mouse embryonic stem cells treated with retinoic acid. Moreover, CIARA can be applied more generally to any type of single-cell omic data, thus allowing the identification of rare cells across multiple data modalities. We provide implementations of CIARA in user-friendly packages available in R and Python.
KW - Computational method
KW - Rare cell types
KW - Single-cell sequencing
UR - http://www.scopus.com/inward/record.url?scp=85163907872&partnerID=8YFLogxK
U2 - 10.1242/dev.201264
DO - 10.1242/dev.201264
M3 - Article
C2 - 37294170
AN - SCOPUS:85163907872
SN - 0950-1991
VL - 150
JO - Development (Cambridge)
JF - Development (Cambridge)
IS - 11
M1 - dev201264
ER -