J Immunol. 2025 Aug 21:vkaf183. doi: 10.1093/jimmun/vkaf183. Online ahead of print.
ABSTRACT
Accurate identification of immune cell subsets in single-cell RNA sequencing (scRNA-seq) data is critical for understanding immune responses in autoimmune diseases, infections, and cancer. One caveat of scRNA-seq is the inability to properly assign rare immune cell subsets due to gene dropout events. To circumvent this caveat, we here developed optimized detection and inference of names in scRNA-seq data (scODIN). scODIN uses an informed holistic 2-step approach combining expert knowledge with machine learning to rapidly assign cell type identities to large scRNA-seq datasets. First, scODIN uses key lineage-defining markers to identify a set of core cell types. Second, scODIN compensates for dropout events by integrating a k-nearest neighbors algorithm. We additionally programmed scODIN to detect dual and transitional phenotypes, which are usually overlooked in conventional analyses. Consequently, scODIN may enhance our understanding of immune cell heterogeneity and provide comprehensive insights into immune regulation, with broad implications for immunology and personalized medicine.
PMID:40839860 | DOI:10.1093/jimmun/vkaf183