Characterizing cell subsets using marker enrichment modeling - PubMed (original) (raw)
Characterizing cell subsets using marker enrichment modeling
Kirsten E Diggins et al. Nat Methods. 2017 Mar.
Abstract
Learning cell identity from high-content single-cell data presently relies on human experts. We present marker enrichment modeling (MEM), an algorithm that objectively describes cells by quantifying contextual feature enrichment and reporting a human- and machine-readable text label. MEM outperforms traditional metrics in describing immune and cancer cell subsets from fluorescence and mass cytometry. MEM provides a quantitative language to communicate characteristics of new and established cytotypes observed in complex tissues.
Figures
Figure 1. Marker enrichment modeling (MEM) automatically labels human blood cell populations in Dataset A
a) Cells from normal human blood were previously grouped into 7 canonical populations using viSNE analysis and expert review of 25D mass cytometry data. b) MEM labels were computationally generated for each canonical cell subset using the other six populations as reference. The population labeled by immunologists as “CD4+ T cells” was labeled by MEM as ▲CD4+6 CD3+5 ▼CD8a−4 CD16−3 and comprised 48.72% of PBMC in this sample. In contrast, the MEM label ▲CD16+9 CD56+2 CD11c+2▼CD4−7 CD3−4 CD44−3 was generated for the population gated as “NK cells”. Heatmaps show protein enrichment values used to generate MEM labels and the median protein expression values for each protein on each cell subset. Variability in protein expression across the 7 canonical cell populations is shown below to highlight proteins that were expressed homogeneously (low variability, e.g. CD45) and those that were expressed heterogeneously (high variability, e.g. CD8a, CD4). c) Graphs show decreasing f-measure (clustering accuracy) as markers were excluded from k-means cluster analysis based on high to low absolute MEM or median values, compared to random exclusion.
Figure 2. Hierarchical clustering based solely on MEM label groups T cells and B cells measured in diverse studies using different cytometry platforms
A) MEM label values were compared for each of 80 populations (CD4+ T cells and B cells) from 3 human tissues representing 6 mass cytometry studies and 1 fluorescence flow cytometry study. The normalized RMSD (i.e. similarity) for two populations was 100% when MEM label exponents were identical for all of the shared proteins. Populations are shown clustered according to MEM label percent similarity. Tissue type, source study (numbered 1-7 and referenced in online methods), and individual sample IDs are indicated to the right. *indicates samples stimulated by bacterial superantigen Staphlococcus enterotoxin B (SEB). B) Representative MEM labels for CD4+ T cells (top) and B cells (bottom) from SEB-stimulated normal human blood (1.4, top, mass cytometry), normal human bone marrow (5, mass cytometry), normal human tonsil (2.5, mass cytometry), SEB-stimulated normal human blood (1.4, bottom, fluorescence flow cytometry), and normal human blood (6.1, mass cytometry).
Figure 3. MEM correctly grouped immune and cancer cell populations from glioma tumors using nine proteins expressed on cancer cells in Dataset D
(A) A heatmap of MEM enrichment scores is shown for 52 populations of cells identified in tumors from 4 glioblastoma patients (G-08, G-10, G-11, G22) in an unsupervised manner using viSNE. MEM scores were then calculated based only on the nine measured proteins expected to be expressed on cancer cells (S100B, TJF1, GFAP, Nestin, MET, PGFRα, HLA-DR, and CD44). (B) Each population was annotated for a cell type based on review of the MEM label and classified as tumor infiltrating APCs (blue), tumor infiltrating T cells (green), or non-immune tumor cells (red). (C) A heatmap of median intensity values is shown for the 13 measured proteins from each of the 52 tumor cell populations. Expression of CD45, CD3, and CD64 was used to assess the respective identity of leukocytes, T cells, and antigen presenting cells.
Similar articles
- Generating Quantitative Cell Identity Labels with Marker Enrichment Modeling (MEM).
Diggins KE, Gandelman JS, Roe CE, Irish JM. Diggins KE, et al. Curr Protoc Cytom. 2018 Jan 18;83:10.21.1-10.21.28. doi: 10.1002/cpcy.34. Curr Protoc Cytom. 2018. PMID: 29345329 Free PMC article. - Supervised Machine Learning with CITRUS for Single Cell Biomarker Discovery.
Polikowsky HG, Drake KA. Polikowsky HG, et al. Methods Mol Biol. 2019;1989:309-332. doi: 10.1007/978-1-4939-9454-0_20. Methods Mol Biol. 2019. PMID: 31077114 Free PMC article. - Development, application and computational analysis of high-dimensional fluorescent antibody panels for single-cell flow cytometry.
Brummelman J, Haftmann C, Núñez NG, Alvisi G, Mazza EMC, Becher B, Lugli E. Brummelman J, et al. Nat Protoc. 2019 Jul;14(7):1946-1969. doi: 10.1038/s41596-019-0166-2. Epub 2019 Jun 3. Nat Protoc. 2019. PMID: 31160786 - Deep Profiling Human T Cell Heterogeneity by Mass Cytometry.
Cheng Y, Newell EW. Cheng Y, et al. Adv Immunol. 2016;131:101-34. doi: 10.1016/bs.ai.2016.02.002. Epub 2016 Apr 8. Adv Immunol. 2016. PMID: 27235682 Review. - Data-Driven Flow Cytometry Analysis.
Wang S, Brinkman RR. Wang S, et al. Methods Mol Biol. 2019;1989:245-265. doi: 10.1007/978-1-4939-9454-0_16. Methods Mol Biol. 2019. PMID: 31077110 Free PMC article. Review.
Cited by
- High-Dimensional Analysis Reveals Distinct Endotypes in Patients With Idiopathic Inflammatory Myopathies.
Wilfong EM, Bartkowiak T, Vowell KN, Westlake CS, Irish JM, Kendall PL, Crofford LJ, Bonami RH. Wilfong EM, et al. Front Immunol. 2022 Feb 21;13:756018. doi: 10.3389/fimmu.2022.756018. eCollection 2022. Front Immunol. 2022. PMID: 35371068 Free PMC article. - diffcyt: Differential discovery in high-dimensional cytometry via high-resolution clustering.
Weber LM, Nowicka M, Soneson C, Robinson MD. Weber LM, et al. Commun Biol. 2019 May 14;2:183. doi: 10.1038/s42003-019-0415-5. eCollection 2019. Commun Biol. 2019. PMID: 31098416 Free PMC article. - Multiparametric Single-Vesicle Flow Cytometry Resolves Extracellular Vesicle Heterogeneity and Reveals Selective Regulation of Biogenesis and Cargo Distribution.
von Lersner AK, Fernandes F, Ozawa PMM, Jackson M, Masureel M, Ho H, Lima SM, Vagner T, Sung BH, Wehbe M, Franze K, Pua H, Wilson JT, Irish JM, Weaver AM, Di Vizio D, Zijlstra A. von Lersner AK, et al. ACS Nano. 2024 Apr 16;18(15):10464-10484. doi: 10.1021/acsnano.3c11561. Epub 2024 Apr 5. ACS Nano. 2024. PMID: 38578701 Free PMC article. - Preparing Viable Single Cells from Human Tissue and Tumors for Cytomic Analysis.
Leelatian N, Doxie DB, Greenplate AR, Sinnaeve J, Ihrie RA, Irish JM. Leelatian N, et al. Curr Protoc Mol Biol. 2017 Apr 3;118:25C.1.1-25C.1.23. doi: 10.1002/cpmb.37. Curr Protoc Mol Biol. 2017. PMID: 28369679 Free PMC article. - The SWI/SNF chromatin-remodeling subunit DPF2 facilitates NRF2-dependent antiinflammatory and antioxidant gene expression.
Mas G, Man N, Nakata Y, Martinez-Caja C, Karl D, Beckedorff F, Tamiro F, Chen C, Duffort S, Itonaga H, Mookhtiar AK, Kunkalla K, Valencia AM, Collings CK, Kadoch C, Vega F, Kogan SC, Shiekhattar R, Morey L, Bilbao D, Nimer SD. Mas G, et al. J Clin Invest. 2023 Jul 3;133(13):e158419. doi: 10.1172/JCI158419. J Clin Invest. 2023. PMID: 37200093 Free PMC article.
References
- Saeys Y, Gassen SV, Lambrecht BN. Computational flow cytometry: helping to make sense of high-dimensional immunology data. Nature reviews. Immunology. 2016;16:449–462. doi:10.1038/nri.2016.56. - PubMed
- Becher B, et al. High-dimensional analysis of the murine myeloid cell system. Nature immunology. 2014;15:1181–1189. doi:10.1038/ni.3006. - PubMed
Methods-only References
- Civin CI, et al. Antigenic analysis of hematopoiesis. III. A hematopoietic progenitor cell surface antigen defined by a monoclonal antibody raised against KG-1a cells. Journal of immunology. 1984;133:157–165. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- F31 CA199993/CA/NCI NIH HHS/United States
- R25 CA136440/CA/NCI NIH HHS/United States
- P30 CA068485/CA/NCI NIH HHS/United States
- U01 CA196405/CA/NCI NIH HHS/United States
- R00 CA143231/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical