Characterizing cell subsets using marker enrichment modeling - PubMed (original) (raw)
Characterizing cell subsets using marker enrichment modeling
Kirsten E Diggins et al. Nat Methods. 2017 Mar.
Abstract
Learning cell identity from high-content single-cell data presently relies on human experts. We present marker enrichment modeling (MEM), an algorithm that objectively describes cells by quantifying contextual feature enrichment and reporting a human- and machine-readable text label. MEM outperforms traditional metrics in describing immune and cancer cell subsets from fluorescence and mass cytometry. MEM provides a quantitative language to communicate characteristics of new and established cytotypes observed in complex tissues.
Figures
Figure 1. Marker enrichment modeling (MEM) automatically labels human blood cell populations in Dataset A
a) Cells from normal human blood were previously grouped into 7 canonical populations using viSNE analysis and expert review of 25D mass cytometry data. b) MEM labels were computationally generated for each canonical cell subset using the other six populations as reference. The population labeled by immunologists as “CD4+ T cells” was labeled by MEM as ▲CD4+6 CD3+5 ▼CD8a−4 CD16−3 and comprised 48.72% of PBMC in this sample. In contrast, the MEM label ▲CD16+9 CD56+2 CD11c+2▼CD4−7 CD3−4 CD44−3 was generated for the population gated as “NK cells”. Heatmaps show protein enrichment values used to generate MEM labels and the median protein expression values for each protein on each cell subset. Variability in protein expression across the 7 canonical cell populations is shown below to highlight proteins that were expressed homogeneously (low variability, e.g. CD45) and those that were expressed heterogeneously (high variability, e.g. CD8a, CD4). c) Graphs show decreasing f-measure (clustering accuracy) as markers were excluded from k-means cluster analysis based on high to low absolute MEM or median values, compared to random exclusion.
Figure 2. Hierarchical clustering based solely on MEM label groups T cells and B cells measured in diverse studies using different cytometry platforms
A) MEM label values were compared for each of 80 populations (CD4+ T cells and B cells) from 3 human tissues representing 6 mass cytometry studies and 1 fluorescence flow cytometry study. The normalized RMSD (i.e. similarity) for two populations was 100% when MEM label exponents were identical for all of the shared proteins. Populations are shown clustered according to MEM label percent similarity. Tissue type, source study (numbered 1-7 and referenced in online methods), and individual sample IDs are indicated to the right. *indicates samples stimulated by bacterial superantigen Staphlococcus enterotoxin B (SEB). B) Representative MEM labels for CD4+ T cells (top) and B cells (bottom) from SEB-stimulated normal human blood (1.4, top, mass cytometry), normal human bone marrow (5, mass cytometry), normal human tonsil (2.5, mass cytometry), SEB-stimulated normal human blood (1.4, bottom, fluorescence flow cytometry), and normal human blood (6.1, mass cytometry).
Figure 3. MEM correctly grouped immune and cancer cell populations from glioma tumors using nine proteins expressed on cancer cells in Dataset D
(A) A heatmap of MEM enrichment scores is shown for 52 populations of cells identified in tumors from 4 glioblastoma patients (G-08, G-10, G-11, G22) in an unsupervised manner using viSNE. MEM scores were then calculated based only on the nine measured proteins expected to be expressed on cancer cells (S100B, TJF1, GFAP, Nestin, MET, PGFRα, HLA-DR, and CD44). (B) Each population was annotated for a cell type based on review of the MEM label and classified as tumor infiltrating APCs (blue), tumor infiltrating T cells (green), or non-immune tumor cells (red). (C) A heatmap of median intensity values is shown for the 13 measured proteins from each of the 52 tumor cell populations. Expression of CD45, CD3, and CD64 was used to assess the respective identity of leukocytes, T cells, and antigen presenting cells.
Similar articles
- Generating Quantitative Cell Identity Labels with Marker Enrichment Modeling (MEM).
Diggins KE, Gandelman JS, Roe CE, Irish JM. Diggins KE, et al. Curr Protoc Cytom. 2018 Jan 18;83:10.21.1-10.21.28. doi: 10.1002/cpcy.34. Curr Protoc Cytom. 2018. PMID: 29345329 Free PMC article. - Supervised Machine Learning with CITRUS for Single Cell Biomarker Discovery.
Polikowsky HG, Drake KA. Polikowsky HG, et al. Methods Mol Biol. 2019;1989:309-332. doi: 10.1007/978-1-4939-9454-0_20. Methods Mol Biol. 2019. PMID: 31077114 Free PMC article. - Development, application and computational analysis of high-dimensional fluorescent antibody panels for single-cell flow cytometry.
Brummelman J, Haftmann C, Núñez NG, Alvisi G, Mazza EMC, Becher B, Lugli E. Brummelman J, et al. Nat Protoc. 2019 Jul;14(7):1946-1969. doi: 10.1038/s41596-019-0166-2. Epub 2019 Jun 3. Nat Protoc. 2019. PMID: 31160786 - Deep Profiling Human T Cell Heterogeneity by Mass Cytometry.
Cheng Y, Newell EW. Cheng Y, et al. Adv Immunol. 2016;131:101-34. doi: 10.1016/bs.ai.2016.02.002. Epub 2016 Apr 8. Adv Immunol. 2016. PMID: 27235682 Review. - Data-Driven Flow Cytometry Analysis.
Wang S, Brinkman RR. Wang S, et al. Methods Mol Biol. 2019;1989:245-265. doi: 10.1007/978-1-4939-9454-0_16. Methods Mol Biol. 2019. PMID: 31077110 Free PMC article. Review.
Cited by
- Immune profiling of patients with extranodal natural killer/T cell lymphoma treated with daratumumab.
Qing M, Zhou T, Perova T, Abraham Y, Sweeney C, Krevvata M, Zhang X, Qi M, Gao G, Kim TM, Yao M, Cho SG, Eom HS, Lim ST, Yeh SP, Kwong YL, Yoon DH, Kim JS, Kim WS, Zhou L, Attar R, Verona RI. Qing M, et al. Ann Hematol. 2024 Jun;103(6):1989-2001. doi: 10.1007/s00277-023-05603-w. Epub 2024 Jan 18. Ann Hematol. 2024. PMID: 38233570 Free PMC article. Clinical Trial. - HBV infection-induced liver cirrhosis development in dual-humanised mice with human bone mesenchymal stem cell transplantation.
Yuan L, Jiang J, Liu X, Zhang Y, Zhang L, Xin J, Wu K, Li X, Cao J, Guo X, Shi D, Li J, Jiang L, Sun S, Wang T, Hou W, Zhang T, Zhu H, Zhang J, Yuan Q, Cheng T, Li J, Xia N. Yuan L, et al. Gut. 2019 Nov;68(11):2044-2056. doi: 10.1136/gutjnl-2018-316091. Epub 2019 Jan 30. Gut. 2019. PMID: 30700543 Free PMC article. - Generating Quantitative Cell Identity Labels with Marker Enrichment Modeling (MEM).
Diggins KE, Gandelman JS, Roe CE, Irish JM. Diggins KE, et al. Curr Protoc Cytom. 2018 Jan 18;83:10.21.1-10.21.28. doi: 10.1002/cpcy.34. Curr Protoc Cytom. 2018. PMID: 29345329 Free PMC article. - CD28 costimulation drives tumor-infiltrating T cell glycolysis to promote inflammation.
Beckermann KE, Hongo R, Ye X, Young K, Carbonell K, Healey DCC, Siska PJ, Barone S, Roe CE, Smith CC, Vincent BG, Mason FM, Irish JM, Rathmell WK, Rathmell JC. Beckermann KE, et al. JCI Insight. 2020 Aug 20;5(16):e138729. doi: 10.1172/jci.insight.138729. JCI Insight. 2020. PMID: 32814710 Free PMC article. - CytoPheno: Automated descriptive cell type naming in flow and mass cytometry.
Tursi AR, Lages CS, Quayle K, Koenig ZT, Loni R, Eswar S, Cobeña-Reyes J, Thornton S, Tilburgs T, Andorf S. Tursi AR, et al. bioRxiv [Preprint]. 2025 Mar 14:2025.03.11.639902. doi: 10.1101/2025.03.11.639902. bioRxiv. 2025. PMID: 40161808 Free PMC article. Preprint.
References
- Saeys Y, Gassen SV, Lambrecht BN. Computational flow cytometry: helping to make sense of high-dimensional immunology data. Nature reviews. Immunology. 2016;16:449–462. doi:10.1038/nri.2016.56. - PubMed
- Becher B, et al. High-dimensional analysis of the murine myeloid cell system. Nature immunology. 2014;15:1181–1189. doi:10.1038/ni.3006. - PubMed
Methods-only References
- Civin CI, et al. Antigenic analysis of hematopoiesis. III. A hematopoietic progenitor cell surface antigen defined by a monoclonal antibody raised against KG-1a cells. Journal of immunology. 1984;133:157–165. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- F31 CA199993/CA/NCI NIH HHS/United States
- R25 CA136440/CA/NCI NIH HHS/United States
- P30 CA068485/CA/NCI NIH HHS/United States
- U01 CA196405/CA/NCI NIH HHS/United States
- R00 CA143231/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical