Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells - PubMed (original) (raw)

Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells

Harlan S Robins et al. Blood. 2009.

Abstract

The adaptive immune system uses several strategies to generate a repertoire of T- and B-cell antigen receptors with sufficient diversity to recognize the universe of potential pathogens. In alphabeta T cells, which primarily recognize peptide antigens presented by major histocompatibility complex molecules, most of this receptor diversity is contained within the third complementarity-determining region (CDR3) of the T-cell receptor (TCR) alpha and beta chains. Although it has been estimated that the adaptive immune system can generate up to 10(16) distinct alphabeta pairs, direct assessment of TCR CDR3 diversity has not proved amenable to standard capillary electrophoresis-based DNA sequencing. We developed a novel experimental and computational approach to measure TCR CDR3 diversity based on single-molecule DNA sequencing, and used this approach to determine the CDR3 sequence in millions of rearranged TCRbeta genes from T cells of 2 adults. We find that total TCRbeta receptor diversity is at least 4-fold higher than previous estimates, and the diversity in the subset of CD45RO(+) antigen-experienced alphabeta T cells is at least 10-fold higher than previous estimates. These methods should prove valuable for assessment of alphabeta T-cell repertoire diversity after hematopoietic cell transplantation, in states of congenital or acquired immunodeficiency, and during normal aging.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Strategy for PCR amplification, hybridization, and sequencing of rearranged _TCR_β CDR3 regions. A generic rearranged _TCR_β CDR3 region PCR product is shown, indicating the constituent Vβ segment, Dβ segment, Jβ segment, and the nontemplated nucleotides inserted at the Vβ-Dβ and Dβ-Jβ junctions. Universal adapter sequences that permit solid-phase PCR on the Illumina Genome Analyzer Cluster Station (GA F and GA R) are incorporated into the 5′ and 3′ ends of the PCR products that capture each rearranged _TCR_β CDR3 region. Forty-five forward primers were designed, each specific to a single functional Vβ segment or a small family of Vβ segments. The 3′ end of each Vβ forward primer is anchored at position −43 in the Vβ segment, relative to the recombination signal sequence, thereby providing a unique Vβ tag sequence within the amplified region. Vβ forward primers were designed for all known nonpseudogenes in the _TCR_β locus. The 13 reverse primers specific to each Jβ segment are anchored in the 3′ intron, with the 3′ end of each primer crossing the intron/exon junction. The Jβ reverse primers were designed to be anchored at their 3′ ends on a consensus splice site motif to minimize overlap with the sequencing primers. Thirteen sequencing primers were designed that are complementary to the amplified portion of the Jβ segment, such that the first few bases of sequence generated will capture the unique Jβ tag sequence. The sequencing primers were designed so that promiscuous priming of a sequencing reaction for one J segment by a primer specific to another J segment would generate sequence data starting at exactly the same nucleotide as sequence data from the correct sequencing primer.

Figure 2

Figure 2

Observed _TCR_β CDR3 sequence copy number per 5 mL whole blood. Frequency histograms of _TCR_β CDR3 sequences observed in 4 different T-cell subsets distinguished by expression of CD4, CD8, and CD45RO and present in 5 mL blood of one male donor. For example, the square at 200,10 means that 10 unique sequences were each observed 200 times in the CD4+CD45RO+ (antigen-experienced) T-cell sample. The data were resampled from the sequences generated by the Genome Analyzer to approximate the expected CDR3 sequence distribution in the T cells present in 5 mL blood, as determined by flow cytometry. A small set of sequences found in the CD45RO+ compartments were found with very large copy number (> 10 000 copies) but are not displayed.

Figure 3

Figure 3

Assessment of PCR bias. The rearranged _TCR_β CDR3 regions present in approximately 30 000 T-cell genomes were amplified through 25 cycles of PCR, and the PCR products were split into 2 pools. One pool was amplified an additional 15 cycles, and then the PCR products from the 25-cycle and 40-cycle reactions were sequenced in separate lanes of a GA1 flow cell. Of the _TCR_β CDR3 sequences observed in the 25-cycle PCR lane, 97% were also observed in the 40-cycle PCR lane. Each point on the graph represents a single unique CDR3 sequence, plotted according to the number of times that sequence was observed in the data from 25-cycle (abscissa) and 40-cycle (ordinate) PCR reactions, respectively. The density of sequences at each point in the plot is indicated by color, with purple the highest density and red the lowest. The solid line represents a linear regression of the data, and the dotted lines 1 SD above and below the mean.

Figure 4

Figure 4

Jβgene segment use in 4 different T-cell compartments. Jβ gene segment use of _TCR_β CDR3 sequences observed in the 4 different flow cytometrically defined T-cell compartments from donor 1.

Figure 5

Figure 5

Relative abundance of unique _TCR_β CDR3 sequences correlates inversely with divergence from germline. (A) Observed frequency (top panels) and average observed frequency (bottom panels) of _TCR_β CDR3 sequences in the CD8+CD45RO+/− and CD4+CD45RO+/− T-cell compartments of 2 male donors plotted, from left to right, according to their Jβ and Vβ gene segment use, CDR3 length, and total number of (inserted + deleted) nucleotides at the Vβ-Dβ and Dβ-Jβ junctions. (B) Heat map representation of relative abundance of TCRβ CDR3 sequences observed in CD4+ naive, CD4+ memory, and CD8+ naive T-cell compartments of the 2 male donors arrayed according to the number of nucleotides deleted or inserted at the Vβ-Dβ and Dβ-Jβ junctions. Color indicates the log10(observed frequency) of the sequences with the indicated number of inserted or deleted junctional nucleotides.

Figure 6

Figure 6

Direct _TCR_β CDR3 sequencing captures all of the TCR diversity information present in a conventional spectratype. (A) Comparison of standard TCRβ spectratype data and calculated _TCR_β CDR3 length distributions for sequences using representative TCR Vβ gene segments and present in CD4+CD45RO+ cells from male donor 1. CDR3 length is plotted along the x-axis and the number of unique CDR3 sequences with that length (GA sequence data) or the relative intensity of the corresponding peak in the spectratype is plotted along the y-axis. Reducing the information contained in the GA sequence data to a frequency histogram of the unique CDR3 sequences with different lengths within each Vβ family readily reproduces all of the information contained in the spectratype data. The length of the differently colored segments within each bar of the histograms indicates the fraction of unique CDR3 sequences that were observed 1 to 5 times (black), 6 to 10 times (blue), 11 to 100 times (green), or more than 100 times (red). (B) A representative “virtual spectratype” of TCRβ CDR3 sequences extracted from CD4+CD45RO+ T cells from donor 1 that use the Vβ10 gene segment. The CDR3 sequences using Vβ10 were sorted by CDR3 length into a frequency histogram, and the sequences within each length bin were then color-coded on the basis of their Jβ use. The inset shows all of the CDR3 sequences using Vβ10 and Jβ2-6, and having a length of 39 nt, as well as the number of times that each of these sequences was observed in the data. The origin of the nucleotides in each sequence is color-coded as follows: Vβ gene segment, red; template-independent N nucleotide, black; Dβ gene segment, blue; Jβ gene segment, green.

Similar articles

Cited by

References

    1. Rudolph MG, Stanfield RL, Wilson IA. How TCRs bind MHCs, peptides, and coreceptors. Annu Rev Immunol. 2006;24:419–466. - PubMed
    1. Arstila TP, Casrouge A, Baron V, Even J, Kanellopoulos J, Kourilsky P. A direct estimate of the human alphabeta T cell receptor diversity. Science. 1999;286(5441):958–961. - PubMed
    1. Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008;26(10):1135–1145. - PubMed
    1. Fisher RA, Corbet AS, Williams CB. The relation between the number of species and the number of individuals in a random sample of an animal population. J Anim Ecol. 1943;12:42–58.
    1. Efron B, Thisted R. Estimating the number of unseen species: How many words did Shakespeare know? Biometrika. 1976;63(3):435–447.

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources