Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 May;21(5):790-7.
doi: 10.1101/gr.115428.110. Epub 2011 Feb 24.

Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes

Affiliations

Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes

René L Warren et al. Genome Res. 2011 May.

Abstract

Massively parallel sequencing is a useful approach for characterizing T-cell receptor diversity. However, immune receptors are extraordinarily difficult sequencing targets because any given receptor variant may be present in very low abundance and may differ legitimately by only a single nucleotide. We show that the sensitivity of sequence-based repertoire profiling is limited by both sequencing depth and sequencing accuracy. At two timepoints, 1 wk apart, we isolated bulk PBMC plus naïve (CD45RA+/CD45RO-) and memory (CD45RA-/CD45RO+) T-cell subsets from a healthy donor. From T-cell receptor beta chain (TCRB) mRNA we constructed and sequenced multiple libraries to obtain a total of 1.7 billion paired sequence reads. The sequencing error rate was determined empirically and used to inform a high stringency data filtering procedure. The error filtered data yielded 1,061,522 distinct TCRB nucleotide sequences from this subject which establishes a new, directly measured, lower limit on individual T-cell repertoire size and provides a useful reference set of sequences for repertoire analysis. TCRB nucleotide sequences obtained from two additional donors were compared to those from the first donor and revealed limited sharing (up to 1.1%) of nucleotide sequences among donors, but substantially higher sharing (up to 14.2%) of inferred amino acid sequences. For each donor, shared amino acid sequences were encoded by a much larger diversity of nucleotide sequences than were unshared amino acid sequences. We also observed a highly statistically significant association between numbers of shared sequences and shared HLA class I alleles.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Distinct TCRB sequences. (A) Sequence coverage of distinct TRBJ sequences observed upon alignment of Illumina reads quality filtered to >99.9% predicted accuracy (Q30). There was substantial coverage, up to several million-fold, of the 13 human TRBJ segments, but thousands of artifactual distinct TRBJ sequences were also observed, arising from residual sequencing error. (B) Restricting the data set to D96 effectively retains all real TRBJ sequences and excludes all of the artifactual TRBJ sequences observed when no coverage restriction is applied (D100).
Figure 2.
Figure 2.
TCRB diversity in peripheral blood samples. (A) Rarefaction curves plotted using all data from all libraries from blood draw 1 (round symbols) and blood draw 2, taken 1 wk later (square symbols) from donor 1. Random resampling was done in triplicate, and error bars are contained within symbols. Both curves plateau, suggesting that more sequencing of either sample would not be expected to produce many new sequences. However, this tendency toward leveling is a property of rarefaction curves, and the plateau is not informative regarding absolute abundance. Hence, rarefaction curves must be interpreted with caution. (B) Accumulation analysis provides a more meaningful measure of saturation. Here we show the number of new distinct sequences found in each library (x-axis) against the total number of distinct sequences from the blood sample as a whole. The TCRB diversity within blood draw 1 from donor 1 appears to have been captured, since analyzing additional libraries would not be expected to yield many new TCRB sequences. In contrast to the rarefaction curve present in the previous panel, library-based accumulation shows that the diversity of blood draw 2 is similar to that of blood draw 1 but has not yet been fully captured. (C) Despite saturation of blood draw 1, sequences found in blood draw 2 only partially overlap, indicating that there is considerable un-sampled diversity within the peripheral blood TCRB repertoire of this individual.
Figure 3.
Figure 3.
TCRB sequence diversity in the naïve and memory compartments of a deeply sequenced individual. (A) Frequency distributions of CD45RA+/CD45RO− (naïve) and CD45RA−/CD45RO+ (memory) T-cell subsets isolated by FACS from PBMCs from a separate blood sample from donor 1, taken on day 1. The similarity of the two curves reflects the presence of high diversity within both subsets, although this is slightly greater for the naïve subset, as evident from the extended tail of the distribution. There are, in both cases, a small number of extreme copy clonotypes and a relative clonotype abundance varying over four orders of magnitude. (B) There is more overlap with the deeply sequenced sorted cells for the memory versus the naïve subset, but even the overlap of the memory subset is modest. This is consistent with a large total repertoire size that can be only partially captured in a given blood draw. (C) Of the sequences that were shared between sorted naïve and memory cells on day 1, there was a preferential transition to the memory subset on day 8.

Similar articles

Cited by

References

    1. Akondy RS, Monson ND, Miller JD, Edupuganti S, Teuwen D, Wu H, Quyyumi F, Garg S, Altman JD, Del Rio C, et al. 2009. The yellow fever virus vaccine induces a broad and polyfunctional human memory CD8+ T cell response. J Immunol 183: 7919–7930 - PMC - PubMed
    1. Arstila TP, Casrouge A, Baron V, Even J, Kanellopoulos J, Kourilsky P 1999. A direct estimate of the human T cell receptor diversity. Science 286: 958–961 - PubMed
    1. Bassing CH, Swat W, Alt FW 2002. The mechanism and regulation of chromosomal V(D)J recombination. Cell 109(Suppl): S45–S55 - PubMed
    1. Boyd SD, Marshall EL, Merker JD, Maniar JM, Zhang LN, Sahaf B, Jones CD, Simen BB, Hanczaruk B, Nguyen KD, et al. 2009. Measurement and clinical monitoring of human lymphocyte clonality by massively parallel VDJ pyrosequencing. Sci Transl Med 1: 12ra23 - PMC - PubMed
    1. Cereb N, Maye P, Lee S, Kong Y, Yang SY 1995. Locus-specific amplification of HLA class I genes from genomic DNA: Locus-specific sequences in the first and third introns of HLA-A, -B, and -C alleles. Tissue Antigens 45: 1–11 - PubMed

Publication types

MeSH terms

Substances