. 2012 Jun 28:3:176.

doi: 10.3389/fimmu.2012.00176. eCollection 2012.

Immunoglobulin analysis tool: a novel tool for the analysis of human and mouse heavy and light chain transcripts

Tobias Rogosch¹, Sebastian Kerzel, Kam Hon Hoi, Zhixin Zhang, Rolf F Maier, Gregory C Ippolito, Michael Zemlin

Affiliations

PMID: 22754554
PMCID: PMC3384897
DOI: 10.3389/fimmu.2012.00176

Immunoglobulin analysis tool: a novel tool for the analysis of human and mouse heavy and light chain transcripts

Tobias Rogosch et al. Front Immunol. 2012.

. 2012 Jun 28:3:176.

doi: 10.3389/fimmu.2012.00176. eCollection 2012.

Authors

Tobias Rogosch¹, Sebastian Kerzel, Kam Hon Hoi, Zhixin Zhang, Rolf F Maier, Gregory C Ippolito, Michael Zemlin

Affiliation

¹ Department of Pediatrics, Philipps-University Marburg Marburg, Germany.

PMID: 22754554
PMCID: PMC3384897
DOI: 10.3389/fimmu.2012.00176

Abstract

Sequence analysis of immunoglobulin (Ig) heavy and light chain transcripts can refine categorization of B cell subpopulations and can shed light on the selective forces that act during immune responses or immune dysregulation, such as autoimmunity, allergy, and B cell malignancy. High-throughput sequencing yields Ig transcript collections of unprecedented size. The authoritative web-based IMGT/HighV-QUEST program is capable of analyzing large collections of transcripts and provides annotated output files to describe many key properties of Ig transcripts. However, additional processing of these flat files is required to create figures, or to facilitate analysis of additional features and comparisons between sequence sets. We present an easy-to-use Microsoft(®) Excel(®) based software, named Immunoglobulin Analysis Tool (IgAT), for the summary, interrogation, and further processing of IMGT/HighV-QUEST output files. IgAT generates descriptive statistics and high-quality figures for collections of murine or human Ig heavy or light chain transcripts ranging from 1 to 150,000 sequences. In addition to traditionally studied properties of Ig transcripts - such as the usage of germline gene segments, or the length and composition of the CDR-3 region - IgAT also uses published algorithms to calculate the probability of antigen selection based on somatic mutational patterns, the average hydrophobicity of the antigen-binding sites, and predictable structural properties of the CDR-H3 loop according to Shirai's H3-rules. These refined analyses provide in-depth information about the selective forces acting upon Ig repertoires and allow the statistical and graphical comparison of two or more sequence sets. IgAT is easy to use on any computer running Excel(®) 2003 or higher. Thus, IgAT is a useful tool to gain insights into the selective forces and functional properties of small to extremely large collections of Ig transcripts, thereby assisting a researcher to mine a data set to its fullest.

Keywords: antibody repertoire; deep sequencing; high-throughput analysis; immunoglobulin heavy chain gene; immunoglobulin light chain gene; rearrangement; sequence analysis software; somatic mutation.

PubMed Disclaimer

Figures

**Figure 1**
**Screenshot of the “input” worksheet**.

**Figure 2**
**Screenshot of the “summary” worksheet of IgAT**.

**Figure 3**
The “VDJ” worksheet contains graphs displaying the relative utilization of V_H families (A), D_H families (B), and J_H gene segments (C) as well as individual V gene segments (D) and D_H gene segments (E).

**Figure 4**
The graphs in the “CDR-3_length” (positions 105–117) worksheet display the length distribution of CDR-H3 (A), N1 (B), N2 (C), and deconstruction graphs for CDR-H3 with (D) or without (E) identifiable D_H gene segment. Lengths are given in nucleotides.

**Figure 5**
**(A)** Somatic mutation frequency of Ig transcripts (mutations per 1000 nt). Each data point represents the somatic mutation frequency of one sequence. **(B)** Inference of Ag selection in Ig transcripts. Shown is the ratio of replacement mutations in CDR-H1 and CDR-H2 (R_CDR) to the total number of mutations in the V region (M_V) plotted against M_V. The dark shaded area represents the 90% confidence limits and the light gray shaded area the 95% confidence limits for the probability of random mutations. A data point falling outside these confidence limits represents a sequence that has a high proportion of replacement mutations in the CDR. The probability that such a sequence has accumulated as many replacement mutations in the CDR by mere random mutation is p = 0.1 and p = 0.05, respectively. An allocation above the upper confidence limit was considered indicative of Ag selection. Data points are accompanied by their observed frequency. 6.5% (α = 0.05) of the sequences were Ag selected (α = 0.1: 9.6%).

**Figure 6**
**Graphic output of the analysis of amino acid frequencies and variability, using as an example the CDR-H3 sequences with the length of 12 amino acids (positions 105–117, n = 2572)**. **(A)** The Shannon entropy for each position in the CDR-H3 region (the higher the score the more variable the position in terms of amino acids). **(B)** Relative amino acid frequencies at the positions 105–117 for CDR-H3 region. Each bar represents 100% of the amino acid residues found at this specific position. The amino acid residues are stacked in the order of their hydrophobicity according to a normalized Kyte–Doolittle Index (Eisenberg, 1984). Charged amino acid residues are at the bottom, and hydrophobic amino acid residues at the top of each bar as presented previously (Zemlin et al., 2003). **(C)** The Kabat–Wu variability for each position in the CDR-3 region (the higher the score the more variable the position). **(D)** Overall amino acid frequencies within the CDR-3 loop (positions 107–114).

**Figure 7**
**Amino acid frequencies of the CDR-H3 loop for all unique sequences (positions 107–114)**.

**Figure 8**
**Distribution of average CDR-H3 loop hydrophobicities according to a normalized Kyte–Doolittle scale (positions 107–114;** Eisenberg, 1984).

**Figure 9**
**Reading frame utilization given as percent of all unique sequences with identifiable D_H gene segment**. The D_H reading frames are defined according to the nomenclature of Ichihara et al. (1989).

**Figure 10**
**Predicted structural features of the CDR-3 according to the “H3-rules” by Shirai et al. (1999)**. (K−, kinked base; K+, extra kinked base; K−/+, kinked or extra kinked base; E, extended base; hp def K−, deformed hairpin in sequences with kinked base; hp def K+, deformed hairpin in sequences with extra kinked base; hp def K−/+, deformed hairpin in sequences with kinked and extra kinked base; H lad K−, intact hydrogen bond ladder in sequences with kinked base; H lad K+, intact hydrogen bond ladder in sequences with extra kinked base; H lad K−/+, intact hydrogen bond ladder in sequences with kinked and extra kinked base).

See this image and copyright information in PMC

Cited by

The Diagnostic and Prognostic Potential of the B-Cell Repertoire in Membranous Nephropathy.
Su Z, Jin Y, Zhang Y, Guan Z, Li H, Chen X, Xie C, Zhang C, Liu X, Li P, Ye P, Zhang L, Kong Y, Luo W. Su Z, et al. Front Immunol. 2021 May 27;12:635326. doi: 10.3389/fimmu.2021.635326. eCollection 2021. Front Immunol. 2021. PMID: 34122405 Free PMC article.
The same self-peptide selects conventional and regulatory CD4⁺ T cells with identical antigen receptors.
Wojciech L, Ignatowicz A, Seweryn M, Rempala G, Pabla SS, McIndoe RA, Kisielow P, Ignatowicz L. Wojciech L, et al. Nat Commun. 2014 Oct 1;5:5061. doi: 10.1038/ncomms6061. Nat Commun. 2014. PMID: 25270305 Free PMC article.
Evaluation of the Antigen-Experienced B-Cell Receptor Repertoire in Healthy Children and Adults.
IJspeert H, van Schouwenburg PA, van Zessen D, Pico-Knijnenburg I, Driessen GJ, Stubbs AP, van der Burg M. IJspeert H, et al. Front Immunol. 2016 Oct 17;7:410. doi: 10.3389/fimmu.2016.00410. eCollection 2016. Front Immunol. 2016. PMID: 27799928 Free PMC article.
V(D)J Rearrangement Is Dispensable for Producing CDR-H3 Sequence Diversity in a Gene Converting Species.
Leighton PA, Morales J, Harriman WD, Ching KH. Leighton PA, et al. Front Immunol. 2018 Jun 11;9:1317. doi: 10.3389/fimmu.2018.01317. eCollection 2018. Front Immunol. 2018. PMID: 29951062 Free PMC article.
Characterization of T and B cell repertoire diversity in patients with RAG deficiency.
Lee YN, Frugoni F, Dobbs K, Tirosh I, Du L, Ververs FA, Ru H, Ott de Bruin L, Adeli M, Bleesing JH, Buchbinder D, Butte MJ, Cancrini C, Chen K, Choo S, Elfeky RA, Finocchi A, Fuleihan RL, Gennery AR, El-Ghoneimy DH, Henderson LA, Al-Herz W, Hossny E, Nelson RP, Pai SY, Patel NC, Reda SM, Soler-Palacin P, Somech R, Palma P, Wu H, Giliani S, Walter JE, Notarangelo LD. Lee YN, et al. Sci Immunol. 2016 Dec 16;1(6):eaah6109. doi: 10.1126/sciimmunol.aah6109. Epub 2016 Dec 16. Sci Immunol. 2016. PMID: 28783691 Free PMC article.

See all "Cited by" articles

References

1. Ademokun A., Wu Y. C., Martin V., Mitra R., Sack U., Baxendale H., Kipling D., Dunn-Walters D. K. (2011). Vaccination-induced changes in human B-cell repertoire and pneumococcal IgM and IgA antibody at different ages. Aging Cell 10, 922–930 - PMC - PubMed
1. Alamyar E., Giudicelli V., Li S., Duroux P., Lefranc M. P. (2012). IMGT/HighV-QUEST: the IMGT® web portal for immunoglobulin (IG) or antibody and T cell receptor (TR) analysis from NGS high throughput and deep sequencing. Immunome Res. 8, 26. - PubMed
1. Arnaout R., Lee W., Cahill P., Honan T., Sparrow T., Weiand M., Nusbaum C., Rajewsky K., Koralov S. B. (2011). High-resolution description of antibody heavy-chain repertoires in humans. PLoS ONE 6, e22365.10.1371/journal.pone.0022365 - DOI - PMC - PubMed
1. Benichou G., Yamada Y., Yun S. H., Lin C., Fray M., Tocco G. (2011). Immune recognition and rejection of allogeneic skin grafts. Immunotherapy 3, 757–77010.2217/imt.11.2 - DOI - PMC - PubMed
1. Berek C., Griffiths G. M., Milstein C. (1985). Molecular events during maturation of the immune response to oxazolone. Nature 316, 412–41810.1038/316412a0 - DOI - PubMed

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Immunoglobulin analysis tool: a novel tool for the analysis of human and mouse heavy and light chain transcripts

Affiliation

Immunoglobulin analysis tool: a novel tool for the analysis of human and mouse heavy and light chain transcripts

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources