Review

Front Immunol. 2018 Feb 21:9:224.

doi: 10.3389/fimmu.2018.00224. eCollection 2018.

Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires

Enkelejda Miho¹ ², Alexander Yermanos¹, Cédric R Weber¹, Christoph T Berger³ ⁴, Sai T Reddy¹, Victor Greiff¹ ⁵

Affiliations

¹ Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.
² aiNET GmbH, ETH Zürich, Basel, Switzerland.
³ Department of Biomedicine, University Hospital Basel, Basel, Switzerland.
⁴ Department of Internal Medicine, Clinical Immunology, University Hospital Basel, Basel, Switzerland.
⁵ Department of Immunology, University of Oslo, Oslo, Norway.

PMID: 29515569
PMCID: PMC5826328
DOI: 10.3389/fimmu.2018.00224

Enkelejda Miho et al. Front Immunol. 2018.


Abstract

The adaptive immune system recognizes antigens via an immense array of antigen-binding antibodies and T-cell receptors, the immune repertoire. The interrogation of immune repertoires is of high relevance for understanding the adaptive immune response in disease and infection (e.g., autoimmunity, cancer, HIV). Adaptive immune receptor repertoire sequencing (AIRR-seq) has driven the quantitative and molecular-level profiling of immune repertoires, thereby revealing the high-dimensional complexity of the immune receptor sequence landscape. Several methods for the computational and statistical analysis of large-scale AIRR-seq data have been developed to resolve immune repertoire complexity and to understand the dynamics of adaptive immunity. Here, we review the current research on (i) diversity, (ii) clustering and network, (iii) phylogenetic, and (iv) machine learning methods applied to dissect, quantify, and compare the architecture, evolution, and specificity of immune repertoires. We summarize outstanding questions in computational immunology and propose future directions for systems immunology toward coupling AIRR-seq with the computational discovery of immunotherapeutics, vaccines, and immunodiagnostics.

Keywords: B-cell receptor; T-cell receptor; antibody discovery; artificial intelligence; immunogenomics; networks; phylogenetics; systems immunology.


Figures

**Figure 1**
The immune repertoire space is defined by diversity, architecture, evolution, and convergence. **(A)** Diversity measurements are based on (i) the accurate annotation of V(D)J segments using deterministic and probabilistic approaches with population-level or individualized germline gene reference databases. (ii) Probabilistic and hidden Markov models allow inference of recombination statistics. (iii) Measurement of clonotype diversity using diversity profiles. **(B)** Analysis of repertoire architecture relies predominantly on (i) clonal networks that are constructed by connecting nucleotide or amino acid sequence nodes by similarity edges. The sequence similarity between clones is defined *via* a string distance [e.g., Levenshtein distance (LD)], resulting in undirected Boolean networks for a given threshold (nucleotides/amino acids). An example of a global network parameter is the diameter, shown by black edges. An example of a local network parameter is the degree (n = 1) of the individual clonal node shown in black. (ii) Degree distribution is a global characteristic of immune repertoire networks, which can be used for analyzing clonal expansion. (iii) The immune repertoire can be decomposed into similarity layers. Layer D1 captures clonal nodes that differ by an edit distance of 1 (1 nt/a.a. difference), layer D2 those that differ by 2, and so forth. **(C)** Assessing evolution of antibody lineages. (i) Reconstruction of phylogenetic trees. Stars indicate somatic hypermutation. (ii) Probabilistic methods for the inference of mutation statistics in antibody lineage evolution. (iii) Simulation of antibody repertoire evolution for benchmarking antibody-tailored phylogenetic inference algorithms. **(D)** Naive and antigen-driven cross-individual sequence similarity and convergence in immune repertoires. (i) The Venn diagram shows sequences shared between the two repertoires (circles). Signature-like sequence features are highlighted by black squares. (ii) Database of convergent or antigen-specific immune receptor sequences. (iii) K-mer sequence decomposition and classification of immune receptor sequences.
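The network construction described in panel **(B)** can be illustrated with a minimal sketch. It assumes clones are represented as plain amino acid strings, that a similarity layer D*n* contains the edges between clones at exactly Levenshtein distance *n*, and that brute-force pairwise comparison is acceptable for a toy input. The function names and the example sequences are hypothetical and are not taken from the article or from any of the tools it reviews.

```python
from collections import Counter
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def similarity_layer(clones, distance):
    """Undirected edges joining unique clones at exactly the given edit distance."""
    unique = sorted(set(clones))
    return {(u, v) for u, v in combinations(unique, 2)
            if levenshtein(u, v) == distance}


def degrees(clones, edges):
    """Number of similarity edges attached to each clonal node."""
    degree = {clone: 0 for clone in set(clones)}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return degree


def degree_distribution(degree):
    """Map each degree value to the number of nodes that have it."""
    return dict(Counter(degree.values()))


# Hypothetical toy clones (illustrative strings, not real repertoire data).
toy_clones = ["CARDYW", "CARDFW", "CARGFW", "CASSLW", "CTTTTW"]

layer_d1 = similarity_layer(toy_clones, 1)   # layer D1: one edit apart
layer_d2 = similarity_layer(toy_clones, 2)   # layer D2: two edits apart
degree_d1 = degrees(toy_clones, layer_d1)
distribution_d1 = degree_distribution(degree_d1)
```

Pairwise comparison scales quadratically with the number of clones, so this sketch is only suitable for small inputs; repertoire-scale data would need an indexed or distributed approach.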

**Figure 2**
An overview of selected computational tools used in immune repertoire analyses. Each horizontal colored bar in the *Basis* column represents a unique antibody or T-cell receptor (TCR) sequence. Vertical red bars represent sequence differences or somatic hypermutation. The *Method* column describes the general concept of the computational methods and how these are applied to immune repertoires. The *Tools* column highlights exemplary key resources for performing computational analysis in the respective analytical sections [rows **(A–D)**].


Cited by

Generation of a single-cell B cell atlas of antibody repertoires and transcriptomes to identify signatures associated with antigen specificity.
Agrafiotis A, Neumeier D, Hong KL, Chowdhury T, Ehling R, Kuhn R, Sandu I, Kreiner V, Cotet TS, Shlesinger D, Laslo D, Anzböck S, Starkie D, Lightwood DJ, Oxenius A, Reddy ST, Yermanos A. iScience. 2023 Jan 25;26(3):106055. doi: 10.1016/j.isci.2023.106055. eCollection 2023 Mar 17. PMID: 36852274. Free PMC article.
Antigen-driven T cell responses in rheumatic diseases: insights from T cell receptor repertoire studies.
Garrido-Mesa J, Brown MA. Nat Rev Rheumatol. 2025 Mar;21(3):157-173. doi: 10.1038/s41584-025-01218-9. Epub 2025 Feb 7. PMID: 39920282. Review.
Breast cancer is detectable from peripheral blood using machine learning over T cell receptor repertoires.
Zuckerbrot-Schuldenfrei M, Raphael A, Zilberberg A, Efroni S. NPJ Syst Biol Appl. 2025 Aug 8;11(1):89. doi: 10.1038/s41540-025-00573-3. PMID: 40781233. Free PMC article.
AIRRscape: An interactive tool for exploring B-cell receptor repertoires and antibody responses.
Waltari E, Nafees S, McCutcheon KM, Wong J, Pak JE. PLoS Comput Biol. 2022 Sep 20;18(9):e1010052. doi: 10.1371/journal.pcbi.1010052. eCollection 2022 Sep. PMID: 36126074. Free PMC article.
B cell tolerance and autoimmunity: Lessons from repertoires.
Deguine J, Xavier RJ. J Exp Med. 2024 Sep 2;221(9):e20231314. doi: 10.1084/jem.20231314. Epub 2024 Aug 2. PMID: 39093312. Free PMC article. Review.


