Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun:10:100025.
doi: 10.1016/j.immuno.2023.100025. Epub 2023 Feb 19.

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

Affiliations

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

William D Lees et al. Immunoinformatics (Amst). 2023 Jun.

Abstract

Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.

Keywords: AIRR-seq; Immune receptor; Immune receptor germline; Immune receptor repertoire; Rep-seq.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Cumulative number of human IG alleles in IMGT databases (LIGM-DB to 2001 and IMGT GENE/DB subsequently).
Fig. 2.
Fig. 2.
Alleles inferred from AIRR-seq reads are derived from recombined VDJ sequences, meaning that the exact genes from which they arise (determined by location in the immune receptor locus) cannot be determined. In some cases, the location can be inferred by assessing sequence similarity to genes in the reference genome. However, the presence of multiple highly similar genes may make it impossible to determine the correct mapping unambiguously.
Fig. 3.
Fig. 3.
Challenges in the characterization of IG/TR genes, and the current status in three key species.
Fig. 4.
Fig. 4.
Summary of results.
Fig. 5.
Fig. 5.
(A) – the relationship of objects defined in the Schema. (B) - evolution of labels and aliases through phases of discovery. The diagram depicts four stages in the activity of a community group curating sequences from a particular species. These events are likely to be separated in time and may be triggered by the availability of additional evidence.
Fig. 6.
Fig. 6.
The AIRR Germline Set Schema. See Supplementary Data for detailed description and itemisation of fields.

Similar articles

Cited by

References

    1. Lefranc MP, Giudicelli V, Duroux P, Jabado-Michaloud J, Folch G, Aouinti S, Carillon E, Duvergey H, Houles A, Paysan-Lafosse T, Hadi-Saljoqi S, Sasorith S, Lefranc G, Kossida S. IMGT®, the international ImMunoGeneTics information system® 25 years on. Nud Acids Res 2015;43:D413–22. 10.1093/nar/gku1056. - DOI - PMC - PubMed
    1. Retter I, Althaus HH, Münch R, Müller W. VBASE2, an integrative V gene database. Nucl Acids Res 2005;33:D671–4. 10.1093/nar/gki088. - DOI - PMC - PubMed
    1. Kaduk M, Corcoran M, Karlsson Hedestam GB. Addressing IGHV gene structural diversity enhances immunoglobulin repertoire analysis: lessons from rhesus Macaque. Front Immunol 2022;13. https://www.frontiersin.org/article/10.3389/fimmu.2022.818440. accessed May 1, 2022. - DOI - PMC - PubMed
    1. Collins AM, Peres A, Corcoran MM, Watson CT, Yaari G, Lees WD, Ohlin M. Commentary on Population matched (pm) germline allelic variants of immunoglobulin (IG) loci: relevance in infectious diseases and vaccination studies in human populations. Genes Immun 2021;22:335–8. 10.1038/S41435-021-00152-6. - DOI - PMC - PubMed
    1. Jackson KJ, Kos JT, Lees W, Gibson WS, Smith ML, Peres A, et al. A BALB/c IGHV reference set, defined by haplotype analysis of long-read VDJ-C sequences from F1 (BALB/c /C57BL/6) mice. Front Immunol 2022;13. https://www.frontiersin.org/articles/10.3389/fimmu.2022.888555/full. - DOI - PMC - PubMed

LinkOut - more resources