Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan-Dec;15(1):2268255.
doi: 10.1080/19420862.2023.2268255. Epub 2023 Oct 24.

A window into the human immune system: comprehensive characterization of the complexity of antibody complementary-determining regions in functional antibodies

Affiliations

A window into the human immune system: comprehensive characterization of the complexity of antibody complementary-determining regions in functional antibodies

Oscar Mejias-Gomez et al. MAbs. 2023 Jan-Dec.

Abstract

The human immune system uses antibodies to neutralize foreign antigens. They are composed of heavy and light chains, both with constant and variable regions. The variable region has six hypervariable loops, also known as complementary-determining regions (CDRs) that determine antibody diversity and antigen specificity. Knowledge of their significance, and certain residues present in these areas, is vital for antibody therapeutics development. This study includes an analysis of more than 11,000 human antibody sequences from the International Immunogenetics information system (IMGT). The analysis included parameters such as length distribution, overall amino acid diversity, amino acid frequency per CDR and residue position within antibody chains. Overall, our findings confirm existing knowledge, such as CDRH3's high length diversity and amino acid variability, increased aromatic residue usage, particularly tyrosine, charged and polar residues like aspartic acid, serine, and the flexible residue glycine. Specific residue positions within each CDR influence these occurrences, implying a unique amino acid type distribution pattern. We compared amino acid type usage in CDRs and non-CDR regions, both in globular and transmembrane proteins, which revealed distinguishing features, such as increased frequency of tyrosine, serine, aspartic acid, and arginine. These findings should prove useful for future optimization, improvement of affinity, synthetic antibody library design, or the creation of antibodies de-novo in silico.

Keywords: Amino acid diversity; Complementary-Determining Regions (CDRs); IMGT (International Immunogenetics Information System); antibodies/immunoglobulins; antibody sequencing; antibody therapeutics development; antibody-antigen complexes; de-novo in silico antibody design; immunology; synthetic antibody libraries.

PubMed Disclaimer

Conflict of interest statement

No potential conflict of interest was reported by the author(s).

Figures

Bar plots showing the length diversity for the complementary determining regions of heavy and light chains of human immunoglobulins.
Figure 1.
Length distribution of human immunoglobulin heavy and light chain CDRs. a) CDRH1. 80,1% of the sequences fall under a CDRH1 length of 8 amino acids (7779 out of 9684 sequences). 14,7% of the sequences have 10 amino acid long CDRH1 (1442 out of 9684 sequences). b) CDRκ1. CDR1 lengths for κ are either 6, 7, 11, or 12 amino acids, interestingly with no lengths of 8 to 10 amino acids, while the frequency 6 (56,4%) and 7 amino acids (25,9%) is highest c) CDRλ1. CDRλ1 lengths for λ are either 6, 7, 8, or 9 amino acids long, while all are comparably frequent (6 amino acids highest with 35,6%), 7 amino acids is rarely used (5,8%). d) CDRH2. 62,9% of the sequences have an 8 amino acids long CDRH2, constituting the majority of the CDRH2 diversity (6088 out of 9684 sequences). A considerable number of sequences have 7 amino acids long CDRH2 (28,9%, 2795 out of 9684 sequences). e) CDRκ2. CDR2 for κ is exclusively 3 amino acids long (99,8%) f) CDRλ2. CDR2 for λ seems to be limited to 3 amino acids (92,4%) with a rare occurrence of 7 amino acids. g) CDRκ3. CDR3 for κ shows lower diversity that its counterpart in heavy chain with only four lengths and where 9 amino acids comprises the majority of sequences (69,9%) h) CDRH3. The length diversity of CDRH3 is much greater than those of CDRH1 and CDRH2. There are 18 lengths with more than 50 sequences, where 14 and 15 amino acids are the two most abundant lengths. Human heavy chain CDRH3 can be remarkably long up to 36 (only 1 sequence entry in database). i) CDRλ3. CDR3 for λ also shows low diversity compared to CDRH3 with four lengths spanning the CDR3 length diversity and where 11 amino acids comprises the majority of sequences (46,8%).
Two stacked bar plot showing the frequency of each amino acid in CDRs of heavy and light chains compared with constant domains, globular proteins, and transmembrane proteins. The bars have different colors indicating different amino acids.
Figure 2.
Average of relative abundance of each amino acid per CDR of heavy (a) and light chain (b), globular (Glob) and transmembrane proteins (TMB). The frequencies of globular proteins, transmembrane proteins, framework regions within human antibody sequences, and constant regions taking Trastuzumab as model IgG are shown for comparison a) the average relative abundances of the 20 natural amino acids are shown for CDRH1 of 8 amino acids long, CDRH2 of 7 and 8 amino acids long, all CDRH3 lengths with more than 50 sequences in the database. B) average distribution of amino acids in κ and λ CDRs. The average relative abundances of the 20 natural amino acids are shown for the most abundant κ and λ light chain lengths for CDR1, CDR2, and CDR3.
Seven stacked bar plots showing the amino acid frequency per position in CDRs of heavy chain of different lengths. Two line plots where the frequency of particular amino acids in certain positions can be observed. The bars have different colors indicating different amino acids.
Figure 3.
Amino acid distributions per position in CDRH1, CDRH2, and CDRH3. The y-axis shows relative abundance in percentage (%) and the x-axis shows the specific position within each CDRH for plots a to G. The x-axis in plot H and I shows the different CDRH3 lengths studied. a) CDRH1 b) CDRH2-7aa c) CDRH2-8aa d) CDRH3-8aa e) CDRH3-14aa f) CDRH3-20aa g) CDRH3-25aa h) amino acid changes over different CDRH3 length at flanking CDRH3 IMGT positions i) amino acid changes over different CDRH3 length at inner CDRH3 IMGT positions.
Twelve stacked bar plots showing the amino acid frequency per position in CDRs of kappa light chain for human immunoglobulins. The bars have different colors indicating different amino acids.
Figure 4.
Position specific amino acid distribution in κ and λ light chain CDRs. The y-axis shows the relative abundance of each amino acid while the x-axis shows the positions for CDR1, CDR2, and CDR3 according to IMGT numbering scheme. a) CDRκ1-6aa b) CDRκ2 c) CDRκ3-8aa d) CDRκ3-9aa e) CDRκ3-10aa f) CDRκ3-11aa g) CDRλ1-6aa h) CDRλ2 i) CDRλ2-9aa j) CDRλ2-10aa k) CDRλ2-11aa i) CDRλ2-12aa.

Similar articles

Cited by

References

    1. Kienzler A-K, Eibel H.. Human B cell development and tolerance. In: Ratcliffe MJH, editor. Encyclopedia of immunobiology. United Kingdom: Elsevier; 2016. p. 105–12.
    1. Chi X, Li Y, Qiu X. V(D)J recombination, somatic hypermutation and class switch recombination of immunoglobulins: mechanism and regulation. Immunology. 2020;160(3):233–47. doi:10.1111/imm.13176. - DOI - PMC - PubMed
    1. Schroeder HW. The evolution and development of the antibody repertoire. Front Immunol. 2015;6:6. doi:10.3389/fimmu.2015.00033. - DOI - PMC - PubMed
    1. Tonegawa S. Somatic generation of antibody diversity. Nature. 1983;302(5909):575–81. doi:10.1038/302575a0. - DOI - PubMed
    1. Kunik V, Peters B, Ofran Y, Baker B. Structural consensus among antibodies defines the antigen binding site. PLoS Comput Biol [Internet]. 2012;8(2):e1002388. [accessed 2022 Nov 16]. doi:10.1371/journal.pcbi.1002388. - DOI - PMC - PubMed

Publication types

LinkOut - more resources