Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Oct 4;8(1):14820.
doi: 10.1038/s41598-018-32986-y.

Humanization of Antibodies using a Statistical Inference Approach

Affiliations

Humanization of Antibodies using a Statistical Inference Approach

Alejandro Clavero-Álvarez et al. Sci Rep. .

Abstract

Antibody humanization is a key step in the preclinical phase of the development of therapeutic antibodies, originally developed and tested in non-human models (most typically, in mouse). The standard technique of Complementarity-Determining Regions (CDR) grafting into human Framework Regions of germline sequences has some important drawbacks, in that the resulting sequences often need further back-mutations to ensure functionality and/or stability. Here we propose a new method to characterize the statistical distribution of the sequences of the variable regions of human antibodies, that takes into account phenotypical correlations between pairs of residues, both within and between chains. We define a "humanness score" of a sequence, comparing its performance in distinguishing human from murine sequences, with that of some alternative scores in the literature. We also compare the score with the experimental immunogenicity of clinically used antibodies. Finally, we use the humanness score as an optimization function and perform a search in the sequence space, starting from different murine sequences and keeping the CDR regions unchanged. Our results show that our humanness score outperforms other methods in sequence classification, and the optimization protocol is able to generate humanized sequences that are recognized as human by standard homology modelling tools.

PubMed Disclaimer

Conflict of interest statement

M.M. is the scientific director of Diatheva srl, and holds shares in the company; T.D.M. was partially supported by a Diatheva grant. The rest of authors declare no competing interests.

Figures

Figure 1
Figure 1
ROC curves for the different models. ROC curves obtained upon classification of the test human and murine databases, using the human learning database to learn the MG distribution, and to calculate the average distance for the “Tall” and “Tn” methods. Panel (a): ROC curves obtained using the full VHVL chain for classification, with or without CDR regions. Panel (b): ROC curves obtained using the VH or VL chain separately for classification. “Corr. off” indicates that correlations between residues have been removed in the MG model, “FR” refers to the curve obtained removing the CDRs and keeping just the framework regions, “h-m” indicates that classification is performed using both human and murine learning datasets as reference (see Methods).
Figure 2
Figure 2
Boxplot of three score distributions for the different datasets: Panel (a): MG score; Panel (b): Tall score; Panel (c): T20 score. Pharmaceutical antibodies are indicated according to the suffix in their International Nonproprietary Name: “umab” are fully human antibodies; “zumab” are humanized antibodies, usually containing murine CDRs grafted on top of human framework variable regions; “ximab” are chimeric antibodies, obtained by assembling the whole murine variable region on top of a human constant part; “omab” are murine antibodies. Since we deal with just the antibodies’ variable regions, “ximab” and “omab” are indistinguishable. The horizontal lines signal the threshold score above (for the MG case) or below (for the other cases) which the sequence is classified as human. The threshold values are tMG=6383, tTall=116, tT20=76.
Figure 3
Figure 3
Scatter plot of the experimental immunogenicity and the MG score. The immunogenicity (% of patients that develop antibodies against the therapeutic antibody) is plotted versus the MG-score.
Figure 4
Figure 4
Scatter MG score-Hamming distance plot for the Steepest Descent (Panel (a)) and the SAMC (Panel (b)) for all our targets. Here HD is the Hamming distance between our proposed sequence and the original murine one. The SAMC trajectories start at the murine sequences, on the left (HD = 0, MG-score around 4500), then jump immediately to the region at the bottom-right of high HD, low score (highly non-human sequences, but very different from the original murine). Notice that all trajectories roughly overlap in this region: there is no memory of their different, and fixed, CDR sequences, and we witness a basically free exploration of the sequence space. Then, when the temperature falls below a certain threshold, the trajectories move to the top-left region, of highly human sequences with score and HD depending on the fixed CDR regions of the original sequence.

References

    1. Williams David Gareth, Matthews David J., Jones Tarran. Antibody Engineering. Berlin, Heidelberg: Springer Berlin Heidelberg; 2010. Humanising Antibodies by CDR Grafting; pp. 319–339.
    1. Abhinandan K, Martin AC. Analyzing the “Degree of Humanness” of Antibody Sequences. J. Mol. Biol. 2007;369:852–862. doi: 10.1016/j.jmb.2007.02.100. - DOI - PubMed
    1. Gao SH, Huang K, Tu H, Adler AS. Monoclonal antibody humanness score and its applications. BMC Biotechnology. 2013;13:1–12. doi: 10.1186/1472-6750-13-55. - DOI - PMC - PubMed
    1. Lazar GA, Desjarlais JR, Jacinto J, Karki S, Hammond PW. A molecular immunology approach to antibody humanization and functional optimization. Mol. immunology. 2007;44:1986–1998. doi: 10.1016/j.molimm.2006.09.029. - DOI - PubMed
    1. Choi Y, Hua C, Sentman CL, Ackerman ME, Bailey-Kellogg C. Antibody humanization by structure-based computational protein design. mAbs. 2015;7:1045–1057. doi: 10.1080/19420862.2015.1076600. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances