Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;8(1):e53235.
doi: 10.1371/journal.pone.0053235. Epub 2013 Jan 7.

Prediction and analysis of antibody amyloidogenesis from sequences

Affiliations

Prediction and analysis of antibody amyloidogenesis from sequences

Chyn Liaw et al. PLoS One. 2013.

Abstract

Antibody amyloidogenesis is the aggregation of soluble proteins into amyloid fibrils that is one of major causes of the failures of humanized antibodies. The prediction and prevention of antibody amyloidogenesis are helpful for restoring and enhancing therapeutic effects. Due to a large number of possible germlines, the existing method is not practical to predict sequences of novel germlines, which establishes individual models for each known germline. This study proposes a first automatic and across-germline prediction method (named AbAmyloid) capable of predicting antibody amyloidogenesis from sequences. Since the amyloidogenesis is determined by a whole sequence of an antibody rather than germline-dependent properties such as mutated residues, this study assess three types of germline-independent sequence features (amino acid composition, dipeptide composition and physicochemical properties). AbAmyloid using a Random Forests classifier with dipeptide composition performs well on a data set of 12 germlines. The within- and across-germline prediction accuracies are 83.10% and 83.33% using Jackknife tests, respectively, and the novel-germline prediction accuracy using a leave-one-germline-out test is 72.22%. A thorough analysis of sequence features is conducted to identify informative properties for further providing insights to antibody amyloidogenesis. Some identified informative physicochemical properties are amphiphilicity, hydrophobicity, reverse turn, helical structure, isoelectric point, net charge, mutability, coil, turn, linker, nuclear protein, etc. Additionally, the numbers of ubiquitylation sites in amyloidogenic and non-amyloidogenic antibodies are found to be significantly different. It reveals that antibodies less likely to be ubiquitylated tend to be amyloidogenic. The method AbAmyloid capable of automatically predicting antibody amyloidogenesis of novel germlines is implemented as a publicly available web server at http://iclab.life.nctu.edu.tw/abamyloid.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The receiver operator characteristic (ROC) curves of three general methods.
Figure 2
Figure 2. Three evaluation methods for AbAmyloid.
(A) There are 12 individual germline models. Each model is evaluated using a Jackknife test. (B) Only one model is constructed using a dataset of 12 germline (AA-432). (C) The leave-one-germline-out test is applied to evaluate the novel-germline prediction of AbAmyloid where each dataset of one germline is served as the test dataset of novel germline in turn.
Figure 3
Figure 3. Histogram and percentages for sequence pairs with sequence identities between training and test datasets.
Figure 4
Figure 4. Histogram for sequence pairs with sequence identities between amyloidogenic and non-amyloidogenic sequences.
Figure 5
Figure 5. Learning curves using various numbers of germlines for training classifiers.
The prediction performance is evaluated by using a leave-one-germline-out test.
Figure 6
Figure 6. Feature importance of Amino Acid Composition (AAC).
The feature with the largest value of mean decrease of Gini index (MDGI) is the most important.
Figure 7
Figure 7. Feature importance of Dipeptide Composition (DPC).
The feature with the largest value of mean decrease of Gini index (MDGI) is the most important.
Figure 8
Figure 8. The heatmap of DPC feature importance.
Figure 9
Figure 9. Feature importance of Physicochemical Properties (PPs).
The feature with the largest value of mean decrease of Gini index (MDGI) is the most important.
Figure 10
Figure 10. The 100% stacked column chart of the numbers of lysines (K) and putative ubiquitylated lysines (Ub-K).
Figure 11
Figure 11. Distribution of lysines for amyloidogenic and non-amyloidogenic antibodies.
Figure 12
Figure 12. Feature importance of 12 ubiquitylation features.
The feature with the largest value of mean decrease of Gini index (MDGI) is the most important.
Figure 13
Figure 13. Feature importance of 12 ubiquitylation features and Dipeptide Composition (DPC).
The feature with the largest value of mean decrease of Gini index (MDGI) is the most important.

Similar articles

Cited by

References

    1. Presta LG (2008) Molecular engineering and design of therapeutic antibodies. Curr Opin Immunol 20: 460–470. - PubMed
    1. Waldmann H, Hale G (2005) CAMPATH: from concept to clinic. Philos Trans R Soc Lond B Biol Sci 360: 1707–1711. - PMC - PubMed
    1. Padlan E (1991) A possible procedure for reducing the immunogenicity of antibody variable domains while preserving their ligand-binding properties. Molecular Immunology 28: 489–498. - PubMed
    1. Roguska M, Pedersen J, Keddy C (1994) Humanization of murine monoclonal antibodies through variable domain resurfacing. Proceedings of the National Academy of Sciences 91: 969–973. - PMC - PubMed
    1. Clark M (2000) Antibody humanization: a case of the 'Emperor's new clothes'? Immunol Today 21: 397–402. - PubMed

Publication types

LinkOut - more resources