Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jun 19:8:211.
doi: 10.1186/1471-2105-8-211.

Composition Profiler: a tool for discovery and visualization of amino acid composition differences

Affiliations

Composition Profiler: a tool for discovery and visualization of amino acid composition differences

Vladimir Vacic et al. BMC Bioinformatics. .

Abstract

Background: Composition Profiler is a web-based tool for semi-automatic discovery of enrichment or depletion of amino acids, either individually or grouped by their physico-chemical or structural properties.

Results: The program takes two samples of amino acids as input: a query sample and a reference sample. The latter provides a suitable background amino acid distribution, and should be chosen according to the nature of the query sample, for example, a standard protein database (e.g. SwissProt, PDB), a representative sample of proteins from the organism under study, or a group of proteins with a contrasting functional annotation. The results of the analysis of amino acid composition differences are summarized in textual and graphical form.

Conclusion: As an exploratory data mining tool, our software can be used to guide feature selection for protein function or structure predictors. For classes of proteins with significant differences in frequencies of amino acids having particular physico-chemical (e.g. hydrophobicity or charge) or structural (e.g. alpha helix propensity) properties, Composition Profiler can be used as a rough, light-weight visual classifier.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Composition Profiles of homo- (A) and heterodimerisation (B) interfaces and hub proteins from C. elegans PPI network. Analysis of residues in homo- and heterodimer interfaces against surface residues of monomeric proteins shows slight depletion in hydrophilics (cyan) and enrichement in hydrophobics (black) as a general trend, although homodimer interfaces show closer resemblance to the protein surfaces. Composition profile of hub proteins shows a general enrichment in disorder (red) and depletion in order promoting residues (blue).
Figure 2
Figure 2
Composition profiles of PDB Select 25 (A), surface residues of monomers (B) and DisProt (C) against SwissProt. Plotting the three graphs using the same y-axis scale, same ordering of amino acids and the same color-coding scheme (flexibility) allows for a direct visual comparison between enrichment and depletion patterns in the three datasets.

References

    1. Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LS. The Universal Protein Resource (UniProt) Nucleic Acids Research. 2005;33:D154–159. doi: 10.1093/nar/gki070. - DOI - PMC - PubMed
    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Research. 2000;28:235–242. doi: 10.1093/nar/28.1.235. - DOI - PMC - PubMed
    1. Molecular Surface Package http://www.biohedron.com
    1. Sickmeier M, Hamilton JA, LeGall T, Vacic V, Cortese MS, Tantos A, Szabo B, Tompa P, Chen J, Uversky VN, Obradovic Z, Dunker AK. DisProt: the Database of Disordered Proteins. Nucleic Acids Research. 2007;35:D786–93. doi: 10.1093/nar/gkl893. - DOI - PMC - PubMed
    1. Zimmerman JM, Eliezer N, Simha R. The characterization of amino acid sequences in proteins by statistical methods. J Theor Biol. 1968;21:170–201. doi: 10.1016/0022-5193(68)90069-6. - DOI - PubMed

Publication types