Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan 26:11:26.
doi: 10.1186/1471-2148-11-26.

Proteome sequence features carry signatures of the environmental niche of prokaryotes

Affiliations

Proteome sequence features carry signatures of the environmental niche of prokaryotes

Zlatko Smole et al. BMC Evol Biol. .

Abstract

Background: Prokaryotic environmental adaptations occur at different levels within cells to ensure the preservation of genome integrity, proper protein folding and function as well as membrane fluidity. Although specific composition and structure of cellular components suitable for the variety of extreme conditions has already been postulated, a systematic study describing such adaptations has not yet been performed. We therefore explored whether the environmental niche of a prokaryote could be deduced from the sequence of its proteome. Finally, we aimed at finding the precise differences between proteome sequences of prokaryotes from different environments.

Results: We analyzed the proteomes of 192 prokaryotes from different habitats. We collected detailed information about the optimal growth conditions of each microorganism. Furthermore, we selected 42 physico-chemical properties of amino acids and computed their values for each proteome. Further, on the same set of features we applied two fundamentally different machine learning methods, Support Vector Machines and Random Forests, to successfully classify between bacteria and archaea, halophiles and non-halophiles, as well as mesophiles, thermophiles and mesothermophiles. Finally, we performed feature selection by using Random Forests.

Conclusions: To our knowledge, this is the first time that three different classification cases (domain of life, halophilicity and thermophilicity) of proteome adaptation are successfully performed with the same set of 42 features. The characteristic features of a specific adaptation constitute a signature that may help understanding the mechanisms of adaptation to extreme environments.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Four unique features used for classifications regarding domain of life revealed by the feature selection algorithm of RF. Pairs of box-and-whisker plots are shown for each feature: Leu content, average protein size in a proteome, His content, and 10-Cys content. Box-and-whisker plots represent bacteria and archaea from top to bottom. The feature values are normalized from 0 to 1 from left to right. (+) signs represent outliers.
Figure 2
Figure 2
Three unique features used for classifications regarding halophilicity revealed by the feature selection algorithm of RF. Pairs of box-and-whisker plots are shown for each feature: positive charge, normalized frequency of beta turn, and Phe content. Box-and-whisker plots represent non-halophiles and halophiles from top to bottom. The feature values are normalized from 0 to 1 from left to right. (+) signs represent outliers.
Figure 3
Figure 3
Four unique features used for classifications regarding thermophilicity revealed by the feature selection algorithm of RF. Triplets of box-and-whisker plots are shown for each feature: information measure for loop, Val content, Tyr content, and Chou-Fasman parameter of the coil conformation. Box-and-whisker plots represent mesophiles, mesothermophiles and thermophiles from top to bottom. The feature values are normalized from 0 to 1 from left to right. (+) signs represent outliers.

References

    1. Cavicchioli R. Cold-adapted archaea. Nat Rev Microbiol. 2006;4:331–343. doi: 10.1038/nrmicro1390. - DOI - PubMed
    1. Hickey DA, Singer GA. Genomic and proteomic adaptations to growth at high temperature. Gen Biol. 2004;5:117. doi: 10.1186/gb-2004-5-10-117. - DOI - PMC - PubMed
    1. Bolhuis H, Poeleand EM, Valera FR. Isolation and cultivation of Walsby's square archaeon. Env Microbiol. 2004;6(12):1287. doi: 10.1111/j.1462-2920.2004.00692.x. - DOI - PubMed
    1. Schleper C, Puhler G, Klenk HP, Zillig W. Picrophilus oshimae and Picrophilus torridus fam. nov., gen. nov., sp. nov., two species of hyperacidophilic, thermophilic, heterotrophic, aerobic archaea. Int J Sys Bacterio. 1996;46:814–816. doi: 10.1099/00207713-46-3-814. - DOI
    1. Pikuta EV, Hoover RB, Tang J. Micro Extremophiles at the Limits of Life. Crit Rev Microbiol. 2007;33:183–209. doi: 10.1080/10408410701451948. - DOI - PubMed

Publication types