Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2004 Mar 15:5:29.
doi: 10.1186/1471-2105-5-29.

A Hidden Markov Model method, capable of predicting and discriminating beta-barrel outer membrane proteins

Affiliations
Comparative Study

A Hidden Markov Model method, capable of predicting and discriminating beta-barrel outer membrane proteins

Pantelis G Bagos et al. BMC Bioinformatics. .

Abstract

Background: Integral membrane proteins constitute about 20-30% of all proteins in the fully sequenced genomes. They come in two structural classes, the alpha-helical and the beta-barrel membrane proteins, demonstrating different physicochemical characteristics, structure and localization. While transmembrane segment prediction for the alpha-helical integral membrane proteins appears to be an easy task nowadays, the same is much more difficult for the beta-barrel membrane proteins. We developed a method, based on a Hidden Markov Model, capable of predicting the transmembrane beta-strands of the outer membrane proteins of gram-negative bacteria, and discriminating those from water-soluble proteins in large datasets. The model is trained in a discriminative manner, aiming at maximizing the probability of correct predictions rather than the likelihood of the sequences.

Results: The training has been performed on a non-redundant database of 14 outer membrane proteins with structures known at atomic resolution; it has been tested with a jacknife procedure, yielding a per residue accuracy of 84.2% and a correlation coefficient of 0.72, whereas for the self-consistency test the per residue accuracy was 88.1% and the correlation coefficient 0.824. The total number of correctly predicted topologies is 10 out of 14 in the self-consistency test, and 9 out of 14 in the jacknife. Furthermore, the model is capable of discriminating outer membrane from water-soluble proteins in large-scale applications, with a success rate of 88.8% and 89.2% for the correct classification of outer membrane and water-soluble proteins respectively, the highest rates obtained in the literature. That test has been performed independently on a set of known outer membrane proteins with low sequence identity with each other and also with the proteins of the training set.

Conclusion: Based on the above, we developed a strategy, that enabled us to screen the entire proteome of E. coli for outer membrane proteins. The results were satisfactory, thus the method presented here appears to be suitable for screening entire proteomes for the discovery of novel outer membrane proteins. A web interface available for non-commercial users is located at: http://bioinformatics.biol.uoa.gr/PRED-TMBB, and it is the only freely available HMM-based predictor for beta-barrel outer membrane protein topology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Evaluation of the discrimination score. The percentage of correctly predicted outer membrane proteins (filled boxes) and globular ones (open circles) as a function of different score thresholds in the validation of the discrimination procedure. The optimal proportion of correctly classified proteins was obtained using the value of 2.995 as a threshold (determined from, approximately, the intersection of the two curves).
Figure 2
Figure 2
The submission form of the web-based application.
Figure 3
Figure 3
Output of the prediction obtained from the web predictor for the OmpF of E. coli (PDB code: 2OMF [29]).
Figure 4
Figure 4
The architecture of the model used in this work. The 3 "sub-models" corresponding to the 3 labels are shown separately. In the transmembrane sub-model different colors correspond to the tied states. Black circles correspond to the aromatic belt, gray to exterior of the strand's core, and white to the interior side. In the inner and outer loop sub-models, the states forming the ladder are tied together respectively, whereas the N-terminal tail is tied with the C-terminal and the globular outer loop state is not tied with another state. The allowed transitions are shown with arrows.
Figure 5
Figure 5
Posterior probabilities plot. Graphs showing posterior probabilities along the sequence, for residues to be in a transmembrane β-strand, for the proteins with PDB codes 2OMF and 1QD5 [29]. The observed strands taken from PDB [29] and the predicted transmembrane strands obtained by the N-best algorithm are also plotted.

References

    1. von Heijne G. Recent advances in the understanding of membrane protein assembly and function. Q Rev Biophys. 1999;32:285–307. doi: 10.1017/S0033583500003541. - DOI - PubMed
    1. Schulz GE. The structure of bacterial outer membrane proteins. Biochim Biophys Acta. 2002;1565:308–317. doi: 10.1016/S0005-2736(02)00577-1. - DOI - PubMed
    1. Zhai Y, Saier M. H., Jr. The beta-barrel finder (BBF) program, allowing identification of outer membrane beta-barrel proteins encoded within prokaryotic genomes. Protein Sci. 2002;11:2196–2207. doi: 10.1110/ps.0209002. - DOI - PMC - PubMed
    1. Wimley WC. Toward genomic identification of beta-barrel membrane proteins: composition and architecture of known structures. Protein Sci. 2002;11:301–312. doi: 10.1110/ps.29402. - DOI - PMC - PubMed
    1. Diederichs K, Freigang J, Umhau S, Zeth K, Breed J. Prediction by a neural network of outer membrane beta-strand protein topology. Protein Sci. 1998;7:2413–2420. - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources