Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Dec;11(12):2974-80.
doi: 10.1110/ps.0226702.

Prediction of partial membrane protein topologies using a consensus approach

Affiliations

Prediction of partial membrane protein topologies using a consensus approach

Johan Nilsson et al. Protein Sci. 2002 Dec.

Abstract

We have developed a method to reliably identify partial membrane protein topologies using the consensus of five topology prediction methods. When evaluated on a test set of experimentally characterized proteins, we find that approximately 90% of the partial consensus topologies are correctly predicted in membrane proteins from prokaryotic as well as eukaryotic organisms. Whole-genome analysis reveals that a reliable partial consensus topology can be predicted for approximately 70% of all membrane proteins in a typical bacterial genome and for approximately 55% of all membrane proteins in a typical eukaryotic genome. The average fraction of sequence length covered by a partial consensus topology is 44% for the prokaryotic proteins and 17% for the eukaryotic proteins in our test set, and similar numbers are found when the algorithm is applied to whole genomes. Reliably predicted partial topologies may simplify experimental determinations of membrane protein topology.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Fraction correctly predicted global topologies (black bars) and fraction of the test sets covered (white bars) for different levels of agreement among the five prediction methods (5/0, all methods agree; 4/1, four methods agree, etc.). Left: prokaryotic proteins; right: eukaryotic proteins.
Fig. 2.
Fig. 2.
Partial consensus topology prediction procedure (for simplicity shown only for two prediction methods). (A) For each residue position, a consensus topology state (i, o, m, w) is assigned only if all five prediction methods agree. Otherwise, no consensus is assigned (designated as ‘.’). If two loops with opposite locations (o and i) appear at the same position, a loop clash (X) is assigned. In the same manner, a TMH clash (#) is assigned when two TMHs with opposite orientations (m and w) appear at the same position. After the filtering step described in B, the consensus topology is scanned from the N–terminus, and the beginning of the first PCT is defined by the first TM region (m or w states) of at least n consecutive residues in the consensus topology (where the default value is n = 5). The PCT is then extended towards the C–terminus until either a consensus TMH of less than n consecutive residues is encountered, or a loop– or TM–clash occurs. In either case, the end of the PCT is defined by the most C–terminally located TMH in the consensus topology. The process of PCT construction is then repeated until the C-terminal end of the protein is reached. (B) Removal of TMH clashes which are due to slight misalignments of the TMHs. When a TMH clash occurs between two partially overlapping consensus TMHs with opposite orientations, the clash state is replaced by a loop state of the appropriate type. To emphasize that the resulting loop was obtained in the filtering procedure (and not in the majority–vote procedure), it is indicated in lowercase. (C) The figure illustrates how the choice of n–value affects the PCT prediction results. The result can differ substantially when a consensus TMH that is longer (left) or shorter (right) than n is encountered. (D) Average fraction of sequence length covered by PCTs (squares) and average fraction of correctly predicted PCTs (circles) for different values of n for the prokaryotic (black symbols) and eukaryotic (white symbols) test sets.

Similar articles

Cited by

References

    1. Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne, J.D., Amanatides, P.G., Scherer, S.E., Li, P.W., Hoskins, R.A., Galle, R.F., et al. 2000. The genome sequence of Drosophila melanogaster. Science 287 2185–2195. - PubMed
    1. Blattner, F.R., Plunkett III, G., Bloch, C.A., Perna, N.T., Burland, V., Riley, M., Collado-Vides, J., Glasner, J.D., Rode, C.K., Mayhew, G.F., et al. 1997. The complete genome sequence of Escherichia coli K-12. Science 277 1453–1474. - PubMed
    1. Bult, C.J., White, O., Olsen, G.J., Zhou, L., Fleischmann, R.D., Sutton, G.G., Blake, J.A., FitzGerald, L.M., Clayton, R.A., Gocayne, J.D., et al. 1996. Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science 273 1058–1073. - PubMed
    1. Claros, M.G. and von Heijne, G. 1994. TopPred II: An improved software for membrane protein structure predictions. Comput. Appl. Biosci. 10 685–686. - PubMed
    1. Cole, S.T., Brosch, R., Parkhill, J., Garnier, T., Churcher, C., Harris, D., Gordon, S.V., Eiglmeier, K., Gas, S., Barry III, C.E., et al. 1998. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393 537–544. - PubMed

Publication types

Substances

LinkOut - more resources