Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;38(Web Server issue):W563-8.
doi: 10.1093/nar/gkq427. Epub 2010 May 27.

Protein annotation and modelling servers at University College London

Affiliations

Protein annotation and modelling servers at University College London

D W A Buchan et al. Nucleic Acids Res. 2010 Jul.

Abstract

The UCL Bioinformatics Group web portal offers several high quality protein structure prediction and function annotation algorithms including PSIPRED, pGenTHREADER, pDomTHREADER, MEMSAT, MetSite, DISOPRED2, DomPred and FFPred for the prediction of secondary structure, protein fold, protein structural domain, transmembrane helix topology, metal binding sites, regions of protein disorder, protein domain boundaries and protein function, respectively. We also now offer a fully automated 3D modelling pipeline: BioSerf, which performed well in CASP8 and uses a fragment-assembly approach which placed it in the top five servers in the de novo modelling category. The servers are available via the group web site at http://bioinf.cs.ucl.ac.uk/.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The diagram produced by the MEMSAT-SVM algorithm available via the PSIPRED server. (a) A schematic diagram of the transmembrane regions is presented. Directly below this a trace of the Kyte–Doolittle hydropathy plot (18) and below that four traces of the outputs for the four SVMs used to make the transmembrane region assignment are shown. (b) A cartoon of the transmembrane helix topology summarizing the linear coordinates for the helices and indicating where the protein’s extra- and intercellular regions are.
Figure 2.
Figure 2.
The process flow when analysing a sequence using the BioSerf pipeline is shown. An incoming sequence is initially analysed by both PSI-BLAST, PSIPRED and pGenTHREADER. The PSI-BLAST run allows the pipeline to identify whether the query sequences closely matches the sequence of a known structure in the PDB. PSIPRED and pGenTHREADER attempt to identify any remote structural homologues of the sequence should the PSI-BLAST run fail to find any related sequences. These data are then combined and potential template structures for modelling are selected given suitable cut-offs. Where there is at least one suitable template, the pipeline then uses MODELLER to generate a homology model. Where there are no suitable template structures, FRAGFOLD is used to build a possible model structure. A typical run of the pipeline that finds a suitable template structure takes around 10 min. In those cases where no template can be found and FRAGFOLD is used, runs can take up to 3 h.
Figure 3.
Figure 3.
A typical series of analyses using our servers to annotate a protein of unknown function are shown. Putative human membrane protein A8MYE5 was selected for this analysis (a). It was initially submitted to a MEMSAT-SVM prediction to locate any transmembrane helices (b). In the third step, (c), the sequence is submitted to DomPred to locate any possible domain boundaries. As these do not conflict with the transmembrane helix assignment, the sequence is divided into it’s possible constituent domains and those domains that are not transmembrane are submitted to BioSerf for homology modelling (d). The larger domain has a long terminal region consisting of two beta sheets that are likely to be disordered rather than packed as in the prediction. To confirm this possibility, the larger domain is submitted the DISOPRED2 server. The DISOPRED2 prediction (e) confirms that the terminal stretch of the larger domain is likely to be disordered rather than packed onto the structure.
Figure 4.
Figure 4.
The final, putative, annotation of Uniprot A8MYE5 sequence.

Similar articles

Cited by

References

    1. Jones DT. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 1999;292:195–202. - PubMed
    1. Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402. - PMC - PubMed
    1. Jones DT. Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics. 2007;23:538–544. - PubMed
    1. Nugent T, Jones DT. Transmembrane protein topology prediction using support vector machines. BMC Bioinformatics. 2009;10:159. - PMC - PubMed
    1. Ward JJ, McGuffin LJ, Bryson K, Buxton BF, Jones DT. The DISOPRED server for the prediction of protein disorder. Bioinformatics. 2004;20:2138–2139. - PubMed

Publication types