Protein annotation and modelling servers at University College London

D W A Buchan¹, S M Ward, A E Lobley, T C O Nugent, K Bryson, D T Jones

Affiliations

PMID: 20507913
PMCID: PMC2896093
DOI: 10.1093/nar/gkq427

Protein annotation and modelling servers at University College London

D W A Buchan et al. Nucleic Acids Res. 2010 Jul.

. 2010 Jul;38(Web Server issue):W563-8.

doi: 10.1093/nar/gkq427. Epub 2010 May 27.

Authors

D W A Buchan¹, S M Ward, A E Lobley, T C O Nugent, K Bryson, D T Jones

Affiliation

¹ Bioinformatics Group, University College London, Gower Street, London, WC1E 6BT, UK. d.buchan@cs.ucl.ac.uk

PMID: 20507913
PMCID: PMC2896093
DOI: 10.1093/nar/gkq427

Abstract

The UCL Bioinformatics Group web portal offers several high quality protein structure prediction and function annotation algorithms including PSIPRED, pGenTHREADER, pDomTHREADER, MEMSAT, MetSite, DISOPRED2, DomPred and FFPred for the prediction of secondary structure, protein fold, protein structural domain, transmembrane helix topology, metal binding sites, regions of protein disorder, protein domain boundaries and protein function, respectively. We also now offer a fully automated 3D modelling pipeline: BioSerf, which performed well in CASP8 and uses a fragment-assembly approach which placed it in the top five servers in the de novo modelling category. The servers are available via the group web site at http://bioinf.cs.ucl.ac.uk/.

PubMed Disclaimer

Figures

**Figure 1.**
The diagram produced by the MEMSAT-SVM algorithm available via the PSIPRED server. (a) A schematic diagram of the transmembrane regions is presented. Directly below this a trace of the Kyte–Doolittle hydropathy plot (18) and below that four traces of the outputs for the four SVMs used to make the transmembrane region assignment are shown. (b) A cartoon of the transmembrane helix topology summarizing the linear coordinates for the helices and indicating where the protein’s extra- and intercellular regions are.

**Figure 2.**
The process flow when analysing a sequence using the BioSerf pipeline is shown. An incoming sequence is initially analysed by both PSI-BLAST, PSIPRED and pGenTHREADER. The PSI-BLAST run allows the pipeline to identify whether the query sequences closely matches the sequence of a known structure in the PDB. PSIPRED and pGenTHREADER attempt to identify any remote structural homologues of the sequence should the PSI-BLAST run fail to find any related sequences. These data are then combined and potential template structures for modelling are selected given suitable cut-offs. Where there is at least one suitable template, the pipeline then uses MODELLER to generate a homology model. Where there are no suitable template structures, FRAGFOLD is used to build a possible model structure. A typical run of the pipeline that finds a suitable template structure takes around 10 min. In those cases where no template can be found and FRAGFOLD is used, runs can take up to 3 h.

**Figure 3.**
A typical series of analyses using our servers to annotate a protein of unknown function are shown. Putative human membrane protein A8MYE5 was selected for this analysis (a). It was initially submitted to a MEMSAT-SVM prediction to locate any transmembrane helices (b). In the third step, (c), the sequence is submitted to DomPred to locate any possible domain boundaries. As these do not conflict with the transmembrane helix assignment, the sequence is divided into it’s possible constituent domains and those domains that are not transmembrane are submitted to BioSerf for homology modelling (d). The larger domain has a long terminal region consisting of two beta sheets that are likely to be disordered rather than packed as in the prediction. To confirm this possibility, the larger domain is submitted the DISOPRED2 server. The DISOPRED2 prediction (e) confirms that the terminal stretch of the larger domain is likely to be disordered rather than packed onto the structure.

**Figure 4.**
The final, putative, annotation of Uniprot A8MYE5 sequence.

See this image and copyright information in PMC

References

1. Jones DT. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 1999;292:195–202. - PubMed
1. Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402. - PMC - PubMed
1. Jones DT. Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics. 2007;23:538–544. - PubMed
1. Nugent T, Jones DT. Transmembrane protein topology prediction using support vector machines. BMC Bioinformatics. 2009;10:159. - PMC - PubMed
1. Ward JJ, McGuffin LJ, Bryson K, Buxton BF, Jones DT. The DISOPRED server for the prediction of protein disorder. Bioinformatics. 2004;20:2138–2139. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

BB/E023533/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Protein annotation and modelling servers at University College London

Affiliation

Protein annotation and modelling servers at University College London

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous