How many 3D structures do we need to train a predictor?
- PMID: 19944385
- PMCID: PMC5054404
- DOI: 10.1016/S1672-0229(08)60041-8
How many 3D structures do we need to train a predictor?
Abstract
It has been shown that the progress in the determination of membrane protein structure grows exponentially, with approximately the same growth rate as that of the water-soluble proteins. In order to investigate the effect of this, on the performance of prediction algorithms for both alpha-helical and beta-barrel membrane proteins, we conducted a prospective study based on historical records. We trained separate hidden Markov models with different sized training sets and evaluated their performance on topology prediction for the two classes of transmembrane proteins. We show that the existing top-scoring algorithms for predicting the transmembrane segments of alpha-helical membrane proteins perform slightly better than that of beta-barrel outer membrane proteins in all measures of accuracy. With the same rationale, a meta-analysis of the performance of the secondary structure prediction algorithms indicates that existing algorithmic techniques cannot be further improved by just adding more non-homologous sequences to the training sets. The upper limit for secondary structure prediction is estimated to be no more than 70% and 80% of correctly predicted residues for single sequence based methods and multiple sequence based ones, respectively. Therefore, we should concentrate our efforts on utilizing new techniques for the development of even better scoring predictors.
Figures



Similar articles
-
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7. BMC Bioinformatics. 2005. PMID: 15647112 Free PMC article.
-
Predicting Beta Barrel Transmembrane Proteins Using HMMs.Methods Mol Biol. 2017;1552:43-61. doi: 10.1007/978-1-4939-6753-7_4. Methods Mol Biol. 2017. PMID: 28224490
-
Predicting Alpha Helical Transmembrane Proteins Using HMMs.Methods Mol Biol. 2017;1552:63-82. doi: 10.1007/978-1-4939-6753-7_5. Methods Mol Biol. 2017. PMID: 28224491
-
Topology of membrane proteins-predictions, limitations and variations.Curr Opin Struct Biol. 2018 Jun;50:9-17. doi: 10.1016/j.sbi.2017.10.003. Epub 2017 Nov 5. Curr Opin Struct Biol. 2018. PMID: 29100082 Review.
-
Topology prediction of helical transmembrane proteins: how far have we reached?Curr Protein Pept Sci. 2010 Nov;11(7):550-61. doi: 10.2174/138920310794109184. Curr Protein Pept Sci. 2010. PMID: 20887261 Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources