TOPS++FATCAT: fast flexible structural alignment using constraints derived from TOPS+ Strings Model
- PMID: 18759993
- PMCID: PMC2553092
- DOI: 10.1186/1471-2105-9-358
Abstract
Background: Protein structure analysis and comparison are major challenges in structural bioinformatics. Despite the existence of many tools and algorithms, very few of them have managed to capture the intuitive understanding of protein structures developed in structural biology, especially in the context of rapid database searches. Such intuitions could help speed up similarity searches and make it easier to understand the results of such analyses.
Results: We developed the TOPS++FATCAT algorithm, which uses an intuitive description of protein structures, as captured in the popular TOPS diagrams, to limit the search space of aligned fragment pairs (AFPs) in the flexible alignment of protein structures performed by the FATCAT algorithm. The TOPS++FATCAT algorithm is faster than FATCAT by more than an order of magnitude, with a minimal cost in classification and alignment accuracy. For beta-rich proteins its accuracy is better than that of FATCAT, because the TOPS+ strings model contains important information about the parallel and anti-parallel hydrogen-bond patterns between the beta-strand secondary structure elements (SSEs). We show that the TOPS++FATCAT errors, rare as they are, can be clearly linked to oversimplifications in the TOPS diagrams and can be corrected by developing more precise secondary structure element definitions.
Software availability: The benchmark analysis results and a compressed archive of the TOPS++FATCAT program for the Linux platform can be downloaded from http://fatcat.burnham.org/TOPS/
Conclusion: TOPS++FATCAT provides FATCAT accuracy and insights into protein structural changes at a speed comparable to that of sequence alignments, opening up the possibility of interactive protein structure similarity searches.
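The search-space restriction described in the Results can be illustrated with a minimal Python sketch. This is not the TOPS++FATCAT implementation: the per-residue SSE strings, the `matched_pairs` list (standing in for a topology-level match between SSEs, such as one derived from TOPS+ strings), the fragment length, and all function names are assumptions made for illustration. The sketch only shows the general idea of discarding candidate AFPs that do not fall on matched SSEs before any expensive 3D comparison is attempted.

```python
def sse_segments(sse_string):
    """Split a per-residue SSE string into (type, start, end) runs.

    '-' marks residues outside any SSE; 'end' is exclusive.
    """
    segments = []
    start = 0
    for i in range(1, len(sse_string) + 1):
        if i == len(sse_string) or sse_string[i] != sse_string[start]:
            if sse_string[start] != "-":
                segments.append((sse_string[start], start, i))
            start = i
    return segments


def candidate_afps(len_a, len_b, length):
    """Enumerate every (i, j) start pair for fragments of the given length."""
    return [(i, j)
            for i in range(len_a - length + 1)
            for j in range(len_b - length + 1)]


def overlaps(start, length, segment):
    """True if the fragment [start, start + length) overlaps the segment."""
    _, seg_start, seg_end = segment
    return start < seg_end and start + length > seg_start


def restrict_afps(afps, segs_a, segs_b, matched_pairs, length):
    """Keep only AFPs whose two fragments overlap a matched pair of SSEs.

    matched_pairs is a hypothetical input: (index in segs_a, index in segs_b)
    pairs produced by some topology-level comparison.
    """
    kept = []
    for i, j in afps:
        if any(overlaps(i, length, segs_a[k]) and overlaps(j, length, segs_b[m])
               for k, m in matched_pairs):
            kept.append((i, j))
    return kept


# Toy input: one strand followed by one helix in each structure.
sse_a = "--EEEEEE----HHHHHHHH--"
sse_b = "EEEEEE--HHHHHHHH"
length = 4  # assumed fragment length for this toy example

segs_a = sse_segments(sse_a)
segs_b = sse_segments(sse_b)
matched_pairs = [(0, 0), (1, 1)]  # assumed: strand matches strand, helix matches helix

all_afps = candidate_afps(len(sse_a), len(sse_b), length)
kept_afps = restrict_afps(all_afps, segs_a, segs_b, matched_pairs, length)
print(len(all_afps), "candidate AFPs before filtering;", len(kept_afps), "after")
```

In this toy setting the filter removes pairs such as a strand fragment in one structure paired with a helix fragment in the other, which is the kind of pruning that lets the later, more expensive alignment stage consider far fewer candidates.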