Hardware-accelerated protein identification for mass spectrometry
- PMID: 15723443
- DOI: 10.1002/rcm.1853
Abstract
An ongoing issue in mass spectrometry is the time it takes to search DNA sequences with MS/MS peptide fragments (see, e.g., Choudary et al., Proteomics 2001; 1: 651-667). Search times are far longer than spectral acquisition times, and parallelizing search software on clusters requires doubling the size of a conventional computing cluster to cut the search time in half. Field-programmable gate arrays (FPGAs) are used to create hardware-accelerated algorithms that reduce operating costs and improve search speed compared to large clusters. We present a novel hardware design that takes full spectra and computes 6-frame translation word searches on DNA databases at a rate of approximately 3 billion base pairs per second, with queries of up to 10 amino acids in length and arbitrary wildcard positions. Hardware post-processing identifies in silico tryptic peptides and scores them using a variety of techniques, including mass frequency expected values. With faster FPGAs, protein identifications from the human genome can be achieved in less than a second, which makes the design an ideal solution for a number of proteome-scale applications.
Copyright 2005 John Wiley & Sons, Ltd.
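As a software illustration of the operations named in the abstract (6-frame translation, a wildcard word search over the translated frames, and in silico tryptic digestion), the following is a minimal sketch only. It is not the authors' hardware design. All function names, the `?` wildcard symbol, and the "no cleavage before proline" trypsin rule are assumptions introduced here for illustration. The codon table is the standard genetic code, and the splitting step relies on Python 3.7 or later.

```python
import re
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")
MAX_QUERY_LEN = 10  # query length limit quoted in the abstract


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate one reading frame; stops become '*', unknown codons 'X'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna):
    """Return [(frame_label, protein)] for frames +1..+3 and -1..-3."""
    dna = dna.upper()
    rev = reverse_complement(dna)
    frames = []
    for shift in range(3):
        frames.append((f"+{shift + 1}", translate(dna[shift:])))
        frames.append((f"-{shift + 1}", translate(rev[shift:])))
    return frames


def find_word(dna, query, wildcard="?"):
    """Search all six frames for an amino-acid word with wildcard positions.

    The wildcard symbol is a hypothetical choice; it matches any residue
    letter but never a stop ('*'). Returns (frame, residue_offset, match).
    """
    if not 0 < len(query) <= MAX_QUERY_LEN:
        raise ValueError("query must be 1 to 10 residues long")
    pattern = "".join("[A-Z]" if ch == wildcard else re.escape(ch)
                      for ch in query)
    regex = re.compile(f"(?=({pattern}))")  # lookahead keeps overlapping hits
    return [(label, m.start(), m.group(1))
            for label, protein in six_frames(dna)
            for m in regex.finditer(protein)]


def tryptic_peptides(protein):
    """Split a translated frame at stops, then cleave after K or R.

    Skipping cleavage before P is a common convention, assumed here.
    """
    peptides = []
    for orf in protein.split("*"):
        peptides.extend(p for p in re.split(r"(?<=[KR])(?!P)", orf) if p)
    return peptides
```

For example, `find_word("ATGGCCAAGCGTTGGTAA", "A?R")` reports a hit in frame `+1` at residue offset 1, because that frame translates to `MAKRW*`. A software scan like this is sequential per frame; the throughput figure in the abstract refers to the FPGA implementation, not to code of this kind.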
Cited by
- High-performance hardware implementation of a parallel database search engine for real-time peptide mass fingerprinting. Bioinformatics. 2008 Jul 1;24(13):1498-502. doi: 10.1093/bioinformatics/btn216. Epub 2008 May 3. PMID: 18453553. Free PMC article.
- Accelerating string set matching in FPGA hardware for bioinformatics research. BMC Bioinformatics. 2008 Apr 15;9:197. doi: 10.1186/1471-2105-9-197. PMID: 18412963. Free PMC article.