Pairwise statistical significance and empirical determination of effective gap opening penalties for protein local sequence alignment
- PMID: 20063463
- DOI: 10.1504/ijcbdd.2008.022207
Abstract
We evaluate several methods for estimating the pairwise statistical significance of a pairwise local sequence alignment in terms of statistical significance accuracy, and compare pairwise statistical significance with popular database search programs in terms of retrieval accuracy on a benchmark database. Results indicate that pairwise statistical significance computed with standard substitution matrices is significantly better than the database statistical significance reported by BLAST and PSI-BLAST, and that it is comparable to, and at times significantly better than, SSEARCH. We also present an application of pairwise statistical significance to empirically determine effective gap opening penalties for protein local sequence alignment with the widely used BLOSUM matrices.
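To make the idea of pairwise statistical significance concrete, here is a minimal sketch of one common way to estimate it: score the pair with local alignment, shuffle one sequence many times to build a pair-specific null score distribution, fit an extreme value (Gumbel) distribution to those scores, and read off a P-value for the observed score. The abstract does not say which estimation methods the paper evaluates, so everything here is an assumption for illustration: the function names, the shuffling scheme, the method-of-moments fit, and the simple match/mismatch scores with affine gaps used in place of a BLOSUM matrix.

```python
import math
import random

def local_score(a, b, match=2, mismatch=-1, gap_open=3, gap_extend=1):
    """Smith-Waterman local alignment score with affine gaps.

    Assumption: match/mismatch scores stand in for a BLOSUM matrix.
    The first residue of a gap costs gap_open, each further one gap_extend.
    """
    neg = float("-inf")
    h_prev = [0] * (len(b) + 1)   # best scores for the previous row
    f = [neg] * (len(b) + 1)      # best scores ending in a vertical gap
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * (len(b) + 1)
        e = neg                   # best score ending in a horizontal gap
        for j in range(1, len(b) + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f[j] = max(h_prev[j] - gap_open, f[j] - gap_extend)
            s = match if a[i - 1] == b[j - 1] else mismatch
            h_cur[j] = max(0, h_prev[j - 1] + s, e, f[j])
            best = max(best, h_cur[j])
        h_prev = h_cur
    return best

def fit_gumbel(scores):
    """Method-of-moments estimate of Gumbel location (mu) and scale (beta)."""
    n = len(scores)
    mean = sum(scores) / n
    var = sum((s - mean) ** 2 for s in scores) / (n - 1)
    beta = math.sqrt(6.0 * var) / math.pi
    mu = mean - 0.5772156649015329 * beta  # Euler-Mascheroni constant
    return mu, beta

def pairwise_pvalue(a, b, n_shuffles=200, seed=0):
    """Return (observed score, estimated pairwise P-value) for sequences a, b."""
    rng = random.Random(seed)
    observed = local_score(a, b)
    letters = list(b)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)      # keeps the composition and length of b
        null_scores.append(local_score(a, "".join(letters)))
    mu, beta = fit_gumbel(null_scores)
    if beta == 0.0:               # degenerate null: all shuffled scores equal
        return observed, (1.0 if observed <= mu else 0.0)
    z = (observed - mu) / beta
    if z < -700.0:                # avoid overflow in exp(-z)
        return observed, 1.0
    # P(S >= observed) = 1 - exp(-exp(-z)), computed stably with expm1
    return observed, -math.expm1(-math.exp(-z))

if __name__ == "__main__":
    query = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSG"
    other = "GSSGSSGPLVPRGSHMASWSHPQFEKGGGSGGGSGGSAWSHPQFEKTTNP"
    print(pairwise_pvalue(query, query, n_shuffles=100))
    print(pairwise_pvalue(query, other, n_shuffles=100))
```

Because the null distribution is rebuilt for each specific pair, the estimate reflects the lengths and compositions of the two sequences being compared rather than those of a whole database. A maximum-likelihood fit or a censored fit would be a reasonable alternative to the method-of-moments fit shown here.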