Combining evidence using p-values: application to sequence homology searches

T L Bailey¹, M Gribskov

Affiliations

PMID: 9520501
DOI: 10.1093/bioinformatics/14.1.48

Combining evidence using p-values: application to sequence homology searches

T L Bailey et al. Bioinformatics. 1998.

. 1998;14(1):48-54.

doi: 10.1093/bioinformatics/14.1.48.

Authors

T L Bailey¹, M Gribskov

Affiliation

¹ San Diego Supercomputer Center, CA 92186-9784, USA.

PMID: 9520501
DOI: 10.1093/bioinformatics/14.1.48

Abstract

Motivation: To illustrate an intuitive and statistically valid method for combining independent sources of evidence that yields a p-value for the complete evidence, and to apply it to the problem of detecting simultaneous matches to multiple patterns in sequence homology searches.

Results: In sequence analysis, two or more (approximately) independent measures of the membership of a sequence (or sequence region) in some class are often available. We would like to estimate the likelihood of the sequence being a member of the class in view of all the available evidence. An example is estimating the significance of the observed match of a macromolecular sequence (DNA or protein) to a set of patterns (motifs) that characterize a biological sequence family. An intuitive way to do this is to express each piece of evidence as a p-value, and then use the product of these p-values as the measure of membership in the family. We derive a formula and algorithm (QFAST) for calculating the statistical distribution of the product of n independent p-values. We demonstrate that sorting sequences by this p-value effectively combines the information present in multiple motifs, leading to highly accurate and sensitive sequence homology searches.

PubMed Disclaimer

Comment in

Concerning the accuracy of MAST E-values.
Bailey TL, Gribskov M. Bailey TL, et al. Bioinformatics. 2000 May;16(5):488-9. doi: 10.1093/bioinformatics/16.5.488. Bioinformatics. 2000. PMID: 10871274 No abstract available.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

Grants and funding

P41 RR-08605/RR/NCRR NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Combining evidence using p-values: application to sequence homology searches

Affiliation

Combining evidence using p-values: application to sequence homology searches

Authors

Affiliation

Abstract

Comment in

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources