Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul 15;26(14):1708-13.
doi: 10.1093/bioinformatics/btq270. Epub 2010 May 26.

Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics

Affiliations

Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics

Hyrum D Carroll et al. Bioinformatics. .

Abstract

Motivation: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have used the pooled receiver operating characteristic for n irrelevant records (ROC(n)) score, the area under the ROC curve (AUC) of a 'pooled' ROC curve, truncated at n irrelevant records. Unfortunately, the pooled ROC(n) score does not faithfully reflect actual usage of retrieval algorithms. Additionally, a pooled ROC(n) score can be very sensitive to retrieval results from as little as a single query.

Methods: To replace the pooled ROC(n) score, we propose the Threshold Average Precision (TAP-k), a measure closely related to the well-known average precision in information retrieval, but reflecting the usage of E-values in bioinformatics. Furthermore, in addition to conditions previously given in the literature, we introduce three new criteria that an ideal measure of retrieval efficacy should satisfy.

Results: PSI-BLAST, GLOBAL, HMMER and RPS-BLAST provided examples of using the TAP-k and pooled ROC(n) scores to evaluate sequence retrieval algorithms. In particular, compelling examples using real data highlight the drawbacks of the pooled ROC(n) score, showing that it can produce evaluations skewing far from intuitive expectations. In contrast, the TAP-k satisfies most of the criteria desired in an ideal measure of retrieval efficacy.

Availability and implementation: The TAP-k web server and downloadable Perl script are freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html.ncbi/tap/

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
An example of a PR graph and TAP curve. The E-values at each point are represented by the colors on the bar beneath.
Fig. 2.
Fig. 2.
Example retrieval list with relevant (blue ‘R’s) and irrelevant (red ‘I’s) records illustrating the j(E0)-th relevant record, TPIRs and the sentinel record.
Fig. 3.
Fig. 3.
The distribution of EPQ versus E-value for PSI-BLAST retrieval over all queries in DB_344_Pfam. The dashed green line indicates the mean EPQ; the solid red line, the median EPQ; the top and bottom of the blue boxes, the first and third quartiles of the EPQ distribution; and the top and bottom whiskers, the maximum and minimum EPQ over all queries.
Fig. 4.
Fig. 4.
PSI-BLAST retrieval results for the homoserine dehydrogenase Pfam family searching in DB_344_Pfam. (a) Individual ROC50 curves, along with the corresponding pooled ROC50. Note that the pooled ROC curve is lower than both of the queries. This same condition continues until 203 irrelevant records. (b) PR curves (and their average) for the same retrieval results. The TAP for each is the AP (with the precision of last record repeated).
Fig. 5.
Fig. 5.
TAP curves against the E-value threshold E0, for searching DB_331_CDD with each query from DB_8920_PDB in turn. Retrieval results for GLOBAL are represented with a solid green line; for HMMER_semi-global, with a long-dashed blue line; for HMMER_local, with a medium-dashed red line; and for RPS-BLAST, with a dotted black line. The arrows indicate the maximum TAP and its E-value threshold E0*(A) for each algorithm A.

Similar articles

  • Biotool2Web: creating simple Web interfaces for bioinformatics applications.
    Shahid M, Alam I, Fuellen G. Shahid M, et al. Appl Bioinformatics. 2006;5(1):63-6. doi: 10.2165/00822942-200605010-00009. Appl Bioinformatics. 2006. PMID: 16539540
  • BIAS: Bioinformatics Integrated Application Software.
    Finak G, Godin N, Hallett M, Pepin F, Rajabi Z, Srivastava V, Tang Z. Finak G, et al. Bioinformatics. 2005 Apr 15;21(8):1745-6. doi: 10.1093/bioinformatics/bti170. Epub 2004 Nov 30. Bioinformatics. 2005. PMID: 15572471
  • BioDownloader: bioinformatics downloads and updates in a few clicks.
    Shapovalov MV, Canutescu AA, Dunbrack RL Jr. Shapovalov MV, et al. Bioinformatics. 2007 Jun 1;23(11):1437-9. doi: 10.1093/bioinformatics/btm120. Epub 2007 May 5. Bioinformatics. 2007. PMID: 17483505
  • Interoperability with Moby 1.0--it's better than sharing your toothbrush!
    BioMoby Consortium; Wilkinson MD, Senger M, Kawas E, Bruskiewich R, Gouzy J, Noirot C, Bardou P, Ng A, Haase D, Saiz Ede A, Wang D, Gibbons F, Gordon PM, Sensen CW, Carrasco JM, Fernández JM, Shen L, Links M, Ng M, Opushneva N, Neerincx PB, Leunissen JA, Ernst R, Twigger S, Usadel B, Good B, Wong Y, Stein L, Crosby W, Karlsson J, Royo R, Párraga I, Ramírez S, Gelpi JL, Trelles O, Pisano DG, Jimenez N, Kerhornou A, Rosset R, Zamacola L, Tarraga J, Huerta-Cepas J, Carazo JM, Dopazo J, Guigo R, Navarro A, Orozco M, Valencia A, Claros MG, Pérez AJ, Aldana J, Rojano M, Fernandez-Santa Cruz R, Navas I, Schiltz G, Farmer A, Gessler D, Schoof H, Groscurth A. BioMoby Consortium, et al. Brief Bioinform. 2008 May;9(3):220-31. doi: 10.1093/bib/bbn003. Epub 2008 Jan 31. Brief Bioinform. 2008. PMID: 18238804 Review.
  • Automation of in-silico data analysis processes through workflow management systems.
    Romano P. Romano P. Brief Bioinform. 2008 Jan;9(1):57-68. doi: 10.1093/bib/bbm056. Epub 2007 Dec 2. Brief Bioinform. 2008. PMID: 18056132 Review.

Cited by

References

    1. Bamber D. Area above ordinal dominance graph and area below receiver operating characteristic graph. J. Math. Psychol. 1975;12:387–415.
    1. Berman H, et al. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 2007;35:D301–303. - PMC - PubMed
    1. Brenner SE, et al. Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. Proc. Natl Acad. Sci. USA. 1998;95:6073–6078. - PMC - PubMed
    1. Chen Z. Assessing sequence comparison methods with the average precision criterion. Bioinformatics. 2003;19:2456–2460. - PubMed
    1. Davis J, Goadrich M. Proceedings of the 23rd International Conference on Machine learning. Madison, Wisconsin, USA: ACM; 2006. The Relationship Between Precision-Recall and ROC Curves; pp. 233–240.

Publication types