Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 May-Jun;12(3):296-8.
doi: 10.1197/jamia.M1733. Epub 2005 Jan 31.

Agreement, the f-measure, and reliability in information retrieval

Affiliations

Agreement, the f-measure, and reliability in information retrieval

George Hripcsak et al. J Am Med Inform Assoc. 2005 May-Jun.

Abstract

Information retrieval studies that involve searching the Internet or marking phrases usually lack a well-defined number of negative cases. This prevents the use of traditional interrater reliability metrics like the kappa statistic to assess the quality of expert-generated gold standards. Such studies often quantify system performance as precision, recall, and F-measure, or as agreement. It can be shown that the average F-measure among pairs of experts is numerically identical to the average positive specific agreement among experts and that kappa approaches these measures as the number of negative cases grows large. Positive specific agreement-or the equivalent F-measure-may be an appropriate way to quantify interrater reliability and therefore to assess the reliability of a gold standard in these studies.

PubMed Disclaimer

References

    1. Hersh WR. Information retrieval: a health care prospective. New York: Springer, 1995, pp 45–50.
    1. Hripcsak G, Wilcox A. Reference standards, judges, comparison subjects: roles for experts in evaluating system performance. J Am Med Inform Assoc. 2002;9:1–15. - PMC - PubMed
    1. Friedman CP, Wyatt JC. Evaluation methods in medical informatics. New York: Springer, 1997.
    1. Fleiss JL. Statistical methods for rates and proportions. , 2nd ed. New York: John Wiley & Sons, 1981, pp 212–36.
    1. Uebersax JS. [cited 2005 March 23]. Available from: http://ourworld.compuserve.com/homepages/jsuebersax/agree.htm/.

Publication types

MeSH terms