Med Educ. 1999 Apr;33(4):267-75.
doi: 10.1046/j.1365-2923.1999.00292.x.

The effect of a 'don't know' option on test scores: number-right and formula scoring compared

A M Muijtjens et al. Med Educ. 1999 Apr.

Abstract

Objectives: In multiple-choice tests with a 'don't-know' option, the number of correct minus incorrect answers was used as the test score (formula scoring) to reduce the measurement error caused by random guessing. The literature reports diverging results when formula scoring is compared with number-right scoring, the scoring method without the don't-know option.

Design: To investigate which method was most appropriate, both scoring methods were used in true-false tests (block tests) taken at the end of a second- and third-year educational module (block). The students were asked to answer each item initially by choosing from the response options true, false or don't know, and secondly to replace all don't-know answers by a true-false answer.
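The two scoring rules described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the single-character answer encoding ('T', 'F', '?') and the six-item example are assumptions made only for illustration.

```python
def formula_score(answers, key, dont_know="?"):
    """Correct minus incorrect; items answered 'don't know' count as zero."""
    correct = sum(a == k for a, k in zip(answers, key) if a != dont_know)
    incorrect = sum(a != k for a, k in zip(answers, key) if a != dont_know)
    return correct - incorrect


def number_right(answers, key):
    """Number of correct answers when every item has a true-false answer."""
    return sum(a == k for a, k in zip(answers, key))


# Hypothetical six-item true-false test.
key = "TFTTFF"
first_pass = "TF?F?F"    # initial answers, with two don't-know responses
second_pass = "TFTFTF"   # don't-know answers replaced by true-false guesses

print(formula_score(first_pass, key))   # 3 correct - 1 incorrect = 2
print(number_right(second_pass, key))   # 4 correct
```

Scoring the first pass with `formula_score` and the second pass with `number_right` mirrors the two-step answering procedure, so both scores come from the same student on the same items.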

Setting: Maastricht University, The Netherlands.

Subjects: Medical students.

Results: The correct scores for the don't-know answered items were found to be 4.5% and 5.9%, respectively, higher than expected with pure random guesswork. This represents a source of bias with formula scoring, because students who were less willing to guess obtained lower scores. The average difference in the correct minus incorrect score for the two scoring methods (2.5%, P < 0.001, and 3.4%, P < 0.001, respectively) indicates the size of the bias (for comparison, the standard deviation of the score equals 11%). Test reliability was higher with formula scoring (0.72 vs. 0.66 and 0.74 vs. 0.66), but the difference decreased when the test was restricted to items which were close to the core content of the block (0.81 vs. 0.77 and 0.75 vs. 0.70, respectively).
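The source of the bias can be made concrete with a short worked sketch. On a true-false item, a pure random guess is correct with probability 0.5, so its expected contribution to a correct minus incorrect score is zero. Any probability above 0.5 gives a positive expected gain to students who guess rather than answer don't know. The function name and the values of 20 items and a probability of 0.55 are hypothetical, chosen only for illustration; they are not figures from the study.

```python
def expected_gain(n_items, p_correct):
    """Expected correct-minus-incorrect gain from guessing on n_items
    true-false items, each answered correctly with probability p_correct."""
    return n_items * (p_correct - (1.0 - p_correct))


# Pure random guessing: no expected gain, so abstaining costs nothing.
print(expected_gain(20, 0.5))

# Better-than-chance guessing (hypothetical p = 0.55): a positive expected gain,
# which a student who is less willing to guess forgoes under formula scoring.
print(round(expected_gain(20, 0.55), 2))
```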

Conclusions: In deciding which scoring method to use, less bias (number-right scoring) has to be weighed against higher reliability (formula scoring). Apart from these psychometric reasons, educational factors must be considered.
