Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 2;7(1):10.
doi: 10.1186/s41235-022-00362-0.

How one block of trials influences the next: persistent effects of disease prevalence and feedback on decisions about images of skin lesions in a large online study

Affiliations

How one block of trials influences the next: persistent effects of disease prevalence and feedback on decisions about images of skin lesions in a large online study

Jeremy M Wolfe. Cogn Res Princ Implic. .

Abstract

Using an online, medical image labeling app, 803 individuals rated images of skin lesions as either "melanoma" (skin cancer) or "nevus" (a skin mole). Each block consisted of 80 images. Blocks could have high (50%) or low (20%) target prevalence and could provide full, accurate feedback or no feedback. As in prior work, with feedback, decision criteria were more conservative at low prevalence than at high prevalence and resulted in more miss errors. Without feedback, this low prevalence effect was reversed (albeit, not significantly). Participants could participate in up to four different conditions a day on each of 6 days. Our main interest was in the effect of Block N on Block N + 1. Low prevalence with feedback made participants more conservative on a subsequent block. High prevalence with feedback made participants more liberal on a subsequent block. Conditions with no feedback had no significant impact on the subsequent block. The delay between Blocks 1 and 2 had no significant effect. The effect on the second half of Block 2 was just as large as on the first half. Medical expertise (over the range available in the study) had no impact on these effects, though medical students were better at the task than other groups. Overall, these seem to be robust effects where feedback may be 'teaching' participants how to respond in the future. This might have application in, for example, training or re-training situations.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Sample stimulus displays. Left: participants are asked if a lesion is a melanoma (cancer) or a nevus (benign ‘mole’). Right: after response, if this were a feedback condition, red or green feedback would inform the participant of the correctness of the answer. In no feedback conditions, neutral feedback indicated that the response had been registered
Fig. 2
Fig. 2
D′ and criterion as a function of condition. Each dot represents one of 2080 blocks of data. Black lines show mean and ± 95% CI of the mean
Fig. 3
Fig. 3
Median response time as a function of block type. RTs < 11 s were analyzed. A RT distributions with the graph truncated at 3 s for display purposes. Black lines show the mean. B Means of the median RTs on a much finer scale. Error bars show ± 95% confidence intervals
Fig. 4
Fig. 4
Change in d′ on block 2 as a function of the nature of block 1. p values show results for simple t-tests, testing against the null hypothesis that there is no change in d′. Black lines show means ± 95% confidence intervals
Fig. 5
Fig. 5
Change in criterion on block 2 as a function of the nature of block 1. P values show results for simple t-tests, testing against the null hypothesis that there is no change in criterion. Black lines show means ± 95% confidence intervals
Fig. 6
Fig. 6
Change in criterion as a function of time between the start of block 1 and start of block 2. 1440 min = 1 day. Colored lines are best-fit linear regressions for each of 16 pairs of block types. A All data, B Delay < 1 day, C Delay < 2 h, D Delay < 15 min. Repeat pairs (e.g., Low Feedback → Low Feedback) do not occur for shorter delays (C, D)
Fig. 7
Fig. 7
Change in criterion on block 2 as a function of the nature of block 1. Data restricted to pairs separated by more than 2 h
Fig. 8
Fig. 8
Mean change in criterion for the first and second halves of a block of trials
Fig. 9
Fig. 9
Effects of expertise category on d′ and crit, regardless of prevalence and/or feedback
Fig. 10
Fig. 10
Hypothetical curves showing the results of the feedback and no feedback conditions as two points on continuous functions

References

    1. Codella, N., Rotemberg, V., Tschandl, P., Celebi, M. E., Dusza, S., Gutman, D., et al. (2019). Skin lesion analysis toward melanoma detection 2018: A challenge hosted by the International Skin Imaging Collaboration (ISIC). arXiv:1902.03368.
    1. Cox PH, Kravitz DJ, Mitroff SR. Great expectations: Minor differences in initial instructions have a major impact on visual search in the absence of feedback. Cognitive Research: Principles and Implications (CRPI) 2021;6(1):19. doi: 10.1186/s41235-021-00286-1. - DOI - PMC - PubMed
    1. Evans KK, Birdwell RL, Wolfe JM. If you don’t find it often, you often don’t find it: Why some cancers are missed in breast cancer screening. PLoS ONE. 2013;8(5):366. doi: 10.1371/journal.pone.0064366. - DOI - PMC - PubMed
    1. Evans KK, Tambouret R, Wilbur DC, Evered A, Wolfe JM. Prevalence of abnormalities influences cytologists' error rates in screening for cervical cancer. Archives of Pathology & Laboratory Medicine. 2011;135(12):1557–1560. doi: 10.5858/arpa.2010-0739-OA. - DOI - PMC - PubMed
    1. Evered A. The prevalence problem in the era of human papillomavirus screening. Cytopathology. 2017;29:97–99. doi: 10.1111/cyt.12488. - DOI - PubMed

Publication types