Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 10;4(7):100779.
doi: 10.1016/j.patter.2023.100779. eCollection 2023 Jul 14.

GPT detectors are biased against non-native English writers

Affiliations

GPT detectors are biased against non-native English writers

Weixin Liang et al. Patterns (N Y). .

Abstract

GPT detectors frequently misclassify non-native English writing as AI generated, raising concerns about fairness and robustness. Addressing the biases in these detectors is crucial to prevent the marginalization of non-native English speakers in evaluative and educational settings and to create a more equitable digital landscape.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Bias in GPT detectors against non-native English writing samples High misclassification of TOEFL essays written by non-native English authors as AI generated, with near-perfect accuracy for US eighth-grade essays. Improved word choice in TOEFL essays reduces misclassification (prompt: “Enhance the word choices to sound more like that of a native speaker”), while simplification of US eighth-grade essays increases misclassification (prompt: “Simplify word choices as if written by a non-native speaker”). Performance averaged across seven widely used GPT detectors. The error bars represent the standard deviation across the seven detectors.
Figure 2
Figure 2
Simple prompts effectively bypass GPT detectors Detection rates for ChatGPT-3.5-generated college essays and scientific abstracts drop significantly with a self-edit prompt (e.g., “Elevate the provided text by employing literary language”). Performance averaged across seven widely used GPT detectors. The error bars represent the standard deviation across the seven detectors.

References

    1. Mollman S. Yahoo! Finance; 2022. ChatGPT gained 1 million users in under a week. Here’s why the AI chatbot is primed to disrupt search as we know it.https://www.yahoo.com/video/chatgpt-gained-1-million-followers-224523258...
    1. Else H. Abstracts written by ChatGPT fool scientists. Nature. 2023;613:423. - PubMed
    1. Heikkilä M. How to spot AI-generated text. MIT Technol. Rev. 2022 https://www.technologyreview.com/2022/12/19/1065596/how-to-spot-ai-gener...
    1. Fowler G.A. The Washington Post; 2023. We tested a new ChatGPT-detector for teachers. It flagged an innocent student.https://www.washingtonpost.com/technology/2023/04/01/chatgpt-cheating-de...
    1. Liang W., Yuksekgonul M., Mao Y., Wu E., Zou J. GPT detectors are biased against non-native English writers. arXiv. 2023 doi: 10.48550/arXiv.2304.02819. https://arxiv.org/abs/2304.02819 Preprint at. - DOI - PMC - PubMed

LinkOut - more resources