GPT detectors are biased against non-native English writers

Weixin Liang¹, Mert Yuksekgonul¹, Yining Mao², Eric Wu², James Zou¹ ² ³

Affiliations

¹ Department of Computer Science, Stanford University, Stanford, CA, USA.
² Department of Electrical Engineering, Stanford University, Stanford, CA, USA.
³ Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.

PMID: 37521038
PMCID: PMC10382961
DOI: 10.1016/j.patter.2023.100779

Weixin Liang et al. Patterns (N Y). 2023 Jul 10;4(7):100779. doi: 10.1016/j.patter.2023.100779. eCollection 2023 Jul 14.

Abstract

GPT detectors frequently misclassify non-native English writing as AI generated, raising concerns about fairness and robustness. Addressing the biases in these detectors is crucial to prevent the marginalization of non-native English speakers in evaluative and educational settings and to create a more equitable digital landscape.

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Bias in GPT detectors against non-native English writing samples. High misclassification of TOEFL essays written by non-native English authors as AI generated, with near-perfect accuracy for US eighth-grade essays. Improved word choice in TOEFL essays reduces misclassification (prompt: “Enhance the word choices to sound more like that of a native speaker”), while simplification of US eighth-grade essays increases misclassification (prompt: “Simplify word choices as if written by a non-native speaker”). Performance averaged across seven widely used GPT detectors. The error bars represent the standard deviation across the seven detectors.

**Figure 2**
Simple prompts effectively bypass GPT detectors. Detection rates for ChatGPT-3.5-generated college essays and scientific abstracts drop significantly with a self-edit prompt (e.g., “Elevate the provided text by employing literary language”). Performance averaged across seven widely used GPT detectors. The error bars represent the standard deviation across the seven detectors.
