A model-independent redundancy measure for human versus ChatGPT authorship discrimination using a Bayesian probabilistic approach

Silvia Bozza^#^{1

2}, Claude-Alain Roten^#³, Antoine Jover^{3

4}, Valentina Cammarota⁴, Lionel Pousaz³, Franco Taroni⁴

Affiliations

¹ Ca' Foscari University of Venice, Department of Economics, Venice, 30121, Italy. silvia.bozza@unive.it.
² University of Lausanne, School of Criminal Justice, Lausanne, 1015, Switzerland. silvia.bozza@unive.it.
³ OrphAnalytics SA, Vevey, 1800, Switzerland.
⁴ University of Lausanne, School of Criminal Justice, Lausanne, 1015, Switzerland.

^# Contributed equally.

PMID: 37932415
PMCID: PMC10628141
DOI: 10.1038/s41598-023-46390-8

A model-independent redundancy measure for human versus ChatGPT authorship discrimination using a Bayesian probabilistic approach

Silvia Bozza et al. Sci Rep. 2023.

. 2023 Nov 6;13(1):19217.

doi: 10.1038/s41598-023-46390-8.

Authors

Silvia Bozza^#^{1

2}, Claude-Alain Roten^#³, Antoine Jover^{3

4}, Valentina Cammarota⁴, Lionel Pousaz³, Franco Taroni⁴

Affiliations

¹ Ca' Foscari University of Venice, Department of Economics, Venice, 30121, Italy. silvia.bozza@unive.it.
² University of Lausanne, School of Criminal Justice, Lausanne, 1015, Switzerland. silvia.bozza@unive.it.
³ OrphAnalytics SA, Vevey, 1800, Switzerland.
⁴ University of Lausanne, School of Criminal Justice, Lausanne, 1015, Switzerland.

^# Contributed equally.

PMID: 37932415
PMCID: PMC10628141
DOI: 10.1038/s41598-023-46390-8

Abstract

The academic and scientific world in general is increasingly concerned about their inability to determine and ascertain the identity of the writer of a text. More and more often the question arises as to whether a scientific article or work handed in by a student was actually produced by the alleged author of the questioned text. The role of artificial intelligence (AI) is increasingly debated due to its dangers of undeclared use. A current example is undoubtedly the undeclared use of ChatGPT to write a scientific text. The article promotes an AI model-independent redundancy measure to support discrimination between hypotheses on authorship of various multilingual texts written by humans or produced by intelligence media such as ChatGPT. The syntax of texts written by humans tends to differ from that of texts produced by AIs. This difference can be grasped and quantified even with short texts (i.e. 1800 characters). This aspect of length is extremely important, because short texts imply a greater difficulty of analysis to characterize authorship. To meet the efficiency criteria required for the evaluation of forensic evidence, a probabilistic approach is implemented. In particular, to assess the value of the redundancy measure and to offer a consistent classification criterion, a metric called Bayes factor is implemented. The proposed Bayesian probabilistic method represents an original approach in stylometry. Analyses performed over multilingual texts (English and French) covering different scientific and human areas of interest (forensic science and socio-psycho-artistic topics) reveal the feasibility of a successful authorship discrimination with limited misclassification rates. Model performance is satisfactory even with small sample sizes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Redundancy measure for words uni- (U), bi- (B), tri- (T) and quadri- (Q) grams in the two populations: forensic science papers (FSI, left) and student’s manuscripts (Students, right). A distinction is made for texts written by humans (h), either forensic scientists or students (blue colored boxplots), and text delivered by the artificial intelligence (c), either for scientific papers or students’ texts (red colored boxplots). The first population (FSI) is characterized by texts written in English, while the second one (Students) is characterized by texts written in French.

**Figure 2**
Weights of evidence, log(BF), for every texts written by (1) ChatGPT on forensic (solid red-colored line) and socio-psycho-artistic themes (dashed red-colored line) and by (2) humans scientists (solid blue-colored line) and students (dashed blue-colored line).

See this image and copyright information in PMC

References

1. Bacciu, A. et al. Bot and gender detection of Twitter accounts using distortion and LSA. Working Notes Papers of the CLEF 2019 Evaluation Labs Volume 2380 of CEUR Workshop, Lugano (2019).
1. Rangel, F. & Rosso, P. Overview of the 7th author profiling task at PAN 2019: Bots and gender profiling in Twitter. Working Notes Papers of the CLEF 2019 Evaluation Labs Volume 2380 of CEUR Workshop, Lugano (2019).
1. Espinosa, D. Y., Gómez-Adorno, H. & Sidorov, G. Bots and gender profiling using character bigrams notebook for PAN at CLEF 2019. Lugano (2019).
1. Savoy, J. Machine learning methods for stylometry: authorship attribution and author profiling (Springer, 10.1007/978-3-030-53360-1, 2020).
1. Holmes DI. Authorship attribution. Computers and the Humanities. 1994;28:87–106. doi: 10.1007/BF01830689. - DOI

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A model-independent redundancy measure for human versus ChatGPT authorship discrimination using a Bayesian probabilistic approach

Affiliations

A model-independent redundancy measure for human versus ChatGPT authorship discrimination using a Bayesian probabilistic approach

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources