ChatGPT for Sample-Size Calculation in Sports Medicine and Exercise Sciences: A Cautionary Note

Jabeur Methnani¹,², Imed Latiri³, Ismail Dergaa⁴,⁵,⁶, Karim Chamari⁷, Helmi Ben Saad²,⁸

Affiliations

¹ LR19ES09, Laboratoire de Physiologie de l'Exercice et Physiopathologie: de l'Intégré au Moléculaire "Biologie, Médecine et Santé," Faculty of Medicine of Sousse, University of Sousse, Sousse, Tunisia.
² High Institute of Sport and Physical Education, Ksar Said, University of Manouba, Ksar Said, Tunisia.
³ Research Laboratory LR12SP09 "Heart Failure," Farhat HACHED Hospital, University of Sousse, Sousse, Tunisia.
⁴ Primary Health Care Corporation (PHCC), Doha, Qatar.
⁵ Aspetar, Orthopedic and Sports Medicine Hospital, FIFA Medical Center of Excellence, Doha, Qatar.
⁶ Research Unit Physical Activity, Sport, and Health, UR18JS01, National Observatory of Sport, Tunis, Tunisia.
⁷ High Institute of Sport and Physical Education, University of Sfax, Sfax, Tunisia.
⁸ Service of Physiology and Functional Explorations, Farhat HACHED Hospital, University of Sousse, Sousse, Tunisia.

PMID: 37536678
DOI: 10.1123/ijspp.2023-0109

Jabeur Methnani et al. Int J Sports Physiol Perform. 2023 Aug 3;18(10):1219-1223. doi: 10.1123/ijspp.2023-0109. Print 2023 Oct 1.

Abstract

Purpose: To investigate the accuracy of ChatGPT (Chat generative pretrained transformer), a large language model, in calculating sample size for sport-sciences and sports-medicine research studies.

Methods: We analyzed 4 published papers (ie, examples 1-4) from 3 sport-science and sports-medicine journals, comprising 3 randomized controlled trials and 1 survey paper and covering various study designs and approaches to sample-size calculation. We provided ChatGPT with all necessary data such as mean, percentage SD, normal deviates (Zα/2 and Z1-β), and study design. The prompt from 1 example was subsequently reused to assess the reproducibility of ChatGPT's responses.
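
For orientation, the kinds of inputs listed above (SD, expected difference or proportion, and the normal deviates Zα/2 and Z1-β) feed into closed-form calculations such as the two below. This is a minimal sketch using standard textbook normal-approximation formulas; it is an assumption that these resemble the formulas in the 4 examined papers, which the abstract does not specify. The function names and the example values are hypothetical and chosen only for illustration.

```python
import math
from statistics import NormalDist


def n_per_group_two_means(delta, sd, alpha=0.05, power=0.80):
    """Participants per group for a two-sided comparison of two
    independent means (normal approximation, equal group sizes).

    Textbook formula, assumed here for illustration:
        n = 2 * (Z_{alpha/2} + Z_{1-beta})^2 * sd^2 / delta^2
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # Z_{alpha/2}
    z_beta = NormalDist().inv_cdf(power)           # Z_{1-beta}
    n = 2 * (z_alpha + z_beta) ** 2 * sd ** 2 / delta ** 2
    return math.ceil(n)  # round up to a whole participant


def n_survey_proportion(p, margin, alpha=0.05):
    """Respondents needed to estimate a proportion p within +/- margin
    (normal approximation, no finite-population correction).

    Textbook formula, assumed here for illustration:
        n = Z_{alpha/2}^2 * p * (1 - p) / margin^2
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    n = z_alpha ** 2 * p * (1 - p) / margin ** 2
    return math.ceil(n)


# Hypothetical inputs, not taken from the examined papers.
print(n_per_group_two_means(delta=5, sd=10))   # 63
print(n_survey_proportion(p=0.5, margin=0.05))  # 385
```

Because these functions are deterministic, the same inputs always yield the same sample size, which makes them a convenient cross-check for any figure returned by a language model.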

Results: ChatGPT correctly calculated the sample size for 1 randomized controlled trial but failed in the remaining 3 examples; in the survey paper, it identified the wrong formula. After further interaction with ChatGPT, the correct sample size was obtained for the survey paper. Intriguingly, when the prompt from example 3 was reused, ChatGPT returned a sample size completely different from that in its initial response.

Conclusions: Although artificial-intelligence tools hold great promise, they can produce errors and inconsistencies in sample-size calculations even when given the necessary correct information. As artificial-intelligence technology continues to advance and learn from human feedback, there is hope for improvement in sample-size calculation and other research tasks. Scientists should nonetheless exercise caution when using these tools. Future studies should assess more advanced versions of this tool (ie, ChatGPT4).

Keywords: artificial intelligence; effect size; higher education; large language model; methodology; natural language processing; peer review; power; precision; research; sport science; statistics.
