Can AI Answer My Questions? Utilizing Artificial Intelligence in the Perioperative Assessment for Abdominoplasty Patients
- PMID: 38898239
- PMCID: PMC11645314
- DOI: 10.1007/s00266-024-04157-0
Abstract
Background: Abdominoplasty is a common operation, used for a range of cosmetic and functional issues, often in the context of divarication of recti, significant weight loss, and after pregnancy. Despite this, patient-surgeon communication gaps can hinder informed decision-making. The integration of large language models (LLMs) in healthcare offers potential for enhancing patient information. This study evaluated the feasibility of using LLMs for answering perioperative queries.
Methods: This study assessed the efficacy of four leading LLMs (OpenAI's ChatGPT-3.5, Anthropic's Claude, Google's Gemini, and Bing's CoPilot) using fifteen unique prompts. All outputs were assessed for readability using the Flesch-Kincaid Grade Level, the Flesch Reading Ease score, and the Coleman-Liau index. Quality was evaluated using the DISCERN score and a Likert scale. Scores were assigned by two plastic surgical residents and then reviewed and discussed by five plastic surgeon specialists until a consensus was reached.
Results: ChatGPT-3.5 required the highest reading level for comprehension, followed by Gemini, Claude, then CoPilot. Claude provided the most appropriate and actionable advice. In terms of patient-friendliness, CoPilot outperformed the rest, enhancing engagement and information comprehensiveness. ChatGPT-3.5 and Gemini offered adequate, though unremarkable, advice and used more professional language. CoPilot uniquely included visual aids and was the only model to use hyperlinks, although these were neither very helpful nor acceptable, and it faced limitations in responding to certain queries.
Conclusion: ChatGPT-3.5, Gemini, Claude, and Bing's CoPilot showcased differences in readability and reliability. LLMs offer unique advantages for patient care but require careful selection. Future research should integrate LLM strengths and address weaknesses for optimal patient education.
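The three readability indices named in the Methods are defined by standard published formulas, and a short sketch may help readers see what each one measures. The abstract does not state which tool the authors used, so the Python below is an illustration only: the function names `count_syllables` and `readability` are hypothetical, and the syllable counter is a rough vowel-group heuristic that will differ from dictionary-based tools.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption, not the study's method):
    count vowel groups and discount a trailing silent 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def readability(text: str) -> dict:
    """Compute three standard readability indices for a block of English text."""
    # Naive sentence and word segmentation; real tools handle abbreviations better.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    n_sentences = len(sentences)
    n_words = len(words)
    n_letters = sum(ch.isalpha() for w in words for ch in w)
    n_syllables = sum(count_syllables(w) for w in words)

    words_per_sentence = n_words / n_sentences
    syllables_per_word = n_syllables / n_words
    letters_per_100_words = n_letters / n_words * 100
    sentences_per_100_words = n_sentences / n_words * 100

    return {
        # Higher score means easier text.
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * syllables_per_word,
        # Approximate US school grade needed to understand the text.
        "flesch_kincaid_grade": 0.39 * words_per_sentence
        + 11.8 * syllables_per_word
        - 15.59,
        # Grade-level estimate based on letters rather than syllables.
        "coleman_liau_index": 0.0588 * letters_per_100_words
        - 0.296 * sentences_per_100_words
        - 15.8,
    }


if __name__ == "__main__":
    sample = "Abdominoplasty is a common operation. Recovery usually takes several weeks."
    for name, value in readability(sample).items():
        print(f"{name}: {value:.2f}")
```

Under these formulas, a lower Flesch Reading Ease score and a higher grade-level score both indicate text that is harder to read, which is the sense in which one model's output can require a higher level for comprehension than another's.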
Level of Evidence V: This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266.
Keywords: AI; Abdominoplasty; ChatGPT; LLM; Perioperative.
© 2024. The Author(s).
Conflict of interest statement
Declarations. Conflict of interest: The authors declare that they have no conflicts of interest to disclose. Human and Animal Rights, or Ethical Approval: This article does not contain any studies with human participants or animals performed by any of the authors. Informed Consent: For this type of study, informed consent is not required. Disclosure: None of the authors has any commercial interest.
Similar articles
- Assessing the efficacy of artificial intelligence to provide peri-operative information for patients with a stoma. ANZ J Surg. 2025 Mar;95(3):464-496. doi: 10.1111/ans.19337. Epub 2024 Dec 2. PMID: 39620607
- Accuracy of ChatGPT, Gemini, Copilot, and Claude to Blepharoplasty-Related Questions. Aesthetic Plast Surg. 2025 Jul 21. doi: 10.1007/s00266-025-05071-9. Online ahead of print. PMID: 40691658
- Evaluating the Efficacy of Large Language Models in Generating Medical Documentation: A Comparative Study of ChatGPT-4, ChatGPT-4o, and Claude. Aesthetic Plast Surg. 2025 Apr 14. doi: 10.1007/s00266-025-04842-8. Online ahead of print. PMID: 40229614
- Harnessing artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in generating clinician-level bariatric surgery recommendations. Surg Obes Relat Dis. 2024 Jul;20(7):603-608. doi: 10.1016/j.soard.2024.03.011. Epub 2024 Mar 24. PMID: 38644078 Review.
- Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions. Surg Obes Relat Dis. 2024 Jul;20(7):609-613. doi: 10.1016/j.soard.2024.04.014. Epub 2024 May 8. PMID: 38782611 Review.
Cited by
- Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics. Patient Prefer Adherence. 2025 Jul 31;19:2227-2249. doi: 10.2147/PPA.S527922. eCollection 2025. PMID: 40771655 Free PMC article. Review.
- Accuracy of LLMs in medical education: evidence from a concordance test with medical teacher. BMC Med Educ. 2025 Mar 26;25(1):443. doi: 10.1186/s12909-025-07009-w. PMID: 40140805 Free PMC article.
- A Performance Evaluation of Large Language Models in Keratoconus: A Comparative Study of ChatGPT-3.5, ChatGPT-4.0, Gemini, Copilot, Chatsonic, and Perplexity. J Clin Med. 2024 Oct 30;13(21):6512. doi: 10.3390/jcm13216512. PMID: 39518652 Free PMC article.