Investigating Potential Gender Differences in ChatGPT-Diagnosed Clinical Vignettes
Keywords:
AI, chatgpt, gender bias, healthcare, misdiagnosisAbstract
BACKGROUND: The integration of artificial intelligence (AI) in medical decision-making introduces additional concerns, particularly regarding information bias within AI models such as ChatGPT, which heavily rely on training data. With gender-based disparities in diagnosis and treatment being well-documented in healthcare, there is a pressing need to evaluate the potential of AI models to perpetuate or alleviate these gender biases.
AIMS: This study seeks to investigate gender differences in diagnostic accuracy within ChatGPT 3.5 by evaluating the accuracy and completeness of its responses to various clinical vignettes.
METHODS: Ten medical conditions (including psychiatric, respiratory, cardiac, and cerebrovascular cases) previously reported for gender-based misdiagnoses, were selected for the study. Two identical clinical vignettes were created for each condition, with the only difference being the gender of the patient. These 20 vignettes were entered into ChatGPT 3.5 randomly by a single researcher, each accompanied by a prompt requesting the most likely explanation for the patient’s symptoms and the next appropriate step in management. The responses generated by ChatGPT were evaluated for accuracy and completeness by two independent evaluators, utilizing a scale set by Johnson et al., which included a six-point Likert scale ranging from 1 (completely incorrect) to 6 (correct) for accuracy, and a three-point scale for completeness, ranging from 1 (incomplete) to 3 (comprehensive). Discrepancies were resolved through a blind consensus process. Data analysis and visualization was done using RStudio v4.3.2, with statistical significance between accuracy and completeness was determined using Spearman’s R and Mann-Whitney U Tests.
RESULTS: Among the 20 cases, six were incorrectly diagnosed, with two instances attributed to gender-based misdiagnoses. Specifically, ChatGPT misclassified ectopic pregnancy as appendicitis, and paroxysmal supraventricular tachycardia (PSVT) as a panic attack in female patients, despite indicative symptoms and prior correct diagnoses in male counterparts. Additionally, systemic lupus erythematosus (SLE) was inaccurately labeled as rheumatoid arthritis (RA) in both male and female patients. Moreover, eating disorders were misidentified, with ChatGPT failing to provide definitive diagnoses for these conditions. The overall median accuracy score was 6, (Mean = 5.5, SD = 0.6), while the median completeness score was 2.5 (Mean = 2.5, SD = 0.5). Correlation analysis indicated a non-significant relationship between accuracy and completeness (Spearman's R: rs = 0.23139, p = 0.3263), although Mann Whitney U test results suggested significant discrepancies in accuracy between correctly and incorrectly diagnosed cases (z-score = 5.39649, p < .00001).
CONCLUSION: While the AI's responses were generally accurate and complete, the observed misdiagnoses of conditions such as PSVT and eating disorders highlight the need for a more thorough examination of potential biases in AI-driven chatbots. The varying outcomes in the Spearman’s R and Mann-Whitney U tests indicate that, although there may not be a consistent linear relationship between accuracy and completeness, ChatGPT's performance differs significantly across scenarios, necessitating further investigation. Moreover, the small sample size of vignette may not fully capture the extent of potential biases. Despite these limitations, the findings underscore the complexity of AI in healthcare and the critical importance of continuous scrutiny and refinement of these models.
Metrics
References
Sagynbekov K. Gender-based health disparities: a state-level study of the American adult population. Milken Institute. 2017.
Lenti MV, Di Sabatino A. Disease-and gender-related characteristics of coeliac disease influence diagnostic delay. European Journal of Internal Medicine. 2021;83:12-3.
Johnson D, Goodman R, Patrinely J, Stone C, Zimmerman E, Donald R, et al. Assessing the accuracy and reliability of AI-generated medical responses: an evaluation of the Chat-GPT model. Research square. 2023.
Downloads
Published
How to Cite
Issue
Section
Categories
License
Copyright (c) 2024 Anjali Mediboina, Meghana Bhupathi, Keerthana Janapareddy
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- The Author retains copyright in the Work, where the term “Work” shall include all digital objects that may result in subsequent electronic publication or distribution.
- Upon acceptance of the Work, the author shall grant to the Publisher the right of first publication of the Work.
- The Author shall grant to the Publisher and its agents the nonexclusive perpetual right and license to publish, archive, and make accessible the Work in whole or in part in all forms of media now or hereafter known under a Creative Commons Attribution 4.0 International License or its equivalent, which, for the avoidance of doubt, allows others to copy, distribute, and transmit the Work under the following conditions:
- Attribution—other users must attribute the Work in the manner specified by the author as indicated on the journal Web site; with the understanding that the above condition can be waived with permission from the Author and that where the Work or any of its elements is in the public domain under applicable law, that status is in no way affected by the license.
- The Author is able to enter into separate, additional contractual arrangements for the nonexclusive distribution of the journal's published version of the Work (e.g., post it to an institutional repository or publish it in a book), as long as there is provided in the document an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post online a prepublication manuscript (but not the Publisher’s final formatted PDF version of the Work) in institutional repositories or on their Websites prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work. Any such posting made before acceptance and publication of the Work shall be updated upon publication to include a reference to the Publisher-assigned DOI (Digital Object Identifier) and a link to the online abstract for the final published Work in the Journal.
- Upon Publisher’s request, the Author agrees to furnish promptly to Publisher, at the Author’s own expense, written evidence of the permissions, licenses, and consents for use of third-party material included within the Work, except as determined by Publisher to be covered by the principles of Fair Use.
- The Author represents and warrants that:
- the Work is the Author’s original work;
- the Author has not transferred, and will not transfer, exclusive rights in the Work to any third party;
- the Work is not pending review or under consideration by another publisher;
- the Work has not previously been published;
- the Work contains no misrepresentation or infringement of the Work or property of other authors or third parties; and
- the Work contains no libel, invasion of privacy, or other unlawful matter.
- The Author agrees to indemnify and hold Publisher harmless from the Author’s breach of the representations and warranties contained in Paragraph 6 above, as well as any claim or proceeding relating to Publisher’s use and publication of any content contained in the Work, including third-party content.
Enforcement of copyright
The IJMS takes the protection of copyright very seriously.
If the IJMS discovers that you have used its copyright materials in contravention of the license above, the IJMS may bring legal proceedings against you seeking reparation and an injunction to stop you using those materials. You could also be ordered to pay legal costs.
If you become aware of any use of the IJMS' copyright materials that contravenes or may contravene the license above, please report this by email to contact@ijms.org
Infringing material
If you become aware of any material on the website that you believe infringes your or any other person's copyright, please report this by email to contact@ijms.org