Sex trouble: Sex/gender slippage, sex confusion, and sex obsession in machine learning using electronic health records
- PMID: 36033589
- PMCID: PMC9403398
- DOI: 10.1016/j.patter.2022.100534
Sex trouble: Sex/gender slippage, sex confusion, and sex obsession in machine learning using electronic health records
Abstract
False assumptions that sex and gender are binary, static, and concordant are deeply embedded in the medical system. As machine learning researchers use medical data to build tools to solve novel problems, understanding how existing systems represent sex/gender incorrectly is necessary to avoid perpetuating harm. In this perspective, we identify and discuss three factors to consider when working with sex/gender in research: "sex/gender slippage," the frequent substitution of sex and sex-related terms for gender and vice versa; "sex confusion," the fact that any given sex variable holds many different potential meanings; and "sex obsession," the idea that the relevant variable for most inquiries related to sex/gender is sex assigned at birth. We then explore how these phenomena show up in medical machine learning research using electronic health records, with a specific focus on HIV risk prediction. Finally, we offer recommendations about how machine learning researchers can engage more carefully with questions of sex/gender.
Keywords: electronic health records; gender; healthcare; machine learning; non-binary; sex; sex/gender; transgender.
© 2022 The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Comment in
-
Responsible and accountable data science.Patterns (N Y). 2022 Nov 11;3(11):100629. doi: 10.1016/j.patter.2022.100629. eCollection 2022 Nov 11. Patterns (N Y). 2022. PMID: 36419445 Free PMC article. No abstract available.
References
-
- John Lewis at the National Constitution Center https://www.youtube.com/watch?v=atF3WWELkmE
-
- Desautels T., Calvert J., Hoffman J., Jay M., Kerem Y., Shieh L., Shimabukuro D., Chettipally U., Feldman M.D., Barton C., et al. Prediction of sepsis in the intensive care unit with minimal electronic health record data: a machine learning approach. JMIR Med. Informatics. September 2016;4:e28. doi: 10.2196/medinform.5909. - DOI - PMC - PubMed
-
- National Institutes of Health NOT-OD-02-001: amendment: NIH policy and guidelines on the inclusion of women and minorities as subjects in clinical research. 2001. https://grants.nih.gov/grants/guide/notice-files/NOT-OD-02-001.html
-
- National Institutes of Health NOT-OD-18-014: revision: NIH Policy and guidelines on the inclusion of women and minorities as subjects in clinical research. 2017. https://grants.nih.gov/grants/guide/notice-files/NOT-OD-18-014.html
Publication types
LinkOut - more resources
Full Text Sources
Research Materials
