Misrepresentation of Overall and By-Gender Mortality Causes in Film Using Online, Crowd-Sourced Data: Quantitative Analysis
- PMID: 40554798
- PMCID: PMC12212890
- DOI: 10.2196/70853
Misrepresentation of Overall and By-Gender Mortality Causes in Film Using Online, Crowd-Sourced Data: Quantitative Analysis
Abstract
Background: The common phrase "representation matters" asserts that media has a measurable and important impact on civic society's perception of self and others. The representation of health in media, in particular, may reflect and perpetuate a society's disease burden.
Objective: In this study, for the top 10 major causes of death in the United States, we aimed to examine how cinematic representation overall and by-gender mortality diverges from reality.
Methods: Using crowd-sourced data on over 68,000 film deaths from Cinemorgue Wiki, we employ natural language processing techniques to analyze shifts in representation of deaths in movies versus the 2021 National Vital Statistics Survey top 10 mortality causes. We parsed, stemmed, and classified each film death database entry, and then categorized film deaths by gender using a specifically trained gender text classifier.
Results: Overall, movies strongly overrepresent suicide and, to a lesser degree, accidents. In terms of gender, movies overrepresent men and underrepresent women for nearly every major mortality cause, including heart disease and cerebrovascular disease (chi-square test, P<.001); 73.6% (477/648) of film deaths from heart disease were men (vs 384,866/695,547, 55.4% in real life) and 69.4% (50/72) of film deaths from cerebrovascular disease were men (vs 70,852/162,890, 43.5% in real life). The 2 exceptions for which women were overrepresented are suicide and accidents (chi-square test, P<.001), with 39.7% (945/2382) deaths from suicide in film being women (vs 9825/48,183, 20.4% in real life) and 38.8% (485/1250) deaths from accidents in film being women (vs 75,333/225,935, 33.5% in real life).
Conclusions: We discuss the implications of under- and overrepresenting causes of death overall and by gender, as well as areas of future research.
Keywords: NLP; data science; gender; media representation; mortality; natural language processing.
© Calla Glavin Beauregard, Christopher M Danforth, Peter Sheridan Dodds. Originally published in JMIR Formative Research (https://formative.jmir.org).
Conflict of interest statement
Figures


Similar articles
-
Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1. MMWR Surveill Summ. 2025. PMID: 40493548 Free PMC article.
-
Gender differences in the context of interventions for improving health literacy in migrants: a qualitative evidence synthesis.Cochrane Database Syst Rev. 2024 Dec 12;12(12):CD013302. doi: 10.1002/14651858.CD013302.pub2. Cochrane Database Syst Rev. 2024. PMID: 39665382
-
A Data Science Approach to Estimating the Frequency of Driving Cessation Associated Suicide in the US: Evidence From the National Violent Death Reporting System.Front Public Health. 2021 Aug 16;9:689967. doi: 10.3389/fpubh.2021.689967. eCollection 2021. Front Public Health. 2021. PMID: 34485220 Free PMC article.
-
Factors that influence participation in physical activity for people with bipolar disorder: a synthesis of qualitative evidence.Cochrane Database Syst Rev. 2024 Jun 4;6(6):CD013557. doi: 10.1002/14651858.CD013557.pub2. Cochrane Database Syst Rev. 2024. PMID: 38837220 Free PMC article. Review.
-
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3. Syst Rev. 2024. PMID: 39593159 Free PMC article.
References
-
- Curtin SC, Tejada-Vera B, Bastian BA. Deaths: leading causes for 2021. [29-05-2025];Natl Vital Stat Rep. 2024 73(4) https://www.cdc.gov/nchs/data/nvsr/nvsr73/nvsr73-04.pdf URL. Accessed. - PubMed
-
- Isakadze N, Mehta PK, Law K, Dolan M, Lundberg GP. Addressing the gap in physician preparedness to assess cardiovascular risk in women: a comprehensive approach to cardiovascular risk assessment in women. Curr Treat Options Cardiovasc Med. 2019 Jul 29;21(9):47. doi: 10.1007/s11936-019-0753-0. doi. Medline. - DOI - PubMed
-
- Wenger NK, Lloyd-Jones DM, Elkind MSV, et al. Call to action for cardiovascular disease in women: epidemiology, awareness, access, and delivery of equitable health care: a Presidential Advisory from the American Heart Association. Circulation. 2022 Jun 7;145(23):e1059–e1071. doi: 10.1161/CIR.0000000000001071. doi. Medline. - DOI - PMC - PubMed
-
- Brooks D, Hébert L. The SAGE Handbook of Gender and Communication. SAGE Publications; 2006. Gender, race, and media representation. doi. - DOI
-
- Dill‐Shackleford KE, Vinney C, Hopper‐Losenicky K. Connecting the dots between fantasy and reality: the social psychology of our engagement with fictional narrative and its functional value. Social & Personality Psych. 2016 Nov;10(11):634–646. doi: 10.1111/spc3.12274. doi. - DOI
MeSH terms
LinkOut - more resources
Full Text Sources