Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Victor R Martinez¹, Krishna Somandepalli², Shrikanth Narayanan^{1

2}

Affiliations

¹ Department of Computer Science, University of Southern California, Los Angeles, California, United States of America.
² Department of Electrical & Computer Engineering, University of Southern California, Los Angeles, California, United States of America.

PMID: 36542600
PMCID: PMC9770346
DOI: 10.1371/journal.pone.0278604

Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Victor R Martinez et al. PLoS One. 2022.

. 2022 Dec 21;17(12):e0278604.

doi: 10.1371/journal.pone.0278604. eCollection 2022.

Authors

Victor R Martinez¹, Krishna Somandepalli², Shrikanth Narayanan^{1

2}

Affiliations

¹ Department of Computer Science, University of Southern California, Los Angeles, California, United States of America.
² Department of Electrical & Computer Engineering, University of Southern California, Los Angeles, California, United States of America.

PMID: 36542600
PMCID: PMC9770346
DOI: 10.1371/journal.pone.0278604

Abstract

Contemporary media is full of images that reflect traditional gender notions and stereotypes, some of which may perpetuate harmful gender representations. In an effort to highlight the occurrence of these adverse portrayals, researchers have proposed machine-learning methods to identify stereotypes in the language patterns found in character dialogues. However, not all of the harmful stereotypes are communicated just through dialogue. As a complementary approach, we present a large-scale machine-learning framework that automatically identifies character's actions from scene descriptions found in movie scripts. For this work, we collected 1.2+ million scene descriptions from 912 movie scripts, with more than 50 thousand actions and 20 thousand movie characters. Our framework allow us to study systematic gender differences in movie portrayals at a scale. We show this through a series of statistical analyses that highlight differences in gender portrayals. Our findings provide further evidence to claims from prior media studies including: (i) male characters display higher agency than female characters; (ii) female actors are more frequently the subject of gaze, and (iii) male characters are less likely to display affection. We hope that these data resources and findings help raise awareness on portrayals of character actions that reflect harmful gender stereotypes, and demonstrate novel possibilities for computational approaches in media analysis.

Copyright: © 2022 Martinez et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Computational linguistic lens into gendered actions in film.**
Our framework starts with a dataset of over 1.2 million movie descriptors from 912 movies and then implements three steps: first, the annotation process, where we collect manual annotations for 9,613 descriptions and over 1.5 million gender expression labels for characters. In the second step, we develop a machine learning model to identify actions, agents and patients from the natural language found in the movie scripts. Our model—trained on a large-set of general-domain documents and fine-tuned on a manually labeled description set—identifies actions, agents and patients from the natural language descriptions found in movie scripts. We use this model to automatically label the complete dataset for analysis. In the final third step, we perform a series of statistical analysis to uncover portrayal differences along characters’ portrayed attributes.

**Fig 2. Labeling task.**
Annotators are presented with a sentence and an action. They are asked to either select the agent (source) and patient (target) of the action. For cases where one of these is missing, the annotator has the option to check the ‘Does not say’ box.

**Fig 3. Proposed SRL system.**
Starting at the bottom, the input to the system is an action description in natural language. The output, shown at the top of the figure, is a sequence of labels (one per word). Labels indicate whether this word depicts an action, agent, patient or none. From its inputs, our model obtains a highly-contextualized representation for each word using the BERT transformer [53]. Each representation corresponds to a high dimensional dense vector that encodes the semantics of that word and the context it plays within the sentence. The sequence of vector representations is then fed into a recurrent neural network and a softmax layer for sequence labeling. As a post-processing step, a set of heuristics aggregate conjunctions to handle the case of groups of agents or patients.

See this image and copyright information in PMC

References

1. Holtzman L, Sharpe L. Media messages: What film, television, and popular music teach us about race, class, gender, and sexual orientation. Routledge; 2014.
1. Gerbner G, Gross L, Morgan M, Signorielli N. Growing up with television: The cultivation perspective. In: Bryant J, Zillmann D, editors. Media effects: Advances in theory and research. New Jersey, NJ: Lawrence Erlbaum Associates, Inc; 1994. p. 17–41.
1. Browne Graves S. Television and prejudice reduction: When does television as a vicarious experience make a difference? Journal of Social Issues. 1999;55(4):707–727. doi: 10.1111/0022-4537.00143 - DOI
1. Cheryan S, Drury BJ, Vichayapai M. Enduring influence of stereotypical computer science role models on women’s academic aspirations. Psychology of Women Quarterly. 2013;37(1):72–79. doi: 10.1177/0361684312459328 - DOI
1. Steinke J. Adolescent girls’ STEM identity formation and media images of STEM professionals: Considering the influence of contextual cues. Frontiers in psychology. 2017;8:716. doi: 10.3389/fpsyg.2017.00716 - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Affiliations

Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials

Miscellaneous