Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 21;17(12):e0278604.
doi: 10.1371/journal.pone.0278604. eCollection 2022.

Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Affiliations

Boys don't cry (or kiss or dance): A computational linguistic lens into gendered actions in film

Victor R Martinez et al. PLoS One. .

Abstract

Contemporary media is full of images that reflect traditional gender notions and stereotypes, some of which may perpetuate harmful gender representations. In an effort to highlight the occurrence of these adverse portrayals, researchers have proposed machine-learning methods to identify stereotypes in the language patterns found in character dialogues. However, not all of the harmful stereotypes are communicated just through dialogue. As a complementary approach, we present a large-scale machine-learning framework that automatically identifies character's actions from scene descriptions found in movie scripts. For this work, we collected 1.2+ million scene descriptions from 912 movie scripts, with more than 50 thousand actions and 20 thousand movie characters. Our framework allow us to study systematic gender differences in movie portrayals at a scale. We show this through a series of statistical analyses that highlight differences in gender portrayals. Our findings provide further evidence to claims from prior media studies including: (i) male characters display higher agency than female characters; (ii) female actors are more frequently the subject of gaze, and (iii) male characters are less likely to display affection. We hope that these data resources and findings help raise awareness on portrayals of character actions that reflect harmful gender stereotypes, and demonstrate novel possibilities for computational approaches in media analysis.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Computational linguistic lens into gendered actions in film.
Our framework starts with a dataset of over 1.2 million movie descriptors from 912 movies and then implements three steps: first, the annotation process, where we collect manual annotations for 9,613 descriptions and over 1.5 million gender expression labels for characters. In the second step, we develop a machine learning model to identify actions, agents and patients from the natural language found in the movie scripts. Our model—trained on a large-set of general-domain documents and fine-tuned on a manually labeled description set—identifies actions, agents and patients from the natural language descriptions found in movie scripts. We use this model to automatically label the complete dataset for analysis. In the final third step, we perform a series of statistical analysis to uncover portrayal differences along characters’ portrayed attributes.
Fig 2
Fig 2. Labeling task.
Annotators are presented with a sentence and an action. They are asked to either select the agent (source) and patient (target) of the action. For cases where one of these is missing, the annotator has the option to check the ‘Does not say’ box.
Fig 3
Fig 3. Proposed SRL system.
Starting at the bottom, the input to the system is an action description in natural language. The output, shown at the top of the figure, is a sequence of labels (one per word). Labels indicate whether this word depicts an action, agent, patient or none. From its inputs, our model obtains a highly-contextualized representation for each word using the BERT transformer [53]. Each representation corresponds to a high dimensional dense vector that encodes the semantics of that word and the context it plays within the sentence. The sequence of vector representations is then fed into a recurrent neural network and a softmax layer for sequence labeling. As a post-processing step, a set of heuristics aggregate conjunctions to handle the case of groups of agents or patients.

References

    1. Holtzman L, Sharpe L. Media messages: What film, television, and popular music teach us about race, class, gender, and sexual orientation. Routledge; 2014.
    1. Gerbner G, Gross L, Morgan M, Signorielli N. Growing up with television: The cultivation perspective. In: Bryant J, Zillmann D, editors. Media effects: Advances in theory and research. New Jersey, NJ: Lawrence Erlbaum Associates, Inc; 1994. p. 17–41.
    1. Browne Graves S. Television and prejudice reduction: When does television as a vicarious experience make a difference? Journal of Social Issues. 1999;55(4):707–727. doi: 10.1111/0022-4537.00143 - DOI
    1. Cheryan S, Drury BJ, Vichayapai M. Enduring influence of stereotypical computer science role models on women’s academic aspirations. Psychology of Women Quarterly. 2013;37(1):72–79. doi: 10.1177/0361684312459328 - DOI
    1. Steinke J. Adolescent girls’ STEM identity formation and media images of STEM professionals: Considering the influence of contextual cues. Frontiers in psychology. 2017;8:716. doi: 10.3389/fpsyg.2017.00716 - DOI - PMC - PubMed