Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 17:10:e1811.
doi: 10.7717/peerj-cs.1811. eCollection 2024.

Using artificial intelligence to explore sound symbolic expressions of gender in American English

Affiliations

Using artificial intelligence to explore sound symbolic expressions of gender in American English

Alexander Kilpatrick et al. PeerJ Comput Sci. .

Abstract

This study investigates the extent to which gender can be inferred from the phonemes that make up given names and words in American English. Two extreme gradient boosted algorithms were constructed to classify words according to gender, one using a list of the most common given names (N∼1,000) in North America and the other using the Glasgow Norms (N∼5,500), a corpus consisting of nouns, verbs, adjectives, and adverbs which have each been assigned a psycholinguistic score of how they are associated with male or female behaviour. Both models report significant findings, but the model constructed using given names achieves a greater accuracy despite being trained on a smaller dataset suggesting that gender is expressed more robustly in given names than in other word classes. Feature importance was examined to determine which features were contributing to the decision-making process. Feature importance scores revealed a general pattern across both models, but also show that not all word classes express gender the same way. Finally, the models were reconstructed and tested on the opposite dataset to determine whether they were useful in classifying opposite samples. The results showed that the models were not as accurate when classifying opposite samples, suggesting that they are more suited to classifying words of the same class.

Keywords: American English; Gender; Gradient Boosting; Sound symbolism.

PubMed Disclaimer

Conflict of interest statement

The authors declare there are no competing interests.

Figures

Figure 1
Figure 1. Distribution of AmE vowels.
Monopthongs and dipthongs marked with an asterisk (*) had a distribution skew to masculine words in both datasets and those marked with a circumflex (ˆ) had a distribution skew to female words in both datasets.

Similar articles

References

    1. Adelman JS, Estes Z, Cossu M. Emotional sound symbolism: languages rapidly signal valence via phonemes. Cognition. 2018;175:122–130. doi: 10.1016/j.cognition.2018.02.007. - DOI - PubMed
    1. Akita K. Ostman JO, Verschueren J, editors. Sound symbolism. Amsterdam/Philadelphia: John Benjaminshttps://benjamins.com/online/hop/articles/sou1 Handbook of pragmatics. 2015
    1. Aryani A, Conrad M, Schmidtke D, Jacobs A. Why ‘piss’ is ruder than ‘pee’? The role of sound in affective meaning making. PLOS ONE. 2018;13(6):e0198430. doi: 10.1371/journal.pone.0198430. - DOI - PMC - PubMed
    1. Bee MA, Perrill SA, Owen PC. Male green frogs lower the pitch of acoustic signals in defense of territories: a possible dishonest signal of size? Behavioral Ecology. 2000;11(2):169–177. doi: 10.1093/beheco/11.2.169. - DOI
    1. Berlin B. The first congress of ethnozoological nomenclature. Journal of the Royal Anthropological Institute. 2006;12:S23–S44.

LinkOut - more resources