Using artificial intelligence to explore sound symbolic expressions of gender in American English
- PMID: 38283586
- PMCID: PMC10821993
- DOI: 10.7717/peerj-cs.1811
Abstract
This study investigates the extent to which gender can be inferred from the phonemes that make up given names and words in American English. Two extreme gradient boosting classifiers were constructed to classify words according to gender: one trained on a list of the most common given names in North America (N∼1,000), and the other on the Glasgow Norms (N∼5,500), a corpus of nouns, verbs, adjectives, and adverbs, each assigned a psycholinguistic score reflecting how strongly it is associated with male or female behaviour. Both models yield significant results, but the model built on given names achieves greater accuracy despite being trained on a smaller dataset, suggesting that gender is expressed more robustly in given names than in other word classes. Feature importance scores were examined to determine which features contributed to the models' decisions; they revealed a general pattern across both models but also showed that not all word classes express gender in the same way. Finally, the models were reconstructed and tested on the opposite dataset to determine whether they could classify samples from the other word class. The models were less accurate on these samples, suggesting that they are better suited to classifying words of the same class they were trained on.
Keywords: American English; Gender; Gradient Boosting; Sound symbolism.
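The workflow described in the abstract (train a boosted classifier on sound-based features, inspect feature importance, then test on the other word class) can be sketched in a few lines. This is a minimal illustration under loud assumptions and is not the authors' implementation: it uses a hand-written gradient boosting loop over decision stumps as a stand-in for extreme gradient boosting, letter counts as a crude proxy for phoneme features, and two tiny invented word lists (`NAMES`, `WORDS`) with hypothetical labels rather than the study's datasets.

```python
import math
from collections import Counter

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

# Toy examples invented for illustration; NOT the study's data.
# Label 1 = female-associated, 0 = male-associated (hypothetical labels).
NAMES = [("emma", 1), ("olivia", 1), ("sophia", 1), ("mia", 1), ("ava", 1),
         ("isabella", 1), ("john", 0), ("luke", 0), ("henry", 0),
         ("peter", 0), ("oliver", 0), ("hugo", 0)]
WORDS = [("ballet", 1), ("flower", 1), ("hammer", 0), ("truck", 0)]

def featurize(word):
    """Letter counts, used here as a crude stand-in for phoneme counts."""
    counts = Counter(word)
    return [counts[ch] for ch in ALPHABET]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Pick the single split that minimises squared error on the residuals."""
    mean_all = sum(residuals) / len(residuals)
    total = sum((r - mean_all) ** 2 for r in residuals)
    best = None
    for j in range(len(X[0])):
        # Dropping the largest value keeps both sides of the split non-empty.
        for t in sorted({row[j] for row in X})[:-1]:
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = (sum((r - lm) ** 2 for r in left)
                   + sum((r - rm) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, t, lm, rm)
    sse, j, t, lm, rm = best
    return j, t, lm, rm, total - sse  # last value: error reduction ("gain")

def train(data, rounds=20, lr=0.3):
    """Gradient boosting on log loss with one stump per round."""
    X = [featurize(w) for w, _ in data]
    y = [label for _, label in data]
    scores = [0.0] * len(X)
    stumps, importance = [], [0.0] * len(ALPHABET)
    for _ in range(rounds):
        # Negative gradient of the log loss with respect to the raw score.
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, t, lm, rm, gain = fit_stump(X, residuals)
        stumps.append((j, t, lr * lm, lr * rm))
        importance[j] += gain  # gain-based feature importance
        scores = [s + lr * (lm if row[j] <= t else rm)
                  for s, row in zip(scores, X)]
    return stumps, importance

def predict(stumps, word):
    x = featurize(word)
    score = sum(lv if x[j] <= t else rv for j, t, lv, rv in stumps)
    return 1 if score > 0 else 0

def accuracy(stumps, data):
    return sum(predict(stumps, w) == label for w, label in data) / len(data)

model, importance = train(NAMES)
top_feature = ALPHABET[importance.index(max(importance))]
print("same-class accuracy:", accuracy(model, NAMES))
print("cross-class accuracy:", accuracy(model, WORDS))
print("most important feature:", top_feature)
```

On this toy data the model leans entirely on a single letter that happens to separate the invented names, and that cue does not carry over to the invented words. This only illustrates why cross-class testing is informative; it says nothing about the effect sizes reported in the study.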
©2024 Kilpatrick and Ćwiek.
Conflict of interest statement
The authors declare there are no competing interests.