Sound symbolism in Japanese names: Machine learning approaches to gender classification
- PMID: 38466741
- PMCID: PMC10927153
- DOI: 10.1371/journal.pone.0297440
Abstract
This study uses machine learning algorithms to investigate the sound symbolic expression of gender in Japanese names. Its main goal is to explore how gender is expressed in the phonemes that make up Japanese names and whether the systematic sound-meaning mappings observed in Indo-European languages extend to Japanese. The study also compares the performance of two machine learning algorithms. Random Forest and XGBoost models are trained with the sounds of names as predictors and the typical gender of the referents as the dependent variable. Each algorithm is cross-validated using k-fold cross-validation (28 folds) and tested on samples not included in the training cycle. Both algorithms classify names into gender categories with reasonable accuracy; however, the XGBoost model performs significantly better than the Random Forest model. Feature importance scores reveal that certain sounds carry gender information: the voiced bilabial nasal /m/ and the voiceless velar consonant /k/ were associated with femininity, and the high front vowel /i/ was associated with masculinity. The associations observed for /i/ and /k/ run contrary to typical patterns found in other languages, suggesting that Japanese is unique in the sound symbolic expression of gender. This study highlights the importance of considering cultural and linguistic nuances in sound symbolism research and underscores the advantage of XGBoost in capturing complex relationships within the data for improved classification accuracy. These findings contribute to the understanding of sound symbolism and gender associations in language.
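The data preparation implied by this design can be sketched as follows. This is a minimal illustration only, not the authors' pipeline: the abstract does not specify the feature encoding or preprocessing, so the toy name list, the one-letter-per-phoneme simplification, the helper names `phoneme_counts` and `k_fold_indices`, and the choice of 4 folds (rather than the study's 28) are all assumptions made for this example.

```python
from collections import Counter

# Hypothetical toy data: romanized names with a typical gender label.
# Illustrative only; these are NOT taken from the study's dataset.
NAMES = [
    ("yumiko", "F"), ("takeshi", "M"), ("mika", "F"), ("haruto", "M"),
    ("kumiko", "F"), ("kenji", "M"), ("emi", "F"), ("daiki", "M"),
]


def phoneme_counts(name, inventory):
    """Count how often each symbol of the inventory occurs in a name.

    Assumption: each romanized letter stands in for one phoneme, which is
    a simplification (for example, "sh" is really a single sound).
    """
    counts = Counter(name)
    return [counts[p] for p in inventory]


def k_fold_indices(n_samples, k):
    """Yield (train, test) index lists; each sample is tested exactly once."""
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    for i, test in enumerate(folds):
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test


# Feature matrix: one row per name, one column per symbol in the inventory.
inventory = sorted({ch for name, _ in NAMES for ch in name})
X = [phoneme_counts(name, inventory) for name, _ in NAMES]
y = [label for _, label in NAMES]

# Hypothetical k=4 for the toy data; the study reports 28 folds.
splits = list(k_fold_indices(len(NAMES), k=4))
```

In each split, a classifier such as Random Forest or XGBoost would be fit on the `train` rows of `X` and `y` and scored on the `test` rows; the columns of `X` are then the units over which feature importance scores would be reported.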
Copyright: © 2024 Ngai et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.