Large-scale patterns of number use in spoken and written English
- PMID: 38344039
- PMCID: PMC10853912
- DOI: 10.1515/cllt-2022-0082
Large-scale patterns of number use in spoken and written English
Abstract
This paper describes patterns of number use in spoken and written English and the main factors that contribute to these patterns. We analysed more than 1.7 million occurrences of numbers between 0 and a billion in the British National Corpus, including conversational speech, presentational speech (e.g., lectures, interviews), imaginative writing (e.g., fiction), and informative writing (e.g., academic books). We find that four main factors affect number frequency: (1) Magnitude - smaller numbers are more frequent than larger numbers; (2) Roundness - round numbers are more frequent than unround numbers of a comparable magnitude, and some round numbers are more frequent than others; (3) Cultural salience - culturally salient numbers (e.g., recent years) are more frequent than non-salient numbers; and (4) Register - more informational texts contain more numbers (in writing), types of numbers, decimals, and larger numbers than less informational texts. In writing, we find that the numbers 1-9 are mostly represented by number words (e.g., 'three'), 10-999,999 are mostly represented by numerals (e.g., '14'), and 1 million-1 billion are mostly represented by a mix of numerals and number words (e.g., '8 million'). Altogether, this study builds a detailed profile of number use in spoken and written English.
Keywords: big data; number frequencies; numerical cognition; register studies; rounding.
© 2023 the author(s), published by De Gruyter, Berlin/Boston.
Figures




References
-
- Ayonrinde Oyedeji A., Stefatos Anthi, Miller Shadé, Richer Amanda, Nadkarni Pallavi, She Jennifer, Alghofaily Ahmad, Mngoma Nomusa. The salience and symbolism of numbers across cultural beliefs and practice. International Review of Psychiatry . 2021;33(1–2):179–188. doi: 10.1080/09540261.2020.1769289. - DOI - PubMed
-
- Barchas-Lichtenstein Jena, Voiklis John, Attaway Bennett, Santhanam Laura, Parson Patti, Grace Thomas Uduak, Isaacs-Thomas Isabella, Ishwar Shivani, Fraser John. Number soup: Case studies of quantitatively dense news. Journalism Practice . 2022:1–28. doi: 10.1080/17512786.2022.2099954. - DOI
-
- Batorsky Ben, Ledvosky Alex, Yarkoni Tal, Groove Buttered. Word2Number. . 2021. [10 May 2021]. https://w2n.readthedocs.io/en/latest/ accessed.
-
- BBC Good Food Chilli con carne recipe. . 2022. [8 September 2022]. https://www.bbcgoodfood.com/recipes/chilli-con-carne-recipe BBC Good Food . accessed.
-
- Beltrama Andrea, Solt Stephanie, Burnett Heather. Context, precision, and social perception: A sociopragmatic study. Language in Society . 2022:1–31. doi: 10.1017/S0047404522000240. - DOI