Behav Res Methods. 2020 Oct;52(5):1893-1905.
doi: 10.3758/s13428-020-01353-z.

Matrices of the frequency and similarity of Arabic letters and allographs

Sami Boudelaa et al. Behav Res Methods. 2020 Oct.

Abstract

Indicators of letter frequency and similarity have long been available for Indo-European languages. They have not only been pivotal in controlling the design of experimental psycholinguistic studies seeking to determine the factors that underlie reading ability and literacy acquisition, but have also been useful for studies examining the more general aspects of human cognition. Despite their importance, however, such indicators are still not available for Modern Standard Arabic (MSA), a language that, by virtue of its orthographic system, presents an invaluable environment for the experimental investigation of visual word processing. This paper presents for the first time the frequencies of Arabic letters and their allographs based on a 40-million-word corpus, along with their similarity/confusability indicators in three domains: (1) the visual domain, based on human ratings; (2) the auditory domain, based on an analysis of the phonetic features of letter sounds; and (3) the motoric domain, based on an analysis of the stroke features used to write letters and their allographs. Taken together, the frequency and similarity of Arabic letters and their allographs in the visual and motoric domains, as well as the similarities among the letter sounds, will be useful for researchers interested in the processes underpinning orthographic processing, visual word recognition, reading, and literacy acquisition.

Keywords: Allographs; Arabic letters; Frequency; Motoric similarity; Phonetic similarity; Sounds; Visual similarity.
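
As a rough illustration of what a letter-frequency indicator is, the following minimal Python sketch counts the relative frequency of Arabic letters in a short string. It is not the authors' procedure: the function name `arabic_letter_frequencies`, the sample text, and the choice of the Unicode range U+0621 to U+064A are all assumptions made for this example. It also does not distinguish allographs (position-dependent letter forms), which would require additional contextual analysis.

```python
from collections import Counter


def arabic_letter_frequencies(text):
    """Return the relative frequency of each Arabic letter found in `text`.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    # Keep only code points in the basic Arabic letter range U+0621..U+064A.
    # The tatweel (U+0640) is a typographic elongation mark, not a letter,
    # so it is skipped. Diacritics (U+064B and above) fall outside the range.
    letters = [
        ch for ch in text
        if "\u0621" <= ch <= "\u064A" and ch != "\u0640"
    ]
    counts = Counter(letters)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {letter: n / total for letter, n in counts.items()}


# Tiny made-up sample; a real study would use a large corpus.
sample = "كتب الكاتب كتابا"
freqs = arabic_letter_frequencies(sample)
for letter, share in sorted(freqs.items(), key=lambda kv: -kv[1]):
    print(letter, round(share, 3))
```

Applied to a large corpus rather than a single phrase, the same counting step would yield a frequency table of the kind the abstract describes, though corpus cleaning and normalization choices would strongly affect the result.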
