Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb;49(1):83-96.
doi: 10.3758/s13428-015-0698-5.

HelexKids: A word frequency database for Greek and Cypriot primary school children

Affiliations

HelexKids: A word frequency database for Greek and Cypriot primary school children

Aris R Terzopoulos et al. Behav Res Methods. 2017 Feb.

Abstract

In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children. The database is open access and can be searched online at www.helexkids.org .

Keywords: Children; Contextual diversity; Frequency; Greek language; Word database.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Distribution of textbooks per subject
Fig. 2
Fig. 2
Distribution of textbooks per grade

Similar articles

Cited by

References

    1. Adelman JS, Brown GDA, Quesada JF. Contextual diversity, not word frequency, determines word naming and lexical decision times. Psychological Science. 2006;17:814–823. doi: 10.1111/j.1467-9280.2006.01787.x. - DOI - PubMed
    1. Aristotle University of Thessaloniki (1998). Lexicon of Common Modern Greek. Thessaloniki: Institute for Modern Greek Studies.
    1. Baayen RH. Demythologizing the word frequency effect: A discriminative learning perspective. Mental Lexicon. 2010;5:436–461. doi: 10.1075/ml.5.1.06baa. - DOI
    1. Baayen RH, Feldman LB, Schreuder R. Morphological influences on the recognition of monosyllabic monomorphemic words. Journal of Memory and Language. 2006;55:290–313. doi: 10.1016/j.jml.2006.03.008. - DOI
    1. Baayen RH, Piepenbrock R, Gulikers L. The CELEX lexical database (Release 2, CD-ROM) Philadelphia, PA: Linguistic Data Consortium, University of Pennsylvania; 1995.