HelexKids: A word frequency database for Greek and Cypriot primary school children
- PMID: 26822666
- PMCID: PMC5352803
- DOI: 10.3758/s13428-015-0698-5
HelexKids: A word frequency database for Greek and Cypriot primary school children
Abstract
In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children. The database is open access and can be searched online at www.helexkids.org .
Keywords: Children; Contextual diversity; Frequency; Greek language; Word database.
Figures
Similar articles
-
CCLOWW: A grade-level Chinese children's lexicon of written words.Behav Res Methods. 2023 Jun;55(4):1874-1889. doi: 10.3758/s13428-022-01890-9. Epub 2022 Jul 1. Behav Res Methods. 2023. PMID: 35776384
-
MANULEX: a grade-level lexical database from French elementary school readers.Behav Res Methods Instrum Comput. 2004 Feb;36(1):156-66. doi: 10.3758/bf03195560. Behav Res Methods Instrum Comput. 2004. PMID: 15190710
-
ESCOLEX: a grade-level lexical database from European Portuguese elementary to middle school textbooks.Behav Res Methods. 2014 Mar;46(1):240-53. doi: 10.3758/s13428-013-0350-1. Behav Res Methods. 2014. PMID: 23709164
-
GreekLex: a lexical database of Modern Greek.Behav Res Methods. 2008 Aug;40(3):773-83. doi: 10.3758/brm.40.3.773. Behav Res Methods. 2008. PMID: 18697673
-
Phonics training for English-speaking poor readers.Cochrane Database Syst Rev. 2018 Nov 14;11(11):CD009115. doi: 10.1002/14651858.CD009115.pub3. Cochrane Database Syst Rev. 2018. PMID: 30480759 Free PMC article.
Cited by
-
The Children and Young People's Books Lexicon (CYP-LEX): A large-scale lexical database of books read by children and young people in the United Kingdom.Q J Exp Psychol (Hove). 2024 Dec;77(12):2418-2438. doi: 10.1177/17470218241229694. Epub 2024 Mar 12. Q J Exp Psychol (Hove). 2024. PMID: 38262912 Free PMC article.
-
Multi-LEX: A database of multi-word frequencies for French and English.Behav Res Methods. 2023 Dec;55(8):4315-4328. doi: 10.3758/s13428-022-02018-9. Epub 2022 Nov 28. Behav Res Methods. 2023. PMID: 36443580
-
The Children's Picture Books Lexicon (CPB-LEX): A large-scale lexical database from children's picture books.Behav Res Methods. 2024 Aug;56(5):4504-4521. doi: 10.3758/s13428-023-02198-y. Epub 2023 Aug 11. Behav Res Methods. 2024. PMID: 37566336 Free PMC article.
-
CCLOWW: A grade-level Chinese children's lexicon of written words.Behav Res Methods. 2023 Jun;55(4):1874-1889. doi: 10.3758/s13428-022-01890-9. Epub 2022 Jul 1. Behav Res Methods. 2023. PMID: 35776384
-
Picture naming test through the prism of cognitive neuroscience and linguistics: adapting the test for cerebellar tumor survivors-or pouring new wine in old sacks?Front Psychol. 2024 Mar 19;15:1332391. doi: 10.3389/fpsyg.2024.1332391. eCollection 2024. Front Psychol. 2024. PMID: 38566942 Free PMC article. Review.
References
-
- Aristotle University of Thessaloniki (1998). Lexicon of Common Modern Greek. Thessaloniki: Institute for Modern Greek Studies.
-
- Baayen RH. Demythologizing the word frequency effect: A discriminative learning perspective. Mental Lexicon. 2010;5:436–461. doi: 10.1075/ml.5.1.06baa. - DOI
-
- Baayen RH, Feldman LB, Schreuder R. Morphological influences on the recognition of monosyllabic monomorphemic words. Journal of Memory and Language. 2006;55:290–313. doi: 10.1016/j.jml.2006.03.008. - DOI
-
- Baayen RH, Piepenbrock R, Gulikers L. The CELEX lexical database (Release 2, CD-ROM) Philadelphia, PA: Linguistic Data Consortium, University of Pennsylvania; 1995.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources