Simulating Language-specific and Language-general Effects in a Statistical Learning Model of Chinese Reading
- PMID: 20161189
- PMCID: PMC2728242
- DOI: 10.1016/j.jml.2009.05.001
Simulating Language-specific and Language-general Effects in a Statistical Learning Model of Chinese Reading
Abstract
Many theoretical models of reading assume that different writing systems require different processing assumptions. For example, it is often claimed that print-to-sound mappings in Chinese are not represented or processed sub-lexically. We present a connectionist model that learns the print to sound mappings of Chinese characters using the same functional architecture and learning rules that have been applied to English. The model predicts an interaction between item frequency and print-to-sound consistency analogous to what has been found for English, as well as a language-specific regularity effect particular to Chinese. Behavioral naming experiments using the same test items as the model confirmed these predictions. Corpus properties and the analyses of internal representations that evolved over training revealed that the model was able to capitalize on information in "phonetic components" - sub-lexical structures of variable size that convey probabilistic information about pronunciation. The results suggest that adult reading performance across very different writing systems may be explained as the result of applying the same learning mechanisms to the particular input statistics of writing systems shaped by both culture and the exigencies of communicating spoken language in a visual medium.
Figures
) and control phonograms; Within P. Family = comparisons among all items that share the critical phonetic component; Orth. Control = comparisons between all items with the critical phonetic component and control items selected to share the same amount of orthographic information; Phon. Control = comparisons between all items with the critical phonetic component and their homophones that do not overlap orthographically. Panel B depicts the similarity space based on orthographic inputs for the test items. Grey patches indicate clusters of items that share orthographic structure; black circles indicate items that share a phonetic component. In Panel C, the similarity space based on hidden unit activations before training is shown. Grey patches and black circles as in Panel B. Panel D shows the similarity space based on hidden unit activations after 3 million trials of training on spelling to sound translation. The gray patch indicates items that share the same phonetic component, and cluster together only after training on spelling-to-sound correspondences. Black circles encompass items that share the same pronunciation in addition to sharing a phonetic component.
) have overlapping representations, whereas control items, matched for the degree of orthographic similarity, do not.References
-
- Balota D, Ferraro F. A dissociation of frequency and regularity effects in pronunciation performance across young adults, older adults, and individuals with senile dementia of the Alzheimer type. Journal of Memory and Language. 1993;32(5):573–592.
-
- Bentin S, Frost R. Processing lexical ambiguity and visual word recognition in a deep orthography. Memory & Cognition. 1987;15(1):13–23. - PubMed
-
- Bi Y, Han Z, Weekes B, Shu H. The interaction between semantic and the nonsemantic systems in reading: Evidence from Chinese. Neuropsychologia. 2007;45(12):2660–2673. - PubMed
-
- Chen H-C, Flores d'Arcais GB, Cheung S-L. Orthographic and phonological activation in recognizing Chinese characters. Psychological Research. 1995;58(2):144–153.
-
- Chen H-C, Shu H. Lexical activation during the recognition of Chinese characters: Evidence against early phonological activation. Psychonomic Bulletin & Review. 2001;8(3):511–518. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources