Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Dec;47(6):1454-68.
doi: 10.1044/1092-4388(2004/108).

Methods for minimizing the confounding effects of word length in the analysis of phonotactic probability and neighborhood density

Affiliations

Methods for minimizing the confounding effects of word length in the analysis of phonotactic probability and neighborhood density

Holly L Storkel. J Speech Lang Hear Res. 2004 Dec.

Abstract

Recent research suggests that phonotactic probability (the likelihood of occurrence of a sound sequence) and neighborhood density (the number of words phonologically similar to a given word) influence spoken language processing and acquisition across the lifespan in both normal and clinical populations. The majority of research in this area has tended to focus on controlled laboratory studies rather than naturalistic data such as spontaneous speech samples or elicited probes. One difficulty in applying current measures of phonotactic probability and neighborhood density to more naturalistic samples is the significant correlation between these variables and word length. This study examines several alternative transformations of phonotactic probability and neighborhood density as a means of reducing or eliminating this correlation with word length. Computational analyses of the words in a large database and reanalysis of archival data supported the use of z scores for the analysis of phonotactic probability as a continuous variable and the use of median transformation scores for the analysis of phonotactic probability as a dichotomous variable. Neighborhood density results were less clear with the conclusion that analysis of neighborhood density as a continuous variable warrants further investigation to differentiate the utility of z scores in comparison to median transformation scores. Furthermore, balanced dichotomous coding of neighborhood density was difficult to achieve, suggesting that analysis of neighborhood density as a dichotomous variable should be approached with caution. Recommendations for future application and analyses are discussed.

PubMed Disclaimer

Similar articles

Cited by

Publication types

LinkOut - more resources