Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 1;4(2):025202.
doi: 10.1121/10.0024633.

Characterizing correlations in partial credit speech recognition scoring with beta-binomial distributions

Affiliations

Characterizing correlations in partial credit speech recognition scoring with beta-binomial distributions

Adam K Bosen. JASA Express Lett. .

Abstract

Partial credit scoring for speech recognition tasks can improve measurement precision. However, assessing the magnitude of this improvement with partial credit scoring is challenging because meaningful speech contains contextual cues, which create correlations between the probabilities of correctly identifying each token in a stimulus. Here, beta-binomial distributions were used to estimate recognition accuracy and intraclass correlation for phonemes in words and words in sentences in listeners with cochlear implants (N = 20). Estimates demonstrated substantial intraclass correlation in recognition accuracy within stimuli. These correlations were invariant across individuals. Intraclass correlations should be addressed in power analysis of partial credit scoring.

PubMed Disclaimer

Conflict of interest statement

The author has no conflicts to disclose.

Figures

Fig. 1.
Fig. 1.
Example beta-binomial distributions of phoneme recognition accuracy within monosyllabic words across varying levels of correct response probability (μ) and intraclass correlation (ρ). Nonzero values of ρ indicate overdispersion of recognition accuracy across phonemes within words, which reflects the use of context to facilitate recognition.
Fig. 2.
Fig. 2.
Group-level distribution of correct phonemes within MSTB CNC words and words within PRESTO sentences are shown as bars. Beta-binomial distributions are shown as solid black lines and binomial distributions are shown as dashed gray lines. PRESTO sentences have a variable number of keywords per trial, so number of keywords correct is shown as a proportion out of total keywords.
Fig. 3.
Fig. 3.
Individual differences in recognition accuracy μp and intraclass correlation ρp estimated via Bayesian modeling. Most likely values for model parameters for each participant are shown as circles, with colors representing different participants. Shaded gradients around each most likely value show the corresponding posterior probability density for both model parameters for that participant, with darker shades corresponding to higher probability density. The dashed black line shows the most likely value for ρgroup.
Fig. 4.
Fig. 4.
The standard deviation of the mean for repeated draws from whole word scoring (n =1, μ = 0.5) and a three-phoneme beta-binomial distribution (n = 3, μ = 0.5, ρ = 0.35) as a function of the number of words tested. Dashed lines show the comparison of the number of trials required to equate standard deviation of the mean for the beta-binomial distribution with whole word scoring for 50 words.

References

    1. Bierer, J. A. , Spindler, E. , Bierer, S. M. , and Wright, R. (2016). “ An examination of sources of variability across the consonant-nucleus-consonant test in cochlear implant listeners,” Trends Hear. 20, 1–8.10.1177/2331216516646556 - DOI - PMC - PubMed
    1. Billings, C. J. , Penman, T. M. , Ellis, E. M. , Baltzell, L. S. , and McMillan, G. P. (2016). “ Phoneme and word scoring in speech-in-noise audiometry,” Am. J. Audiol. 25(March), 75–83.10.1044/2016_AJA-15-0068 - DOI - PMC - PubMed
    1. Boothroyd, A. , and Nittrouer, S. (1988). “ Mathematical treatment of context effects in phoneme and word recognition,” J. Acoust. Soc. Am. 84(1), 101–114.10.1121/1.396976 - DOI - PubMed
    1. Bosen, A. K. , Sevich, V. A. , and Cannon, S. A. (2021). “ Forward digit span and word familiarity do not correlate with differences in speech recognition in individuals with cochlear implants after accounting for auditory resolution,” J. Speech. Lang. Hear. Res. 64(8), 3330–3342.10.1044/2021_JSLHR-20-00574 - DOI - PMC - PubMed
    1. Carpenter, B. , Gelman, A. , Hoffman, M. D. , Lee, D. , Goodrich, B. , Betancourt, M. , Brubaker, M. A. , Guo, J. , Li, P. , and Riddell, A. (2017). “ Stan: A probabilistic programming language,” J. Stat. Softw. 76(1), 1–32.10.18637/jss.v076.i01 - DOI - PMC - PubMed

Publication types