Accuracy of the Language Environment Analysis System Segmentation and Metrics: A Systematic Review

Alejandrina Cristia¹, Federica Bulgarelli², Elika Bergelson²

Affiliations

¹ Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Études Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.
² Psychology & Neuroscience, Duke University, Durham, NC.

PMID: 32302262
PMCID: PMC7242991
DOI: 10.1044/2020_JSLHR-19-00017

Alejandrina Cristia et al. J Speech Lang Hear Res. 2020 Apr 27;63(4):1093-1105. doi: 10.1044/2020_JSLHR-19-00017. Epub 2020 Apr 17.

Abstract

**Purpose.** The Language Environment Analysis (LENA) system provides automated measures that facilitate clinical and nonclinical research and interventions on language development, but there are only a few scattered independent reports of these measures' validity. The objectives of the current systematic review were to (a) discover studies comparing LENA output with manual annotation, specifically the accuracy of talker labels and of adult word counts (AWCs), conversational turn counts (CTCs), and child vocalization counts (CVCs); (b) describe them qualitatively; (c) quantitatively integrate them to assess central tendencies; and (d) quantitatively integrate them to assess potential moderators.

**Method.** Searches on Google Scholar, PubMed, Scopus, and PsycInfo were combined with expert knowledge and interarticle citations, resulting in 238 records screened and 73 records whose full text was inspected. To be included, studies had to target children under the age of 18 years and report on the accuracy of LENA labels (e.g., precision and/or recall) and/or on AWC, CTC, or CVC (correlations and/or error metrics).

**Results.** A total of 33 studies, in 28 articles, were discovered. A qualitative review revealed that most validation studies had not been peer reviewed as such and failed to report key methodology and results. Quantitative integration of the results was possible for a broad definition of recall and precision (M = 59% and 68%, respectively; N = 12-13), for AWC (mean r = .79, N = 13), CVC (mean r = .77, N = 5), and CTC (mean r = .36, N = 6). Publication bias and moderators could not be assessed meta-analytically.

**Conclusion.** Further research and improved reporting are needed in studies evaluating LENA segmentation and quantification accuracy, with work investigating CTC being particularly urgent.

**Supplemental Material.** https://osf.io/4nhms/

Figures

**Figure 1.**
PRISMA flowchart. AWC = adult word count; CTC = conversational turn count; CVC = child vocalization count.

**Figure 2.**
Outcomes by participant moderators. Top left panel: infant language (within panel: the left side shows North American English [NAE], and the right side shows other languages). Top right panel: match of infant population to LENA training sample (within panel: matching samples on the right, mismatching on the left). Bottom panels: infant mean age (bottom left) and infant age range (bottom right). Each point indicates one study; numbers indicate study identity (see Table 1). Filled square points indicate authors affiliated with LENA. The y-axes indicate the scale for the variable named in the panel title (e.g., precision). N.B.: Axis values vary because different studies may be included across panels, depending on what the articles report. Red lines indicate bootstrapped confidence intervals (CIs); gray bands in the bottom panels indicate 95% CIs from a linear fit to the data. See text for details and interpretive caveats. AWC = adult word count; LENA = Language Environment Analysis.

**Figure 3.**
Outcomes by methodological moderators. Top left panel: segment selection by algo(rithm) (left) versus randomly (right). Top right panel: segment size (continuous [left] vs. single segment [right]). Bottom left panel: duration of individual samples. Bottom right panel: total cumulative annotated data. Each point indicates one study; numbers indicate study identity (see Table 1). Filled square points indicate authors affiliated with LENA. The y-axes indicate the scale for the variable named in the panel title (e.g., precision). N.B.: Axis values vary because different studies may be included across panels, depending on what the articles report. Red lines indicate bootstrapped confidence intervals (CIs); gray bands in the bottom panels indicate 95% CIs from a linear fit to the data. See text for details and interpretive caveats. AWC = adult word count; LENA = Language Environment Analysis.
