Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 27;63(4):1093-1105.
doi: 10.1044/2020_JSLHR-19-00017. Epub 2020 Apr 17.

Accuracy of the Language Environment Analysis System Segmentation and Metrics: A Systematic Review

Affiliations

Accuracy of the Language Environment Analysis System Segmentation and Metrics: A Systematic Review

Alejandrina Cristia et al. J Speech Lang Hear Res. .

Abstract

Purpose The Language Environment Analysis (LENA) system provides automated measures facilitating clinical and nonclinical research and interventions on language development, but there are only a few, scattered independent reports of these measures' validity. The objectives of the current systematic review were to (a) discover studies comparing LENA output with manual annotation, namely, accuracy of talker labels, as well as involving adult word counts (AWCs), conversational turn counts (CTCs), and child vocalization counts (CVCs); (b) describe them qualitatively; (c) quantitatively integrate them to assess central tendencies; and (d) quantitatively integrate them to assess potential moderators. Method Searches on Google Scholar, PubMed, Scopus, and PsycInfo were combined with expert knowledge, and interarticle citations resulting in 238 records screened and 73 records whose full text was inspected. To be included, studies must target children under the age of 18 years and report on accuracy of LENA labels (e.g., precision and/or recall) and/or AWC, CTC, or CVC (correlations and/or error metrics). Results A total of 33 studies, in 28 articles, were discovered. A qualitative review revealed most validation studies had not been peer reviewed as such and failed to report key methodology and results. Quantitative integration of the results was possible for a broad definition of recall and precision (M = 59% and 68%, respectively; N = 12-13), for AWC (mean r = .79, N = 13), CVC (mean r = .77, N = 5), and CTC (mean r = .36, N = 6). Publication bias and moderators could not be assessed meta-analytically. Conclusion Further research and improved reporting are needed in studies evaluating LENA segmentation and quantification accuracy, with work investigating CTC being particularly urgent. Supplemental Material https://osf.io/4nhms/.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
PRISMA flowchart. AWC = adult word count; CTC = conversational turn count; CVC = child vocalization count.
Figure 2.
Figure 2.
Outcomes by participant moderators. Top left panel: infant language (within panel: The left side shows North American English [NAE], while the right depicts other languages). Top right panel; match of infant population to LENA training sample (within panel: matching samples on the right, mismatching on the left). Bottom panels: infant mean age (bottom left) and infant age range (bottom right). Each point indicates one study; numbers indicate study identity (see Table 1). Filled square points indicate authors affiliated with LENA. y-axes indicate the scale for the variable indicated in the panel title (e.g., precision). N. B. axes values vary since different studies may be included across panels, depending on what articles are reported. Red lines indicate bootstrapped confidence intervals (CIs); gray bands in the bottom panel indicate 95% CIs from a linear fit to the data. See text for details and interpretive caveats. AWC = adult word count; LENA = Language Environment Analysis.
Figure 3.
Figure 3.
Outcomes by methodological moderators. Top left panel: segment selection by algo(rithm) (left) versus randomly (right). Top right panel: segment size (continuous [left] vs. single segment [right]). Bottom left panel: duration of individual samples. Bottom right panel: total cumulative annotated data. Each point indicates one study; numbers indicate study identity (see Table 1). Filled square points indicate authors affiliated with LENA. y-axes indicate the scale for the variable indicated in the panel title (e.g., precision). N. B. axes values vary since different studies may be included across panels, depending on what articles are reported. Red lines indicate bootstrapped confidence intervals (CIs); gray bands in the bottom panel indicate 95% CIs from a linear fit to the data. See text for details and interpretive caveats. AWC = adult word count; LENA = Language Environment Analysis.

References

    1. Adams K. A., Marchman V. A., Loi E. C., Ashland M. D., Fernald A., & Feldman H. M. (2018). Caregiver talk and medical risk as predictors of language outcomes in full term and preterm toddlers. Child Development, 89(5), 1674–1690. https://doi.org/10.1111/cdev.12818 - PMC - PubMed
    1. Berends C. (2015). The LENA system in parent–child interaction in Dutch preschool children with language delay (Master's thesis). Utrecht University, Utrecht, the Netherlands.
    1. Bergelson E., Casillas M., Soderstrom M., Seidl A., Warlaumont A. S., & Amatuni A. (2018). What do North American babies hear? A large-scale cross-corpus analysis. Developmental Science, 22(1), 1–12. https://doi.org/10.1111/desc.12724 - PMC - PubMed
    1. Bredin-Oja S. L., Fielding H., Fleming K. K., & Warren S. F. (2018). Clinician vs. machine: Estimating vocalizations rates in young children with developmental disorders. American Journal of Speech-Language Pathology, 27(3), 1066–1072. https://doi.org/10.1044/2018_AJSLP-17-0016 - PMC - PubMed
    1. Bulgarelli F., & Bergelson E. (2019). Look who's talking: A comparison of automated and human-generated speaker tags in naturalistic daylong recordings. Behavior Research Methods. Advance online publication. https://doi.org/10.3758/s13428-019-01265-7 - PMC - PubMed

Publication types

LinkOut - more resources