Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 May 29:10:1154.
doi: 10.3389/fpsyg.2019.01154. eCollection 2019.

Acoustic Correlates and Adult Perceptions of Distress in Infant Speech-Like Vocalizations and Cries

Affiliations

Acoustic Correlates and Adult Perceptions of Distress in Infant Speech-Like Vocalizations and Cries

Hyunjoo Yoo et al. Front Psychol. .

Abstract

Prior research has not evaluated acoustic features contributing to perception of human infant vocal distress or lack thereof on a continuum. The present research evaluates perception of infant vocalizations along a continuum ranging from the most prototypical intensely distressful cry sounds ("wails") to the most prototypical of infant sounds that typically express no distress (non-distress "vocants"). Wails are deemed little if at all related to speech while vocants are taken to be clear precursors to speech. We selected prototypical exemplars of utterances representing the whole continuum from 0 and 1 month-olds. In this initial study of the continuum, our goals are to determine (1) listener agreement on level of vocal distress across the continuum, (2) acoustic parameters predicting ratings of distress, (3) the extent to which individual listeners maintain or change their acoustic criteria for distress judgments across the study, (4) the extent to which different listeners use similar or different acoustic criteria to make judgments, and (5) the role of short-term experience among the listeners in judgments of infant vocalization distress. Results indicated that (1) both inter-rater and intra-rater listener agreement on degree of vocal distress was high, (2) the best predictors of vocal distress were number of vibratory regimes within utterances, utterance duration, spectral ratio (spectral concentration) in vibratory regimes within utterances, and mean pitch, (3) individual listeners significantly modified their acoustic criteria for distress judgments across the 10 trial blocks, (4) different listeners, while showing overall similarities in ratings of the 42 stimuli, also showed significant differences in acoustic criteria used in assigning the ratings of vocal distress, and (5) listeners who were both experienced and inexperienced in infant vocalizations coding showed high agreement in rating level of distress, but differed in the extent to which they relied on the different acoustic cues in making the ratings. The study provides clearer characterization of vocal distress expression in infants based on acoustic parameters and a new perspective on active adult perception of infant vocalizations. The results also highlight the importance of vibratory regime segmentation and analysis in acoustically based research on infant vocalizations and their perception.

Keywords: acoustic analysis; active perception; adult perception; babbling; cry; distress sounds; fuss; infant vocalizations.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Number of mean ratings across the ten trials within all listeners across the entire 100-point scale in intervals of size 5. The figure illustrates that the entire rating scale was used by the listeners, that is, that ratings occurred within all the intervals of possible ratings. To understand the figure, note that each of 39 listeners produced 42 mean ratings over 10 trials on each of the 42 stimuli. Thus the figure represents 42 × 39 = 1638 mean ratings organized in 20 intervals. For example, the interval from 0 to 5 accounted for 94 mean ratings. The interval with the largest number of ratings (129) was 11 to 15, and the intervals with the smallest number of mean ratings (57) were tied at intervals 56 to 60, 61 to 65, and 96 to 100.
FIGURE 2
FIGURE 2
Pearson correlations between each of the nine acoustic parameters selected for the full model and the mean perception ratings of distress level. Average Pitch (f0 mean) represents mean fundamental frequency within each utterance. Max pitch represents the maximum f0 within each utterance. Max amplitude (peak of the root-mean-square amplitude) represents maximum amplitude (in volts) across each utterance. Spectral ratio represents the ratio for each utterance of spectral energy below 2 kHz to the energy above 2 kHz in the regime segment with the minimum ratio. Spectral mean represents the maximum mean of spectral concentration (long-term spectral average) in kHz across the regime segments in each utterance. Spectral dispersion (SD) represents the maximum standard deviation of spectral concentration in kHz across the regime segments in each utterance. Periodicity represents the minimum cepstral peak prominence in dB across the regime segments in each utterance, a measure of periodicity. Number of regimes represents the number of regime segments within each utterance. Error bars = ± 1 SEM.
FIGURE 3
FIGURE 3
Pearson correlations between each of nine acoustic parameters and the mean ratings of distress level by experienced and inexperienced listeners. Spectral Ratio, Spectral Mean, and Spectral Dispersion (SD) were significantly different across groups (see Appendix 4 to view statistics on all 9 parameters). Error bars = ± 1 SEM. p < 0.05, ∗∗p < 0.001.

Similar articles

Cited by

References

    1. Awan S. N. (2011). Analysis of Dysphonia in Speech and Voice (ADSV): An Application Guide. Montvale, NJ: KayPentax.
    1. Awan S. N., Roy N., Dromey C. (2009). Estimating dysphonia severity in continuous speech: application of a multi-parameter spectral/cepstral model. Clin. Linguist. Phon. 23 825–841. 10.3109/02699200903242988 - DOI - PubMed
    1. Bajcsy R. (1988). Active perception. Proc. IEEE 75 996–1005. 10.1109/5.5968 - DOI
    1. Bard K. A. (2000). “Crying in infant primates: Insights into the development of crying in chimpanzees,” in Crying as a Sign, a Sympton, and a Signal: Developmental and Clinical Aspects of Early Crying Behavior, eds Barr R., Hopkins B., Green J. (London: MacKeith Press; ), 157–175.
    1. Bertossa R. C. (2011). Morphology and behaviour: functional links in development and evolution. Philos. Trans. R. Soc. B 366 2056–2068. 10.1098/rstb.2011.0035 - DOI - PMC - PubMed

LinkOut - more resources