Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update

Daryush D Mehta¹, Jarrad H Van Stan², Matías Zañartu³, Marzyeh Ghassemi⁴, John V Guttag⁴, Víctor M Espinoza⁵, Juan P Cortés³, Harold A Cheyne 2nd⁶, Robert E Hillman¹

Affiliations

¹ Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital , Boston, MA , USA ; Department of Surgery, Harvard Medical School , Boston, MA , USA ; MGH Institute of Health Professions, Massachusetts General Hospital , Boston, MA , USA.
² Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital , Boston, MA , USA ; MGH Institute of Health Professions, Massachusetts General Hospital , Boston, MA , USA.
³ Department of Electronic Engineering, Universidad Técnica Federico Santa María , Valparaíso , Chile.
⁴ Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology , Cambridge, MA , USA.
⁵ Department of Electronic Engineering, Universidad Técnica Federico Santa María , Valparaíso , Chile ; Department of Music and Sonology, Faculty of Arts, Universidad de Chile , Santiago , Chile.
⁶ Bioacoustics Research Laboratory, Laboratory of Ornithology, Cornell University , Ithaca, NY , USA.

PMID: 26528472
PMCID: PMC4607864
DOI: 10.3389/fbioe.2015.00155

Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update

Daryush D Mehta et al. Front Bioeng Biotechnol. 2015.

. 2015 Oct 16:3:155.

doi: 10.3389/fbioe.2015.00155. eCollection 2015.

Authors

Daryush D Mehta¹, Jarrad H Van Stan², Matías Zañartu³, Marzyeh Ghassemi⁴, John V Guttag⁴, Víctor M Espinoza⁵, Juan P Cortés³, Harold A Cheyne 2nd⁶, Robert E Hillman¹

Affiliations

¹ Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital , Boston, MA , USA ; Department of Surgery, Harvard Medical School , Boston, MA , USA ; MGH Institute of Health Professions, Massachusetts General Hospital , Boston, MA , USA.
² Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital , Boston, MA , USA ; MGH Institute of Health Professions, Massachusetts General Hospital , Boston, MA , USA.
³ Department of Electronic Engineering, Universidad Técnica Federico Santa María , Valparaíso , Chile.
⁴ Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology , Cambridge, MA , USA.
⁵ Department of Electronic Engineering, Universidad Técnica Federico Santa María , Valparaíso , Chile ; Department of Music and Sonology, Faculty of Arts, Universidad de Chile , Santiago , Chile.
⁶ Bioacoustics Research Laboratory, Laboratory of Ornithology, Cornell University , Ithaca, NY , USA.

PMID: 26528472
PMCID: PMC4607864
DOI: 10.3389/fbioe.2015.00155

Abstract

Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual's activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders.

Keywords: accelerometer; glottal inverse filtering; machine learning; vocal function; vocal hyperfunction; voice disorders; voice monitoring.

PubMed Disclaimer

Figures

**Figure 1**
**Treatment tracks for patients exhibiting **(A)** phonotraumatic and **(B)** non-phonotraumatic hyperfunctional vocal behaviors**. Week numbers (W1, W2, W3, and W4) refer to time points during which ambulatory monitoring of voice use is being acquired using the smartphone-based voice health monitor. The current enrollment of each patient and matched-control pairing is listed above each week number.

**Figure 2**
**In-laboratory data acquisition setup**. **(A)** Synchronized recordings are made of signals from an acoustic microphone (MIC), electroglottography electrodes (EGG), accelerometer sensor (ACC), high-bandwidth oral airflow (FLO), and intraoral pressure (PRE). **(B)** Signal snapshot of a string of “pae” tokens required for the estimation of subglottal pressure and airflow during phonation. © 2013 IEEE. Reprinted, with permission, from Mehta et al. (2013).

**Figure 3**
Ambulatory voice health monitor: (A) smartphone, accelerometer sensor, and cable with interface circuit encased in epoxy; (B) the wired accelerometer mounted on a silicone pad affixed to the neck midway between the Adam’s apple and V-shaped notch of the collarbone. © 2013 IEEE. Reprinted, with permission, from Mehta et al. (X).

**Figure 4**
Parameterization of the (A) original and (B) inverse-filtered waveforms from the oral airflow (black) and neck-surface acceleration (ACC, red-dashed) waveform processed with subglottal impedance-based inverse filtering. Shown are the time waveform, frequency spectrum, and cepstrum, along with the parameterization of each domain to yield clinically salient measures of voice production.

**Figure 5**
**Illustration of a daily voice use profile for an adult female diagnosed with bilateral vocal fold nodules**. Shown are 5-min moving averages of the median and 95th percentile of frame-based voice quality measures, along with self-reported ratings of effort, discomfort, and fatigue at the beginning and end of day. The daylong histograms of each measure are shown to the right of each time series. The plots below display the occurrence histograms of contiguous voiced segments (left) and estimates of speech phrases between breaths (right).

**Figure 6**
**Time-varying estimation of measures derived from the airflow-derived (black) and accelerometer-derived (red-dashed) glottal airflow signal using subglottal impedance-based inverse filtering**. Trajectories are shown for an adult female with no vocal pathology for the difference between the first two harmonic amplitudes (H1-H2), peak-to-peak flow (AC Flow), maximum flow declination rate (MFDR), open quotient (OQ), speed quotient (SQ), and normalized amplitude quotient (NAQ).

**Figure 7**
**Exemplary results using subglottal impedance-based inverse filtering of a weeklong neck-surface acceleration signal from an adult female with a normal voice**. Histograms of the maximum flow declination rate (MFDR) measure are displayed in physical and logarithmic units. The logarithm of MFDR is plotted against sound pressure level (SPL) to confirm the expected linear correlation (r = 0.94) and slope (1.13 dB/dB).

**Figure 8**
**Classification results on 102 adult female subjects, 51 with vocal fold nodules or polyps and 51 matched-control subjects with normal voices**. Per-patient unbiased model performance using summary statistics of sound pressure level and fundamental frequency from non-overlapping, 5-min windows.

**Figure 9**
**Occurrence histogram of voiced/unvoiced contiguous segment pairs**. The figure includes the number of times (per hour) that a voiced segment of a given duration is followed by an unvoiced segment of a given duration.

See this image and copyright information in PMC

References

1. Alku P., Backstrom T., Vilkman E. (2002). Normalized amplitude quotient for parametrization of the glottal flow. J. Acoust. Soc. Am. 112, 701–710. 10.1121/1.1490365 - DOI - PubMed
1. Awan S. N., Roy N., Jetté M. E., Meltzner G. S., Hillman R. E. (2010). Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparisons with auditory-perceptual judgements from the CAPE-V. Clin. Linguist. Phon. 24, 742–758. 10.3109/02699206.2010.492446 - DOI - PubMed
1. Bhattacharyya N. (2014). The prevalence of voice problems among adults in the United States. Laryngoscope 124, 2359–2362. 10.1002/lary.24740 - DOI - PubMed
1. Carroll T., Nix J., Hunter E., Emerich K., Titze I., Abaza M. (2006). Objective measurement of vocal fatigue in classical singers: a vocal dosimetry pilot study. Otolaryngol. Head Neck Surg. 135, 595–602. 10.1016/j.otohns.2006.06.1268 - DOI - PMC - PubMed
1. Clifford G. D., Clifton D. (2012). Wireless technology in disease management and medicine. Annu. Rev. Med. 63, 479–492. 10.1146/annurev-med-051210-114650 - DOI - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update

Affiliations

Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update

Authors

Affiliations

Abstract

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources