Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 15;62(7):2065-2081.
doi: 10.1044/2019_JSLHR-S-MSC18-18-0187. Epub 2019 Jul 15.

Optimized and Predictive Phonemic Interfaces for Augmentative and Alternative Communication

Affiliations

Optimized and Predictive Phonemic Interfaces for Augmentative and Alternative Communication

Gabriel J Cler et al. J Speech Lang Hear Res. .

Abstract

Purpose We empirically assessed the results of computational optimization and prediction in communication interfaces that were designed to allow individuals with severe motor speech disorders to select phonemes and generate speech output. Method Interface layouts were either random or optimized, in which phoneme targets that were likely to be selected together were located in proximity. Target sizes were either static or predictive, such that likely targets were dynamically enlarged following each selection. Communication interfaces were evaluated by 36 users without motor impairments using an alternate access method. Each user was assigned to 1 of 4 interfaces varying in layout and whether prediction was implemented (random/static, random/predictive, optimized/static, optimized/predictive) and participated in 12 sessions over a 3-week period. Six participants with severe motor impairments used both the optimized/static and optimized/predictive interfaces in 1-2 sessions. Results In individuals without motor impairments, prediction provided significantly faster communication rates during training (Sessions 1-9), as users were learning the interface target locations and the novel access method. After training, optimization acted to significantly increase communication rates. The optimization likely became relevant only after training when participants knew the target locations and moved directly to the targets. Participants with motor impairments could use the interfaces with alternate access methods and generally rated the interface with prediction as preferred. Conclusions Optimization and prediction led to increases in communication rates in users without motor impairments. Predictive interfaces were preferred by users with motor impairments. Future research is needed to translate these results into clinical practice. Supplemental Material https://doi.org/10.23641/asha.8636948.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Surface electromyographic minisensor locations (and associated grounds on chest and mastoids), placed to capture muscle activity during a particular facial gesture and subsequent cursor action: left (half smile), right (half smile), up (eyebrow raise), down (chin contraction), and click (wink). Combining gestures allows the cursor to move in any 360° direction, and magnitude of activity controls cursor speed (Cler & Stepp, 2015).
Figure 2.
Figure 2.
Four interfaces used by different groups of participants. Top left: random/static interface. Top right: random/predictive interface. Bottom left: optimized/static interface. Bottom right: optimized/predictive interface. Phoneme labels are a standard set (Shoup, 1980). Colors are consistent across groups and were isoluminant. Colors denote rough phoneme category: simple vowels in green; complex vowels (diphthongs, r-colored vowels) in purple; fricatives and affricates in yellow; stops in red; and liquids, nasals, and semivowels in blue.
Figure 3.
Figure 3.
Experimental design. (a) Processes required to recreate a given prompt with the phonemic interface: translate the stimulus to the phoneme set, find phonemes on given interface, and use access method to move to and select the targets. (b) Main task, with outcome measure communication rate (phonemes per minute). (c–e) Probes designed to assess participant acuity on each task: (c) Aural stimulus and phonemic representation with one phoneme missing are presented, and accuracy (% correct) and reaction time (responses per second) were collected. (d) Participants indicated when they visually located the given label (outcome measure: reaction time in responses per second). (e) Participants used facial surface electromyographic (sEMG) cursor to select circular targets and were assessed on speed (selections per second).
Figure 4.
Figure 4.
Communication rates per session averaged by group. Error bars are standard error.
Figure 5.
Figure 5.
Communication rates for the six participants with motor impairments (see Table 1 for participant characteristics). Participants all used optimized interfaces. Interfaces were either static (gray) or predictive (red, dark) in alternating blocks. When possible, participants completed blocks over 2 days; black dotted vertical lines indicate separation from Day 1 to Day 2. Error bars are standard deviation.
Figure 6.
Figure 6.
Session in which each participant reached criterion of 80% accuracy of selecting [AY] on /aɪ/-initial trials over other vowel labels. Red (dark) bars: predictive groups. Note that these participants largely reach criterion in the first two sessions. Gray striped bars: static groups. Note that these participants take longer to reach criterion, and one participant never reaches criterion (gray checked box).

Similar articles

Cited by

References

    1. Anton F., Mioc D. & Gold C. M. (1998). Dynamic additively weighted Voronoi diagrams made easy. In Canadian Conference on Computational Geometry (CCCG).
    1. Bates D., Mächler M., Bolker B., & Walker S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01
    1. Beddoes M. P., & Hu Z. (1994). A chord stenograph keyboard: A possible solution to the learning problem in stenography. IEEE Transactions on Systems, Man, and Cybernetics, 24(7), 953–960. https://doi.org/10.1109/21.297785
    1. Beukelman D. R., Fager S. K., Ball L., & Dietz A. (2007). AAC for adults with acquired neurological conditions: A review. Augmentative and Alternative Communication, 23(3), 230–242. https://doi.org/10.1080/07434610701553668 - PubMed
    1. Beukelman D. R., & Gutmann M. (1999). Generic message list for AAC users with ALS. Lincoln: University of Nebraska–Lincoln; Retrieved from https://cehs.unl.edu/documents/secd/aac/vocablists/ALS_Message_List1.pdf

Publication types