Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Sep 7;13(9):e0202180.
doi: 10.1371/journal.pone.0202180. eCollection 2018.

Speed-accuracy tradeoffs in human speech production

Affiliations

Speed-accuracy tradeoffs in human speech production

Adam C Lammert et al. PLoS One. .

Abstract

Speech motor actions are performed quickly, while simultaneously maintaining a high degree of accuracy. Are speed and accuracy in conflict during speech production? Speed-accuracy tradeoffs have been shown in many domains of human motor action, but have not been directly examined in the domain of speech production. The present work seeks evidence for Fitts' law, a rigorous formulation of this fundamental tradeoff, in speech articulation kinematics by analyzing USC-TIMIT, a real-time magnetic resonance imaging data set of speech production. A theoretical framework for considering Fitts' law with respect to models of speech motor control is elucidated. Methodological challenges in seeking relationships consistent with Fitts' law are addressed, including the operational definitions and measurement of key variables in real-time MRI data. Results suggest the presence of speed-accuracy tradeoffs for certain types of speech production actions, with wide variability across syllable position, and substantial variability also across subjects. Coda consonant targets immediately following the syllabic nucleus show the strongest evidence of this tradeoff, with correlations as high as 0.72 between speed and accuracy. A discussion is provided concerning the potentially limited applicability of Fitts' law in the context of speech production, as well as the theoretical context for interpreting the results.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Schematic representation of the Task Dynamics framework.
The variable X is the displacement of the controlled variable in task space and X0 is the target. The Forward Dynamics component implements a second-order dynamical system, conforming to Eq 3, that transforms (via inverse kinematics) the error signal, ΔX, into the second derivative of the articulator-space variable u. The integrals u˙ and u function as motor commands to the Plant, or speech production apparatus.
Fig 2
Fig 2. Schematic representation of the VITE neural model [13].
Note the many similarities of this structure to that of Task Dynamics in Fig 1. TPC is a representation of the target position, which produces a target position X0. The DV population compares the target to the system’s current position, and computes the task-space dynamics of the network. The PPC population integrates the DV population activation into position information. The network dynamics have the form described in Eqs 6 and 7.
Fig 3
Fig 3. A key concept behind the methodology developed in the present work is that motor tasks in speech articulation can be viewed as a sequence of movements toward and away from target points in articulatory space.
Those targets are assumed to be approached and approximated, but not necessarily reached, at the temporal center of each phone interval. The initial position for a given task is assumed to be the target immediately preceding the current one.
Fig 4
Fig 4. Illustration of the key relationships in calculating ID from articulatory data, with most variable names taken from the text.
Target vectors are defined in the high-dimensional articulatory space, represented in the illustration by features x1, x2, x3. In the analysis, this articulatory space is actually composed of L total features. The articulatory target vector Fg is the target of the previous movement, and represents the starting point of the current movement. The target of the current movement is Fh. The distance to the target is the Euclidean distance between these two vectors. The width around the target is calculated with respect to a hypersphere around the current target, which is used to estimate the density of other target vectors that are not the current one.
Fig 5
Fig 5. Illustration of the different syllable position-specific task categories used in the present analysis, shown on a traditional, generic syllable structure tree.
Categories are numbered outward from the nucleus, and include tasks leading into and out of the nucleus (1 & 2), tasks between consonants in the onset and coda (3 & 4) and tasks leading from one syllable to the next (5).
Fig 6
Fig 6. Images illustrating stages in the data pre-processing pipeline for a single vocal tract posture.
Shown are (a) an image of a single posture, in its original form, (b) the same image with low-variance pixels masked out (c) the image again, reconstructed as an image, but using only the L PCA-generated features.
Fig 7
Fig 7. Example high- and low-ID tasks for subject M2.
The top row, (a)-(b), represents one of the highest ID tasks, while the bottom row (c)-(d) represents one of the lowest. Images were reconstructed from the L articulatory features in Z (see text).
Fig 8
Fig 8. Movement time (MT) vs. index of difficulty (ID) for subject M2.
All context-target tasks are shown, divided by syllable position-based category (see text for details concerning categories).

References

    1. Fitts PM. The Information Capacity of the Human Motor System in Controlling the Amplitude of Movement. Journal of Experimental Psychology. 1954;47(6):381–391. 10.1037/h0055392 - DOI - PubMed
    1. Drury CG. Application of Fitts’ Law to foot-pedal design. Human Factors: The Journal of the Human Factors and Ergonomics Society. 1975;17(4):368–373. 10.1177/001872087501700408 - DOI
    1. Duarte M, Freitas S. Speed-accuracy trade-off in voluntary postural movements. Motor control. 2005;9(2):180–196. 10.1123/mcj.9.2.180 - DOI - PubMed
    1. Card SK, English WK, Burr BJ. Evaluation of mouse, rate-controlled isometric joystick, step keys, and text keys for text selection on a CRT. Ergonomics. 1978;21(8):601–613. 10.1080/00140137808931762 - DOI
    1. Ware C, Mikaelian HH. An evaluation of an eye tracker as a device for computer input In: ACM SIGCHI Bulletin. vol. 17 ACM; 1987. p. 183–188.

Publication types