Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Aug 24:6:24.
doi: 10.3389/fninf.2012.00024. eCollection 2012.

Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Affiliations

Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Hiroyuki Akama et al. Front Neuroinform. .

Abstract

Both embodied and symbolic accounts of conceptual organization would predict partial sharing and partial differentiation between the neural activations seen for concepts activated via different stimulus modalities. But cross-participant and cross-session variability in BOLD activity patterns makes analyses of such patterns with MVPA methods challenging. Here, we examine the effect of cross-modal and individual variation on the machine learning analysis of fMRI data recorded during a word property generation task. We present the same set of living and non-living concepts (land-mammals, or work tools) to a cohort of Japanese participants in two sessions: the first using auditory presentation of spoken words; the second using visual presentation of words written in Japanese characters. Classification accuracies confirmed that these semantic categories could be detected in single trials, with within-session predictive accuracies of 80-90%. However cross-session prediction (learning from auditory-task data to classify data from the written-word-task, or vice versa) suffered from a performance penalty, achieving 65-75% (still individually significant at p « 0.05). We carried out several follow-on analyses to investigate the reason for this shortfall, concluding that distributional differences in neither time nor space alone could account for it. Rather, combined spatio-temporal patterns of activity need to be identified for successful cross-session learning, and this suggests that feature selection strategies could be modified to take advantage of this.

Keywords: GLM; MVPA; computational neurolinguistics; embodiment; fMRI; individual variability; machine learning.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The activation maps of the two contrasts (hot color: mammal > tool; cool color: tool > mammal) computed from the 10 datasets of our participants. The apparently sharp cutoff of values in the most ventral slices was not due to the mismatch with the contours of the normalized space, but to the relative narrowness and the shape of the coverage extent (due to only 15 oblique slices as the result of TR = 1 s), which was the logical AND of the individual coverage spheres.
Figure 2
Figure 2
The classification accuracies obtained under the within-session uni-modal conditions from the five participants (BOLD delay = 4; number of volumes = 4).
Figure 3
Figure 3
The classification accuracies obtained under the inter-session cross-modal conditions from the five participants (BOLD delay = 4; number of volumes = 4).
Figure 4
Figure 4
Comparison between the model accuracy function and the canonical HRF in the range of 0–20 s after stimulus onset.
Figure 5
Figure 5
BOLD accuracy grids containing the overall results of 1620 (= 9 × 9 × 4 × 5) machine learning computations using PLR. The first two columns (“audio-audio” and “ortho-ortho”) stand for the results of the within-session uni-modal predictions for P1 (row 1), P2 (row2), P3 (row3), P4 (row4), and P5 (row5). The columns 3 (“audio-ortho”) and 4 (“ortho-audio”) are for the results of the inter-session cross-modal prediction. “audio” and “ortho” stand for auditory and orthographic conditions, respectively. The horizontal axis represents the BOLD delay relative to stimulus onset (1–9 s) and the vertical one number of volumes, or width (1–9 s). The initial default boxcar parameters (delay = 4 s, width = 4 s) is outlined in black on each plot.
Figure 6
Figure 6
Number of most informative voxels extracted by anatomical area (AAL brain atlas), ranging from 0 (black) to 20 (white), on a log-adjusted scale. Columns represent participant numbers, and stimulus modality (“a”, auditory; “o”, orthographic).
Figure 7
Figure 7
Correlations between the vectors of the mammal/tool difference time-courses recorded at the voxels selected for the audio-ortho (blue) and ortho-audio predictions (red). Each error bar represents Standard Error.

Similar articles

Cited by

References

    1. Aguirre G. K., Zarahn E., D'Esposito M. (1998). The variability of human BOLD haemodynamic responses. Neuroimage 8, 360–369 10.1006/nimg.1998.0369 - DOI - PubMed
    1. Aron A. R., Gluck M. A., Poldrack R. A. (2006). Long-term test–retest reliability of functional MRI in a classification learning task, Neuroimage 29, 1000–1006 10.1016/j.neuroimage.2005.08.010 - DOI - PMC - PubMed
    1. Barsalou L. W. (1999). Perceptual symbol systems. Behav. Brain Sci. 22, 577–660 - PubMed
    1. Barsalou L. W. (2003). Situated simulation in the human conceptual system. Lang. Cogn. Process. 18, 513–562
    1. Barsalou L. W., Solomon K. O., Wu L.-L. (1999). Perceptual simulation in conceptual tasks. Cultural, typological, and psychological perspectives in cognitive linguistics, in The Proceedings of the 4th Conference of the International Cognitive Linguistics Association, Vol. 3, (Albuquerque, NM: ), 209–228

LinkOut - more resources