Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Hiroyuki Akama¹, Brian Murphy, Li Na, Yumiko Shimizu, Massimo Poesio

Affiliations

PMID: 22936912
PMCID: PMC3426793
DOI: 10.3389/fninf.2012.00024

Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Hiroyuki Akama et al. Front Neuroinform. 2012.

. 2012 Aug 24:6:24.

doi: 10.3389/fninf.2012.00024. eCollection 2012.

Authors

Hiroyuki Akama¹, Brian Murphy, Li Na, Yumiko Shimizu, Massimo Poesio

Affiliation

¹ Akama Laboratory, Graduate School of Decision Science and Technology, Tokyo Institute of Technology Tokyo, Japan.

PMID: 22936912
PMCID: PMC3426793
DOI: 10.3389/fninf.2012.00024

Abstract

Both embodied and symbolic accounts of conceptual organization would predict partial sharing and partial differentiation between the neural activations seen for concepts activated via different stimulus modalities. But cross-participant and cross-session variability in BOLD activity patterns makes analyses of such patterns with MVPA methods challenging. Here, we examine the effect of cross-modal and individual variation on the machine learning analysis of fMRI data recorded during a word property generation task. We present the same set of living and non-living concepts (land-mammals, or work tools) to a cohort of Japanese participants in two sessions: the first using auditory presentation of spoken words; the second using visual presentation of words written in Japanese characters. Classification accuracies confirmed that these semantic categories could be detected in single trials, with within-session predictive accuracies of 80-90%. However cross-session prediction (learning from auditory-task data to classify data from the written-word-task, or vice versa) suffered from a performance penalty, achieving 65-75% (still individually significant at p « 0.05). We carried out several follow-on analyses to investigate the reason for this shortfall, concluding that distributional differences in neither time nor space alone could account for it. Rather, combined spatio-temporal patterns of activity need to be identified for successful cross-session learning, and this suggests that feature selection strategies could be modified to take advantage of this.

Keywords: GLM; MVPA; computational neurolinguistics; embodiment; fMRI; individual variability; machine learning.

PubMed Disclaimer

Figures

**Figure 1**
**The activation maps of the two contrasts (hot color: mammal > tool; cool color: tool > mammal) computed from the 10 datasets of our participants**. The apparently sharp cutoff of values in the most ventral slices was not due to the mismatch with the contours of the normalized space, but to the relative narrowness and the shape of the coverage extent (due to only 15 oblique slices as the result of TR = 1 s), which was the logical AND of the individual coverage spheres.

**Figure 2**
**The classification accuracies obtained under the within-session uni-modal conditions from the five participants (BOLD delay = 4; number of volumes = 4)**.

**Figure 3**
**The classification accuracies obtained under the inter-session cross-modal conditions from the five participants (BOLD delay = 4; number of volumes = 4)**.

**Figure 4**
**Comparison between the model accuracy function and the canonical HRF in the range of 0–20 s after stimulus onset**.

**Figure 5**
**BOLD accuracy grids containing the overall results of 1620 (= 9 × 9 × 4 × 5) machine learning computations using PLR**. The first two columns (“audio-audio” and “ortho-ortho”) stand for the results of the within-session uni-modal predictions for P1 (row 1), P2 (row2), P3 (row3), P4 (row4), and P5 (row5). The columns 3 (“audio-ortho”) and 4 (“ortho-audio”) are for the results of the inter-session cross-modal prediction. “audio” and “ortho” stand for auditory and orthographic conditions, respectively. The horizontal axis represents the BOLD delay relative to stimulus onset (1–9 s) and the vertical one number of volumes, or width (1–9 s). The initial default boxcar parameters (delay = 4 s, width = 4 s) is outlined in black on each plot.

**Figure 6**
**Number of most informative voxels extracted by anatomical area (AAL brain atlas), ranging from 0 (black) to 20 (white), on a log-adjusted scale**. Columns represent participant numbers, and stimulus modality (“a”, auditory; “o”, orthographic).

**Figure 7**
**Correlations between the vectors of the mammal/tool difference time-courses recorded at the voxels selected for the audio-ortho (blue) and ortho-audio predictions (red)**. Each error bar represents Standard Error.

See this image and copyright information in PMC

Cited by

MEG Evidence That Modality-Independent Conceptual Representations Contain Semantic and Visual Features.
Dirani J, Pylkkänen L. Dirani J, et al. J Neurosci. 2024 Jul 3;44(27):e0326242024. doi: 10.1523/JNEUROSCI.0326-24.2024. J Neurosci. 2024. PMID: 38806251 Free PMC article.
Is the Sensorimotor Cortex Relevant for Speech Perception and Understanding? An Integrative Review.
Schomers MR, Pulvermüller F. Schomers MR, et al. Front Hum Neurosci. 2016 Sep 21;10:435. doi: 10.3389/fnhum.2016.00435. eCollection 2016. Front Hum Neurosci. 2016. PMID: 27708566 Free PMC article. Review.
Convergent and invariant object representations for sight, sound, and touch.
Man K, Damasio A, Meyer K, Kaplan JT. Man K, et al. Hum Brain Mapp. 2015 Sep;36(9):3629-40. doi: 10.1002/hbm.22867. Epub 2015 Jun 5. Hum Brain Mapp. 2015. PMID: 26047030 Free PMC article.
Temporal embedding and spatiotemporal feature selection boost multi-voxel pattern analysis decoding accuracy.
Choupan J, Douglas PK, Gal Y, Cohen MS, Reutens DC, Yang Z. Choupan J, et al. J Neurosci Methods. 2020 Nov 1;345:108836. doi: 10.1016/j.jneumeth.2020.108836. Epub 2020 Jul 26. J Neurosci Methods. 2020. PMID: 32726664 Free PMC article.
Searchlight Classification Informative Region Mixture Model (SCIM): Identification of Cortical Regions Showing Discriminable BOLD Patterns in Event-Related Auditory fMRI Data.
Urbschat A, Uppenkamp S, Anemüller J. Urbschat A, et al. Front Neurosci. 2021 Feb 1;14:616906. doi: 10.3389/fnins.2020.616906. eCollection 2020. Front Neurosci. 2021. PMID: 33597841 Free PMC article.

See all "Cited by" articles

References

1. Aguirre G. K., Zarahn E., D'Esposito M. (1998). The variability of human BOLD haemodynamic responses. Neuroimage 8, 360–369 10.1006/nimg.1998.0369 - DOI - PubMed
1. Aron A. R., Gluck M. A., Poldrack R. A. (2006). Long-term test–retest reliability of functional MRI in a classification learning task, Neuroimage 29, 1000–1006 10.1016/j.neuroimage.2005.08.010 - DOI - PMC - PubMed
1. Barsalou L. W. (1999). Perceptual symbol systems. Behav. Brain Sci. 22, 577–660 - PubMed
1. Barsalou L. W. (2003). Situated simulation in the human conceptual system. Lang. Cogn. Process. 18, 513–562
1. Barsalou L. W., Solomon K. O., Wu L.-L. (1999). Perceptual simulation in conceptual tasks. Cultural, typological, and psychological perspectives in cognitive linguistics, in The Proceedings of the 4th Conference of the International Cognitive Linguistics Association, Vol. 3, (Albuquerque, NM: ), 209–228

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Affiliation

Decoding semantics across fMRI sessions with different stimulus modalities: a practical MVPA study

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

LinkOut - more resources

Full Text Sources