Accurately decoding visual information from fMRI data obtained in a realistic virtual environment

Andrew Floren et al. Front Hum Neurosci. 2015 Jun 9;9:327. doi: 10.3389/fnhum.2015.00327. eCollection 2015.

Abstract

Three-dimensional interactive virtual environments (VEs) are a powerful tool for brain-imaging-based cognitive neuroscience that is presently under-utilized. This paper presents machine-learning-based methods for identifying brain states induced by realistic VEs with improved accuracy, as well as the capability for mapping their spatial topography on the neocortex. VEs provide the ability to study the brain under conditions closer to the environment in which humans evolved, and thus to probe deeper into the complexities of human cognition. As a test case, we designed a stimulus to reflect a military combat situation in the Middle East, motivated by the potential of using real-time functional magnetic resonance imaging (fMRI) in the treatment of post-traumatic stress disorder. Each subject experienced moving through the virtual town, encountering 1–6 animated combatants at different locations, while fMRI data were collected. To analyze the data from what is, compared to most studies, a more complex and less controlled stimulus, we employed statistical machine learning in the form of Multi-Voxel Pattern Analysis (MVPA), with special attention given to artificial Neural Networks (NN). Extensions to NN that exploit the block structure of the stimulus were developed to improve the accuracy of the classification, achieving accuracies from 58 to 93% (chance was 16.7%) with six subjects. This demonstrates that MVPA can decode a complex cognitive state, viewing a number of characters, in a dynamic virtual environment. To better understand the source of this information in the brain, a novel form of sensitivity analysis was developed that uses the NN to quantify the degree to which each voxel contributed to classification. Compared with maps produced by general linear models and the searchlight approach, these sensitivity maps revealed a more diverse pattern of information relevant to the classification of cognitive state.
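The 6-way decoding problem described above (classifying which of six character counts a subject was viewing from voxel-activity patterns) can be illustrated with a minimal sketch. This is not the authors' pipeline: the data are synthetic, and a simple nearest-centroid classifier stands in for the paper's neural networks and SVM; only the shape of the problem (six classes, chance = 16.7%) comes from the text.

```python
# Minimal MVPA-style decoding sketch on synthetic data (not the authors' pipeline).
# Each "trial" is a voxel-activity vector; the task is the paper's 6-way problem:
# classify which stimulus condition (number of characters) produced the pattern.
import numpy as np

rng = np.random.default_rng(0)
n_classes, n_trials_per_class, n_voxels = 6, 40, 200

# Synthetic data: each class has a distinct mean activity pattern plus noise.
means = rng.normal(0, 1, (n_classes, n_voxels))
X = np.vstack([means[c] + rng.normal(0, 2.0, (n_trials_per_class, n_voxels))
               for c in range(n_classes)])
y = np.repeat(np.arange(n_classes), n_trials_per_class)

# Nearest-centroid classifier: a simple stand-in for the paper's NN/SVM.
def fit_centroids(X, y):
    return np.stack([X[y == c].mean(axis=0) for c in np.unique(y)])

def predict(centroids, X):
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

# Split trials into train/test halves within each class.
train = np.tile(np.arange(n_trials_per_class) < n_trials_per_class // 2, n_classes)
centroids = fit_centroids(X[train], y[train])
accuracy = (predict(centroids, X[~train]) == y[~train]).mean()
print(f"decoding accuracy: {accuracy:.2f}  (chance = {1/n_classes:.3f})")
```

Any classifier that scores well above the 1/6 chance level on held-out trials is, in MVPA terms, decoding the cognitive state from the voxel patterns.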

Keywords: fMRI BOLD; human vision; machine learning; natural stimuli; virtual environments.


Figures

Figure 1
The stimulus in the experiment described in this paper employs a virtual environment and a blocked design where the view alternates between moving through the environment and viewing groups of animated characters. (A) An example frame from the stimulus where the camera is traveling through the virtual environment with no characters presented. (B) An example frame from the stimulus where five friendly characters are being presented. (C) An example frame from the stimulus where three hostile characters are being presented. Such stimuli allow studying how the brain responds in a more natural and complex environment.
Figure 2
The estimated performance of the classifiers averaged across all sessions and plotted across the four training-and-test-split methods; error bars show bootstrapped 68% confidence intervals. There is a statistically significant drop in the estimated performance when the average minimum temporal delay increases from 2.6 to 21 s, though performance stays above the chance performance of 16.7%. This result confirms that short delays result in optimistic performance estimates because of temporal correlations.
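The point made by Figure 2 — that temporally adjacent fMRI scans are correlated, so splits with short train/test delays yield optimistic estimates — can be sketched as a split that enforces a minimum temporal gap. This is an illustrative scheme, not a reproduction of the four split methods compared in the paper.

```python
# Sketch: a train/test split that enforces a minimum temporal gap between the
# training and test scans, so temporally correlated neighbors never straddle
# the split (illustrative only; the paper compares four split schemes with
# average minimum delays ranging from about 2.6 to 21 s).

def gapped_split(n_scans, test_start, test_len, gap):
    """Return train/test scan indices, excluding `gap` scans on each side of
    the test block from the training set."""
    test = list(range(test_start, test_start + test_len))
    excluded = set(range(max(0, test_start - gap),
                         min(n_scans, test_start + test_len + gap)))
    train = [i for i in range(n_scans) if i not in excluded]
    return train, test

train, test = gapped_split(n_scans=100, test_start=40, test_len=10, gap=5)
# Smallest temporal distance between any training scan and any test scan:
print(min(abs(i - j) for i in train for j in test))
```

Increasing `gap` trades training data for a less biased performance estimate, which is exactly the trend the figure reports.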
Figure 3
The estimated performance of all four classifiers averaged across all sessions. The performances of individual sessions are indicated by the symbols; each subject performed two sessions, so there are two symbols per subject. The performance estimates were bootstrapped across sessions to obtain 68% confidence intervals. While the SVM had the best average performance, all four classifiers performed well above the chance performance of 16.7%.
Figure 4
Individual session accuracies and relative accuracy for all four averaging methods. The sessions have been sorted by average performance for improved readability. The chance probability for all sessions is 16.7%. The chart on the right shows the impact of the individual aggregation methods, calculated relative to the baseline score as 10·log₁₀(score/baseline) for each session. These relative accuracy scores (in dB) are averaged across all sessions and bootstrapped to obtain 68% confidence intervals.
Figure 5
The average confusion matrices for the feed-forward NN with output averaging across all subjects. The value in cell (i,j) of the matrix is the percentage of examples from class i that were labeled as class j; values along the diagonal indicate correctly classified examples, while the rest indicate incorrectly classified examples. The color of each cell indicates its deviation from chance probability (16.7%), with greener cells indicating values above chance and redder cells indicating values below chance.
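The row-normalized confusion matrix described in Figure 5 can be sketched in a few lines; the example labels below are made up for illustration.

```python
# Sketch: a row-normalized confusion matrix as described in Figure 5 —
# cell (i, j) is the fraction of class-i examples labeled as class j,
# so each row sums to 1 and the diagonal holds the per-class accuracy.
import numpy as np

def confusion_matrix(true, pred, n_classes):
    m = np.zeros((n_classes, n_classes))
    for t, p in zip(true, pred):
        m[t, p] += 1
    return m / m.sum(axis=1, keepdims=True)

# Hypothetical labels for a 3-class toy example (not from the paper's data).
cm = confusion_matrix([0, 0, 1, 1, 1, 2], [0, 1, 1, 1, 2, 2], n_classes=3)
```

Row normalization is what makes the chance comparison in the figure meaningful: every cell is directly comparable to the uniform chance value of 1/n_classes.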
Figure 6
Cross-validated session accuracy plotted against average session confidence.
Figure 7
A qualitative comparison of sensitivity, GLM linear-response Z-statistic, and searchlight accuracy maps projected onto semi-inflated cortical surfaces for three different subjects. The maps are roughly similar across subjects and hemispheres, but substantial individual variations are evident.
Figure 8
A plot of the feedforward neural network estimated performance and the fraction of voxels remaining at each iteration of the recursive feature elimination procedure. The fraction of voxels is calculated with respect to the 2000 voxels selected by ANOVA. The performance estimates and voxel counts were bootstrapped across sessions in order to obtain 68% confidence intervals.
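The recursive feature elimination loop behind Figure 8 — repeatedly dropping the least informative voxels and retraining — can be sketched generically. The scoring function below is a crude univariate stand-in, not the network-based sensitivity the paper uses, and the data are synthetic.

```python
# Sketch of recursive feature elimination (RFE) as in Figure 8: starting from
# an initial voxel set, repeatedly drop the lowest-scoring fraction and rescore.
# "Informativeness" here is a stand-in univariate score, not the paper's
# NN-based sensitivity measure.
import numpy as np

def rfe(X, y, score_fn, drop_frac=0.2, min_features=10):
    """Yield the surviving feature indices after each elimination round."""
    keep = np.arange(X.shape[1])
    while len(keep) > min_features:
        scores = score_fn(X[:, keep], y)
        n_drop = max(1, int(len(keep) * drop_frac))
        keep = keep[np.argsort(scores)[n_drop:]]   # drop the lowest-scoring
        yield keep

# Stand-in score: absolute difference of the two class means per feature.
def mean_diff(X, y):
    return np.abs(X[y == 0].mean(axis=0) - X[y == 1].mean(axis=0))

rng = np.random.default_rng(1)
X = rng.normal(size=(60, 100))
y = rng.integers(0, 2, 60)
sizes = [len(k) for k in rfe(X, y, mean_diff)]
```

Tracking classifier performance against `sizes` at each round produces exactly the kind of curve Figure 8 plots: accuracy as a function of the fraction of voxels retained.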
Figure 9
A bar graph depicting the coverage percentage from the sensitivity, GLM, and searchlight maps across automatically generated labels from FreeSurfer. The coverage percentage is the percentage of the map contained within each area. In this way, the variation in total map size between approaches is controlled for, and the specificity and coverage of the maps can be directly compared. Coverage percentages were bootstrapped across sessions to provide confidence intervals.
