Preparing Laboratory and Real-World EEG Data for Large-Scale Analysis: A Containerized Approach
- PMID: 27014048
- PMCID: PMC4782059
- DOI: 10.3389/fninf.2016.00007
Abstract
Large-scale analysis of EEG and other physiological measures promises new insights into brain processes and more accurate and robust brain-computer interface models. However, the absence of standardized vocabularies for annotating events in a machine-understandable manner, the welter of collection-specific data organizations, the difficulty of moving data across processing platforms, and the lack of agreed-upon standards for preprocessing have prevented large-scale analyses of EEG. Here we describe a "containerized" approach and freely available tools we have developed to facilitate annotating, packaging, and preprocessing EEG data collections, enabling data sharing, archiving, large-scale machine learning/data mining, and (meta-)analysis. The EEG Study Schema (ESS) comprises three data "Levels," each with its own XML-document schema and file/folder convention, plus a standardized (PREP) pipeline to move raw (Data Level 1) data to a basic preprocessed state (Data Level 2) suitable for application of a large class of EEG analysis methods. Researchers can ship a study as a single unit and operate on its data using a standardized interface. ESS does not require a central database and provides all the metadata necessary to execute a wide variety of EEG processing pipelines. The primary focus of ESS is automated in-depth analysis and meta-analysis of EEG studies. However, ESS can also encapsulate meta-information for other modalities, such as eye tracking, that are increasingly used in both laboratory and real-world neuroimaging. The ESS schema and tools are freely available at www.eegstudy.org, and a central catalog of over 850 GB of existing data in ESS format is available at studycatalog.org. These tools and resources are part of a larger effort to enable data sharing at sufficient scale for researchers to engage in truly large-scale EEG analysis and data mining (BigEEG.org).
Keywords: BCI; EEG; large scale analysis; neuroinformatics.
