Neuroinformatics. 2024 Jul;22(3):229-238.
doi: 10.1007/s12021-024-09659-5. Epub 2024 Mar 26.

An Automated Tool to Classify and Transform Unstructured MRI Data into BIDS Datasets

Alexander Bartnik et al. Neuroinformatics. 2024 Jul.

Abstract

The increasing use of neuroimaging in clinical research has driven the creation of many large imaging datasets. However, these datasets often rely on inconsistent naming conventions in image file headers to describe acquisition, so time-consuming manual curation is necessary. We therefore sought to automate the classification and organization of magnetic resonance imaging (MRI) data according to acquisition types common in clinical routine, and to automate the transformation of raw, unstructured images into Brain Imaging Data Structure (BIDS) datasets. To do this, we trained an XGBoost model to classify MRI acquisition types using relatively few acquisition parameters that the MRI scanner automatically stores in image file metadata. The predicted types are then mapped to the naming conventions prescribed by BIDS to transform the input images into the BIDS structure. The model recognizes MRI types with 99.475% accuracy, a micro/macro-averaged precision of 0.9995/0.994, a micro/macro-averaged recall of 0.9995/0.989, and a micro/macro-averaged F1 of 0.9995/0.991. Our approach accurately and quickly classifies MRI types and transforms unstructured data into standardized structures with little-to-no user intervention, lowering the barrier to entry for clinical scientists and increasing the accessibility of existing neuroimaging data.
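
The mapping step described above, from a predicted acquisition type to a BIDS-style file name, might look roughly like the following sketch. It is not the authors' implementation: the class labels (`t1_weighted`, `diffusion`, and so on), the lookup table `LABEL_TO_BIDS`, and the helper `bids_path` are hypothetical names chosen for illustration, and the classifier itself is omitted. Only the `sub-<label>[/ses-<label>]/<datatype>/` layout and the `T1w`, `T2w`, `FLAIR`, and `dwi` suffixes are taken from the BIDS specification.

```python
from pathlib import PurePosixPath

# Hypothetical classifier labels mapped to (BIDS datatype directory, BIDS suffix).
# The label strings are assumptions; the datatype/suffix pairs follow BIDS.
LABEL_TO_BIDS = {
    "t1_weighted": ("anat", "T1w"),
    "t2_weighted": ("anat", "T2w"),
    "flair": ("anat", "FLAIR"),
    "diffusion": ("dwi", "dwi"),
}


def bids_path(label, subject, session=None, ext=".nii.gz"):
    """Build a BIDS-style relative path for one image with a predicted label."""
    datatype, suffix = LABEL_TO_BIDS[label]
    # Entities appear both as directory levels and as file name components.
    entities = [f"sub-{subject}"]
    if session is not None:
        entities.append(f"ses-{session}")
    filename = "_".join(entities + [suffix]) + ext
    return PurePosixPath(*entities, datatype, filename)


print(bids_path("t1_weighted", "01", "01"))
# prints: sub-01/ses-01/anat/sub-01_ses-01_T1w.nii.gz
```

A lookup table keeps the classifier's output vocabulary separate from the BIDS naming rules, so either side could change without touching the other. Functional images would need an additional `task-<label>` entity, which this sketch leaves out.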

Keywords: Automation; BIDS; Data Curation; Machine Learning; Magnetic Resonance Imaging; Reproducibility.
