Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Mar 13:rs.3.rs-2649734.
doi: 10.21203/rs.3.rs-2649734/v1.

Neurodesk: An accessible, flexible, and portable data analysis environment for reproducible neuroimaging

Affiliations

Neurodesk: An accessible, flexible, and portable data analysis environment for reproducible neuroimaging

Angela I Renton et al. Res Sq. .

Update in

  • Neurodesk: an accessible, flexible and portable data analysis environment for reproducible neuroimaging.
    Renton AI, Dao TT, Johnstone T, Civier O, Sullivan RP, White DJ, Lyons P, Slade BM, Abbott DF, Amos TJ, Bollmann S, Botting A, Campbell MEJ, Chang J, Close TG, Dörig M, Eckstein K, Egan GF, Evas S, Flandin G, Garner KG, Garrido MI, Ghosh SS, Grignard M, Halchenko YO, Hannan AJ, Heinsfeld AS, Huber L, Hughes ME, Kaczmarzyk JR, Kasper L, Kuhlmann L, Lou K, Mantilla-Ramos YJ, Mattingley JB, Meier ML, Morris J, Narayanan A, Pestilli F, Puce A, Ribeiro FL, Rogasch NC, Rorden C, Schira MM, Shaw TB, Sowman PF, Spitz G, Stewart AW, Ye X, Zhu JD, Narayanan A, Bollmann S. Renton AI, et al. Nat Methods. 2024 May;21(5):804-808. doi: 10.1038/s41592-023-02145-x. Epub 2024 Jan 8. Nat Methods. 2024. PMID: 38191935 Free PMC article.

Abstract

Neuroimaging data analysis often requires purpose-built software, which can be challenging to install and may produce different results across computing environments. Beyond being a roadblock to neuroscientists, these issues of accessibility and portability can hamper the reproducibility of neuroimaging data analysis pipelines. Here, we introduce the Neurodesk platform, which harnesses software containers to support a comprehensive and growing suite of neuroimaging software (https://www.neurodesk.org/). Neurodesk includes a browser-accessible virtual desktop environment and a command line interface, mediating access to containerized neuroimaging software libraries on various computing platforms, including personal and high-performance computers, cloud computing and Jupyter Notebooks. This community-oriented, open-source platform enables a paradigm shift for neuroimaging data analysis, allowing for accessible, flexible, fully reproducible, and portable data analysis pipelines.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors declare no financial conflicts of interest.

Figures

Figure 1.
Figure 1.
The Neurodesk platform. (a) The Neurodesk platform is built by and for the scientific community, enabling anyone to contribute recipes for new software containers to the repository. (b) Recipes contributed by the community are automatically used to build software containers and stored in the Neurocontainers repository. (c) Each software container packages a tool together with all the required runtime dependencies. The packaged software can therefore run identically in any supported computing environment. (d) Neurodesk provides two layers of accessibility: 1. Neurodesktop is a browser-accessible virtual desktop environment, allowing users to interact with the containerized software. 2. Neurocommand is a command-line interface that allows users to run the same software containers programmatically. These interfaces allow users to reproduce the same analysis pipelines across various computing environments.
Figure 2.
Figure 2.
Discrepancies in image registration and tissue segmentation. (a) Calculation of the Dice dissimilarity coefficients; for each image, the voxel-wise disagreement in image intensity (FLIRT) or label (FIRST) calculated on System A vs System B was expressed as a proportion of the total number of voxels for each participant. (b) Histograms of Dice dissimilarity coefficients for image intensity calculated with FSL-FLIRT on Neurodesk vs. Local Install. To calculate these Dice coefficients, “disagreement” meant a voxel had a different intensity after image registration on System A vs. System B. Thus, the Dice coefficient of 0 for every participant whose images were registered using Neurodesk, means that the image intensity of each participant was matched across systems at every voxel. (c) Histograms of Dice dissimilarity coefficients for subcortical structure labels calculated using FSL-FIRST on Neurodesk vs. Local Install. To calculate these Dice coefficients, “disagreement” meant a voxel had different labels (e.g., amygdala, hippocampus, etc.) after image segmentation on System A vs. System B. Note that these Dice coefficients are much smaller than for image registration. This is expected because there are 73 times more “classes” for the image registration task, which uses image intensity (Range: 0 – 1903) as a class, than the classification task, which has labels for 15 structures. However, while both Neurodesk and the local system show strong agreement across systems overall, these distributions are completely non-overlapping, with Neurodesk showing much greater reliability across systems.
Figure 3.
Figure 3.
Inter-system differences in image intensity in subcortical structures and subsequent classification of these subcortical structures. (a,b) Absolute voxel-wise differences in image intensity within subcortical structures after image registration with FSL-FLIRT on each system (i.e. ∣ Intensitysystem A – Intensitysystem B∣), averaged across participants. Projections are shown for image registration performed (a) using locally installed software, and (b) using Neurodesk (for which there were no intersystem differences). (c,d) Inter-system disagreement in subcortical structure labels after image segmentation with FSL-FIRST, averaged across participants. Projections are shown for image segmentation performed (c) using locally installed software and (d) using Neurodesk. (e) Scatter plot showing the mean inter-system image intensity differences across all voxels within the classified subcortical structures vs. the number of voxels subsequently classified with different labels across systems. For analyses performed with locally installed software, participants with larger differences in image intensity typically also had more prolific disagreement in labels between systems (Pearson’s r = 0.608, p < 0.001). This trend could not be assessed for Neurodesk, as there were no differences in image intensity across systems.
Figure 4.
Figure 4.
Cumulative difference in the numbers of system library calls between System A and System B for the analysis run using the (a) locally installed and (b) Neurodesk version of FSL FIRST. Note that calls to floorf() were excluded from the plot as they occurred earlier in time and the discrepancies for floorf() far outnumbered those for any other function from the locally installed tool.

References

    1. Brand A., Allen L., Altman M., Hlava M. & Scott J. Beyond authorship: attribution, contribution, collaboration, and credit. Learn. Publ. 28, 151–155 (2015).
    1. Halchenko Y. & Hanke M. Open is Not Enough. Let’s Take the Next Step: An Integrated, Community-Driven Computing Platform for Neuroscience. Front. Neuroinformatics 6, 22 (2012). - PMC - PubMed
    1. Hanke M. & Halchenko Y. Neuroscience Runs on GNU/Linux. Front. Neuroinformatics 5, 8 (2011). - PMC - PubMed
    1. Niso G. et al. Open and reproducible neuroimaging: From study inception to publication. NeuroImage 263, 119623 (2022). - PMC - PubMed
    1. The FAIR Guiding Principles for scientific data management and stewardship ∣ Scientific Data. https://www.nature.com/articles/sdata201618. - PMC - PubMed

Publication types

LinkOut - more resources