Two-step clustering-based pipeline for big dynamic functional network connectivity data
- PMID: 35958983
- PMCID: PMC9358255
- DOI: 10.3389/fnins.2022.895637
Two-step clustering-based pipeline for big dynamic functional network connectivity data
Abstract
Background: Dynamic functional network connectivity (dFNC) estimated from resting-state functional magnetic imaging (rs-fMRI) studies the temporally varying functional integration between brain networks. In a conventional dFNC pipeline, a clustering stage to summarize the connectivity patterns that are transiently but reliably realized over the course of a scanning session. However, identifying the right number of clusters (or states) through a conventional clustering criterion computed by running the algorithm repeatedly over a large range of cluster numbers is time-consuming and requires substantial computational power even for typical dFNC datasets, and the computational demands become prohibitive as datasets become larger and scans longer. Here we developed a new dFNC pipeline based on a two-step clustering approach to analyze large dFNC data without having access to huge computational power.
Methods: In the proposed dFNC pipeline, we implement two-step clustering. In the first step, we randomly use a sub-sample dFNC data and identify several sets of states at different model orders. In the second step, we aggregate all dFNC states estimated from all iterations in the first step and use this to identify the optimum number of clusters using the elbow criteria. Additionally, we use this new reduced dataset and estimate a final set of states by performing a second kmeans clustering on the aggregated dFNC states from the first k-means clustering. To validate the reproducibility of results in the new pipeline, we analyzed four dFNC datasets from the human connectome project (HCP).
Results: We found that both conventional and proposed dFNC pipelines generate similar brain dFNC states across all four sessions with more than 99% similarity. We found that the conventional dFNC pipeline evaluates the clustering order and finds the final dFNC state in 275 min, while this process takes only 11 min for the proposed dFNC pipeline. In other words, the new pipeline is 25 times faster than the traditional method in finding the optimum number of clusters and finding the final dFNC states. We also found that the new method results in better clustering quality than the conventional approach (p < 0.001). We show that the results are replicated across four different datasets from HCP.
Conclusion: We developed a new analytic pipeline that facilitates the analysis of large dFNC datasets without having access to a huge computational power source. We validated the reproducibility of the result across multiple datasets.
Keywords: big data; dynamic functional network connectivity; human connectome project; kmeans clustering; reproducibility.
Copyright © 2022 Sendi, Salat, Miller and Calhoun.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures







Similar articles
-
A two-step clustering-based pipeline for big dynamic functional network connectivity data.Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3741-3744. doi: 10.1109/EMBC48229.2022.9871997. Annu Int Conf IEEE Eng Med Biol Soc. 2022. PMID: 36085804
-
Dynamic Functional Network Connectivity in Schizophrenia with Magnetoencephalography and Functional Magnetic Resonance Imaging: Do Different Timescales Tell a Different Story?Brain Connect. 2019 Apr;9(3):251-262. doi: 10.1089/brain.2018.0608. Brain Connect. 2019. PMID: 30632385 Free PMC article.
-
NEUROMARK DFNC PATTERNS: A FULLY AUTOMATED PIPELINE TO ESTIMATE SUBJECT-SPECIFIC STATES FROM RS-FMRI DATA VIA CONSTRAINED ICA OF DFNC IN +100K SUBJECTS.bioRxiv [Preprint]. 2025 Feb 2:2025.01.29.635539. doi: 10.1101/2025.01.29.635539. bioRxiv. 2025. PMID: 39975182 Free PMC article. Preprint.
-
Prediction and classification of sleep quality based on phase synchronization related whole-brain dynamic connectivity using resting state fMRI.Neuroimage. 2020 Nov 1;221:117190. doi: 10.1016/j.neuroimage.2020.117190. Epub 2020 Jul 22. Neuroimage. 2020. PMID: 32711063
-
Characterizing Whole Brain Temporal Variation of Functional Connectivity via Zero and First Order Derivatives of Sliding Window Correlations.Front Neurosci. 2019 Jun 27;13:634. doi: 10.3389/fnins.2019.00634. eCollection 2019. Front Neurosci. 2019. PMID: 31316333 Free PMC article.
Cited by
-
Pairing explainable deep learning classification with clustering to uncover effects of schizophrenia upon whole brain functional network connectivity dynamics.Neuroimage Rep. 2023 Sep 29;3(4):100186. doi: 10.1016/j.ynirp.2023.100186. eCollection 2023 Dec. Neuroimage Rep. 2023. PMID: 40567816 Free PMC article.
-
Development of a machine learning-based predictive model for maxillary sinus cysts and exploration of clustering patterns.Head Face Med. 2025 Mar 12;21(1):17. doi: 10.1186/s13005-025-00492-y. Head Face Med. 2025. PMID: 40069749 Free PMC article.
-
Functional brain network dynamics of brooding in depression: insights from real-time fMRI neurofeedback.medRxiv [Preprint]. 2024 May 6:2024.05.05.24306889. doi: 10.1101/2024.05.05.24306889. medRxiv. 2024. Update in: J Affect Disord. 2025 Jul 1;380:191-202. doi: 10.1016/j.jad.2025.03.121. PMID: 38766116 Free PMC article. Updated. Preprint.
-
Increased functional connectivity between default mode network and visual network potentially correlates with duration of residual dizziness in patients with benign paroxysmal positional vertigo.Front Neurol. 2024 Mar 4;15:1363869. doi: 10.3389/fneur.2024.1363869. eCollection 2024. Front Neurol. 2024. PMID: 38500812 Free PMC article.
-
Explainable fuzzy clustering framework reveals divergent default mode network connectivity dynamics in schizophrenia.Front Psychiatry. 2024 Feb 15;15:1165424. doi: 10.3389/fpsyt.2024.1165424. eCollection 2024. Front Psychiatry. 2024. PMID: 38495909 Free PMC article.
References
-
- Béjar Alonso J. (2013). K-means vs Mini Batch K-Means: A Comparison. Available online at: http://hdl.handle.net/2117/23414 (accessed May, 2013).
LinkOut - more resources
Full Text Sources
Miscellaneous