Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Oct:2011:3108-3111.
doi: 10.1109/NSSMIC.2011.6152564.

Sparse Clustering with Resampling for Subject Classification in PET Amyloid Imaging Studies

Affiliations

Sparse Clustering with Resampling for Subject Classification in PET Amyloid Imaging Studies

Wenzhu Bi et al. IEEE Nucl Sci Symp Conf Rec (1997). 2011 Oct.

Abstract

Sparse k-means clustering (Sparse_kM) can exclude uninformative variables and yield reliable parsimonious clustering results, especially for p≫n. In this work, Sparse_kM and data resampling were combined to identify variables of greatest interest and define confidence levels for the clustering. The method was evaluated by statistical simulation and applied to PiB PET amyloid imaging data to identify normal control (NC) subjects with (+) or without (-) evidence of amyloid, i.e., PiB(+/-).

Simulations: A dataset of n=60 observations (3 groups of 20) and p=500 variables was generated for each simulation run; only 50 variables were truly different across groups. The dataset was resampled 20 times, Sparse_kM was applied to each sample and average variable weights were calculated. Probabilities of cluster membership, also called confidence levels, were computed (n=60). Simulations were performed 250 times. The 50 truly different variables were identified by variable weights that were 13-32 times greater than those for the 450 uninformative variables.

Human data: For the PiB PET dataset, images (ECAT HR+, 10-15 mCi, 90 min) were acquired for 64 cognitively normal subjects (74.1±5.4 yrs). Parametric PiB distribution volume ratio images were generated (Logan method, cerebellum reference) and normalized to the MNI template (SPM8) to produce a dataset of n=64 subjects and p=343,099 voxels/image. The dataset was resampled 10 times and Sparse_kM was applied. An average voxel weight image was computed that indicated cortical areas of greatest interest that included precuneus and frontal cortex; these are key areas linked to early amyloid deposition. Seven of 64 subjects were identified as PiB(+) and 47 as PiB(-) with confidence ≥ 90%, where another subject was PiB(+) at lower confidence (80%) and the other 9 subjects were PiB(-) at confidence in the range of 50-70%. In conclusion, Sparse_kM with resampling can help to establish confidence levels for clustering when p≫n and may be a promising method for revealing informative voxels/spatial patterns that distinguish levels of amyloid load, including that at the transitional amyloid +/- boundary.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Group averages for the 500 variables in an example simulated dataset X60×500. There are 3 groups with 20 samples per group. Groups differ in first q = 50 of p = 500 total variables.
Fig. 2
Fig. 2
Example of regional [11C]PiB DVR values for the 64 controls that shows range of retention from negligible (black) to high (red). The black line is PiB(+) cut-off based on single ROIs or CTX5 DVR values shown with and without correction for CSF dilution.
Fig. 3
Fig. 3
Images of the MNI MRI template (Left) superimposed with the average weights from cluster analysis of [11C]PiB DVR images for 64 control subjects (Right). Bright voxels contribute most to classification of PiB(+) and PiB(−) subjects. Weight range: 0 – 2.71 × 10−5, which is also 0 to 9.31 fold of 1/p(1/p = 2.91 × 10−6).

References

    1. Mathis CA, Wang Y, Holt DP, Huang GF, Debnath ML, Klunk WE. Synthesis and evaluation of 11C-labeled 6-substituted 2-arylbenzothiazoles as amyloid imaging agents. J Med Chem. 2003 Jun;46:2740–54. - PubMed
    1. Klunk WE, et al. Imaging the pathology of Alzheimer’s disease: amyloid-imaging with positron emission tomography. Neuroimaging Clin N Am. 2003 Nov;13:781–9. ix. - PubMed
    1. Aizenstein HJ, et al. Frequent amyloid deposition without significant cognitive impairment among the elderly. Arch Neurol. 2008 Nov;65:1509–17. - PMC - PubMed
    1. Witten DM, Tibshirani R. A framework for feature selection in clustering. J Am Stat Assoc. 2010 Jun;105:713–726. - PMC - PubMed
    1. Tseng GC, Wong WH. Tight clustering: a resampling-based approach for identifying stable and tight patterns in data. Biometrics. 2005 Mar;61:10–6. - PubMed

LinkOut - more resources