Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Randomized Controlled Trial
. 2013 Nov 20:14:333.
doi: 10.1186/1471-2105-14-333.

Similarity maps and hierarchical clustering for annotating FT-IR spectral images

Affiliations
Randomized Controlled Trial

Similarity maps and hierarchical clustering for annotating FT-IR spectral images

Qiaoyong Zhong et al. BMC Bioinformatics. .

Abstract

Background: Unsupervised segmentation of multi-spectral images plays an important role in annotating infrared microscopic images and is an essential step in label-free spectral histopathology. In this context, diverse clustering approaches have been utilized and evaluated in order to achieve segmentations of Fourier Transform Infrared (FT-IR) microscopic images that agree with histopathological characterization.

Results: We introduce so-called interactive similarity maps as an alternative annotation strategy for annotating infrared microscopic images. We demonstrate that segmentations obtained from interactive similarity maps lead to similarly accurate segmentations as segmentations obtained from conventionally used hierarchical clustering approaches. In order to perform this comparison on quantitative grounds, we provide a scheme that allows to identify non-horizontal cuts in dendrograms. This yields a validation scheme for hierarchical clustering approaches commonly used in infrared microscopy.

Conclusions: We demonstrate that interactive similarity maps may identify more accurate segmentations than hierarchical clustering based approaches, and thus are a viable and due to their interactive nature attractive alternative to hierarchical clustering. Our validation scheme furthermore shows that performance of hierarchical two-means is comparable to the traditionally used Ward's clustering. As the former is much more efficient in time and memory, our results suggest another less resource demanding alternative for annotating large spectral images.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Principle of FT-IR microscopy. (A) At each pixel (indicated by circles), the infrared absorbance spectrum is measured, reflecting the biochemical status of the sample at the corresponding position. (B) Average spectra for different tissue components from a well established training dataset [3] exhibit relatively subtle differences on an absolute scale. (C) Mean spectra of crypts and tumor regions, with shaded areas around the mean spectra indicating standard deviation. Spectral variability within each class is small even in relation to the subtle differences between average spectra, so that differences between classes (here exemplified by crypts vs. tumor) remain distinguishable by classifiers.
Figure 2
Figure 2
Schematic overview of the cross-validation scheme for hierarchical clustering. (A) Composition of training data set, indicating an index color and proportion of spectra per class. (B) Dendrogram of the training data set and result of an optimal class assignment under a horizontal cut (indicated by dashed line in the left dendrogram) and an optimal tree assignment (right dendrogram) where each class is identified with the subtree colored according to its associated index color. Tree-assignment based segmentation not only achieves a much higher accuracy, but exhibits substantial differences in the assignment of several classes. The classes of crypts and submucosa are even identified as disjoint sets of spectra in both approaches, while substantial differences exist in the classes of tumour, inflammatory tissue, follicles, and support cells. The two segmentations indicate that even on well-curated training data, non-horizontal cuts in the dendrogram represent tissue classes much more reliably than horizontal cuts.
Figure 3
Figure 3
Indexed spectral images and confusion matrices of image 120514 . (A): Random-forest classified reference image. (B-D): Segmentations and confusion matrices obtained by different annotation approaches. In the confusion matrices, the numbers beside the tissue names indicate class sizes, and the tissues are sorted by size in descending order. Ward’s clustering in combination with the power metric achieves an Rand index of 0.83 and accuracy of 53.35% (data not shown).
Figure 4
Figure 4
Comparison of tree assignments, k -means and horizontal cut. Clustering the training dataset into 14 classes using k-means (left) or Ward’s clustering using a horizontal cut (middle) leads to partitionings with a Rand index of around.9 with relatively high standard deviation. A partitioning obtained from Ward’s clustering using tree-assignments leads to a significantly higher Rand index (right). Note that the Rand index approaches 1 for datasets with many classes. Yet, the difference after Monte-Carlo type validation is clearly significant.
Figure 5
Figure 5
Comparison of different hierarchical clustering approaches under varying the depth of segmentation Q . Three hierarchical clustering schemes are evaluated in terms of Rand index and accuracy on both the training dataset (A-B) and image 120514 (C-D). 10-fold Monte-Carlo cross validation is performed on the training dataset (the error bar indicates standard deviation).

Similar articles

Cited by

References

    1. Lasch P, Haensch W, Lewis EN, Kidder LH, Naumann D. Characterization of colorectal adenocarcinoma sections by spatially resolved ft-ir microspectroscopy. Appl Spectrosc. 2002;56(1):1–9. doi: 10.1366/0003702021954322. - DOI
    1. Steller W, Einenkel J, Horn LC, Braumann UD, Binder H, Salzer R, Krafft C. Delimitation of squamous cell cervical carcinoma using infrared microspectroscopic imaging. Anal Bioanal Chem. 2006;384(1):145–154. doi: 10.1007/s00216-005-0124-4. - DOI - PubMed
    1. Kallenbach-Thieltges A, Großerüschkamp F, Mosig A, Diem M, Tannapfel A, Gerwert K. Immunohistochemistry, histopathology and infrared spectral histopathology of colon cancer tissue sections. J Biophotonics. 2013;6(1):88–100. doi: 10.1002/jbio.201200132. - DOI - PubMed
    1. Trevisan J, Angelov PP, Carmichael PL, Scott AD, Martin FL. Extracting biological information with computational analysis of fourier-transform infrared (ftir) biospectroscopy datasets: current practices to future perspectives. Analyst. 2012;137(14):3202–3215. doi: 10.1039/c2an16300d. - DOI - PubMed
    1. Lasch P, Diem M, Hänsch W, Naumann D. Artificial neural networks as supervised techniques for ft-ir microspectroscopic imaging. J Chemometrics. 2006;20(5):209–220. doi: 10.1002/cem.993. - DOI - PMC - PubMed

Publication types

LinkOut - more resources