Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 2;25(1):134.
doi: 10.1186/s12864-024-10048-0.

DendroX: multi-level multi-cluster selection in dendrograms

Affiliations

DendroX: multi-level multi-cluster selection in dendrograms

Feiling Feng et al. BMC Genomics. .

Abstract

Background: Cluster heatmaps are widely used in biology and other fields to uncover clustering patterns in data matrices. Most cluster heatmap packages provide utility functions to divide the dendrograms at a certain level to obtain clusters, but it is often difficult to locate the appropriate cut in the dendrogram to obtain the clusters seen in the heatmap or computed by a statistical method. Multiple cuts are required if the clusters locate at different levels in the dendrogram.

Results: We developed DendroX, a web app that provides interactive visualization of a dendrogram where users can divide the dendrogram at any level and in any number of clusters and pass the labels of the identified clusters for functional analysis. Helper functions are provided to extract linkage matrices from cluster heatmap objects in R or Python to serve as input to the app. A graphic user interface was also developed to help prepare input files for DendroX from data matrices stored in delimited text files. The app is scalable and has been tested on dendrograms with tens of thousands of leaf nodes. As a case study, we clustered the gene expression signatures of 297 bioactive chemical compounds in the LINCS L1000 dataset and visualized them in DendroX. Seventeen biologically meaningful clusters were identified based on the structure of the dendrogram and the expression patterns in the heatmap. We found that one of the clusters consisting of mostly naturally occurring compounds is not previously reported and has its members sharing broad anticancer, anti-inflammatory and antioxidant activities.

Conclusions: DendroX solves the problem of matching visually and computationally determined clusters in a cluster heatmap and helps users navigate among different parts of a dendrogram. The identification of a cluster of naturally occurring compounds with shared bioactivities implicates a convergence of biological effects through divergent mechanisms.

Keywords: Cluster analysis; Dendrogram; LINCS L1000; Natural medicine.

PubMed Disclaimer

Conflict of interest statement

Authors that are affiliated with 3D Medicines Inc. are current employees. No potential conflicts of interest were disclosed by the other authors.

Figures

Fig. 1
Fig. 1
A screenshot of the user interface of the DendroX Cluster program
Fig. 2
Fig. 2
A screenshot of the input view. It shows the input view with an example JSON file and image file loaded. The example data is adapted from [22]
Fig. 3
Fig. 3
A screenshot of the visualization view. It visualizes the data submitted in Fig. 2
Fig. 4
Fig. 4
A: cluster heatmap of the 297 LINCS L1000 compound signatures. The 14 major clusters are colored and labeled. B-D: sub-clusters of cluster 2, 6 and 8. E-I: drug name-set enrichment analysis results of cluster 1, cluster 2.1, cluster 2.2, cluster 6.1 and cluster 10

References

    1. Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences. 1998;95:14863–8. - PMC - PubMed
    1. Suzuki R, Shimodaira H. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics. 2006;22:1540–2. doi: 10.1093/bioinformatics/btl117. - DOI - PubMed
    1. Kimes PK, Liu Y, Neil Hayes D, Marron JS. Statistical significance for hierarchical clustering. Biometrics. 2017;73:811–21. doi: 10.1111/biom.12647. - DOI - PMC - PubMed
    1. Sebastiani P, Perls TT. Detection of significant groups in hierarchical clustering by resampling. Front Genet. 2016;7:144. doi: 10.3389/fgene.2016.00144. - DOI - PMC - PubMed
    1. Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32:2847–9. doi: 10.1093/bioinformatics/btw313. - DOI - PubMed