Haisu: Hierarchically supervised nonlinear dimensionality reduction
- PMID: 35862429
- PMCID: PMC9345488
- DOI: 10.1371/journal.pcbi.1010351
Haisu: Hierarchically supervised nonlinear dimensionality reduction
Abstract
We propose a novel strategy for incorporating hierarchical supervised label information into nonlinear dimensionality reduction techniques. Specifically, we extend t-SNE, UMAP, and PHATE to include known or predicted class labels and demonstrate the efficacy of our approach on multiple single-cell RNA sequencing datasets. Our approach, "Haisu," is applicable across domains and methods of nonlinear dimensionality reduction. In general, the mathematical effect of Haisu can be summarized as a variable perturbation of the high dimensional space in which the original data is observed. We thereby preserve the core characteristics of the visualization method and only change the manifold to respect known or assumed class labels when provided. Our strategy is designed to aid in the discovery and understanding of underlying patterns in a dataset that is heavily influenced by parent-child relationships. We show that using our approach can also help in semi-supervised settings where labels are known for only some datapoints (for instance when only a fraction of the cells are labeled). In summary, Haisu extends existing popular visualization methods to enable a user to incorporate labels known a priori into a visualization, including their hierarchical relationships as defined by a user input graph.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
References
-
- Sivarajah U, Kamal MM, Irani Z, Weerakkody V. Critical analysis of Big Data challenges and analytical methods. Journal of Business Research. 2017;70: 263–286. doi: 10.1016/j.jbusres.2016.08.001 - DOI
-
- Dimensionality Reduction—an overview | ScienceDirect Topics. [cited 2 May 2020]. Available: https://www.sciencedirect.com/topics/computer-science/dimensionality-red...
-
- Sammon JW. A Nonlinear Mapping for Data Structure Analysis. IEEE Transactions on Computers. 1969;C–18: 401–409. doi: 10.1109/T-C.1969.222678 - DOI
-
- Hinton GE, Roweis ST. Stochastic Neighbor Embedding: 8.
MeSH terms
LinkOut - more resources
Full Text Sources
