Stochastic proximity embedding
- PMID: 12820129
- DOI: 10.1002/jcc.10234
Abstract
We introduce stochastic proximity embedding (SPE), a novel self-organizing algorithm for producing meaningful underlying dimensions from proximity data. SPE attempts to generate low-dimensional Euclidean embeddings that best preserve the similarities between a set of related observations. The method starts with an initial configuration, and iteratively refines it by repeatedly selecting pairs of objects at random, and adjusting their coordinates so that their distances on the map match more closely their respective proximities. The magnitude of these adjustments is controlled by a learning rate parameter, which decreases during the course of the simulation to avoid oscillatory behavior. Unlike classical multidimensional scaling (MDS) and nonlinear mapping (NLM), SPE scales linearly with respect to sample size, and can be applied to very large data sets that are intractable by conventional embedding procedures. The method is programmatically simple, robust, and convergent, and can be applied to a wide range of scientific problems involving exploratory data analysis and visualization.
Copyright 2003 Wiley Periodicals, Inc. J Comput Chem 24: 1215-1221, 2003
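
The refinement loop described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact update formula or the learning-rate schedule, so the pairwise correction (each point moves half of the scaled discrepancy along the line joining the pair) and the linear decay from `lr_start` to `lr_end` are assumptions made for illustration. The function name `spe_embed` and all of its parameters are hypothetical, not taken from the paper.

```python
import math
import random


def spe_embed(proximity, n_points, dim=2, cycles=100, steps_per_cycle=2000,
              lr_start=1.0, lr_end=0.01, eps=1e-9, seed=0):
    """Illustrative stochastic pairwise-refinement embedding (a sketch).

    proximity: callable (i, j) -> target distance between objects i and j.
    Returns a list of n_points coordinate lists, each of length dim.
    """
    rng = random.Random(seed)
    # Initial configuration: random coordinates in the unit hypercube.
    coords = [[rng.random() for _ in range(dim)] for _ in range(n_points)]

    lr = lr_start
    # Assumed schedule: linear decrease of the learning rate per cycle.
    lr_step = (lr_start - lr_end) / max(cycles - 1, 1)

    for _ in range(cycles):
        for _ in range(steps_per_cycle):
            # Select a pair of distinct objects at random.
            i, j = rng.sample(range(n_points), 2)
            target = proximity(i, j)
            diff = [a - b for a, b in zip(coords[i], coords[j])]
            dist = math.sqrt(sum(d * d for d in diff))
            # Assumed update: move both points along their connecting line
            # so the map distance shifts toward the target proximity.
            scale = lr * 0.5 * (target - dist) / (dist + eps)
            for k in range(dim):
                coords[i][k] += scale * diff[k]
                coords[j][k] -= scale * diff[k]
        lr -= lr_step
    return coords


if __name__ == "__main__":
    # Toy data: three points forming a 3-4-5 right triangle.
    pts = [(0.0, 0.0), (3.0, 0.0), (0.0, 4.0)]

    def prox(i, j):
        return math.hypot(pts[i][0] - pts[j][0], pts[i][1] - pts[j][1])

    emb = spe_embed(prox, len(pts), cycles=50, steps_per_cycle=500)
    for i in range(len(pts)):
        for j in range(i + 1, len(pts)):
            d = math.hypot(emb[i][0] - emb[j][0], emb[i][1] - emb[j][1])
            print(i, j, round(prox(i, j), 3), round(d, 3))
```

Passing the proximities as a callable means the sketch never builds the full pairwise matrix: each step touches one pair only, so the cost is set by the number of refinement steps rather than by the square of the sample size.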