Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Jun 26:2025.06.20.660828.
doi: 10.1101/2025.06.20.660828.

Divide and Cluster: The DIVINE Framework for Deterministic Top-Down Analysis of Molecular Dynamics Trajectories

Divide and Cluster: The DIVINE Framework for Deterministic Top-Down Analysis of Molecular Dynamics Trajectories

Jherome Brylle Woody Santos et al. bioRxiv. .

Abstract

We present DIVIsive N -ary Ensembles (DIVINE), a deterministic, top-down clustering framework designed for molecular dynamics (MD) trajectories. DIVINE constructs a complete clustering hierarchy by recursively splitting clusters based on n -ary similarity principles, avoiding the need for O(N 2 ) pairwise distance matrices. It supports multiple cluster selection criteria, including a weighted variance metric, and deterministic anchor initialization strategies such as NANI (N-ary Natural Initiation), ensuring reproducible and well-balanced partitions. Testing DIVINE up to a 305 μs folding trajectory of the villin headpiece (HP35) revealed that it matched or exceeded the clustering quality of bisecting k -means while reducing runtime and eliminating stochastic variability. Its single-pass design enables efficient exploration of clustering resolutions without repeated executions. By combining scalability, interpretability, and determinism, DIVINE offers a robust and practical alternative to conventional MD clustering methods. DIVINE is publicly available as part of the MDANCE package: https://github.com/mqcomplab/MDANCE .

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest: The authors declare no competing financial interests.

Figures

Figure 1.
Figure 1.
DIVINE clustering on a Synthetic 2D Benchmark Dataset. Clustering results are shown for a simple, low-dimensional 2D dataset with clearly separated clusters: ground truth labels (left), k-means NANI (center), and DIVINE using NANI for anchor selection (right). Each color represents a distinct cluster. While both methods broadly recover the true cluster structure, DIVINE achieves better separation and cluster consistency. This illustrative example provides a controlled benchmark for visual comparison but does not reflect the complexity of molecular dynamics data, which typically involve high-dimensionality and conformational noise. As such, we performed a more rigorous validation using a biomolecular system.
Figure 2.
Figure 2.
Change in Calinski-Harabasz and Davies-Bouldin indices for HP35 after DIVINE clustering from k = 1 to 30, comparing performance across different cluster selection strategies.
Figure 3.
Figure 3.
Change in Calinski-Harabasz and Davies-Bouldin indices for HP35 after DIVINE clustering from k = 1 to 30, comparing performance across different anchor methods.
Figure 4:
Figure 4:
Overlaps of best representative structures from the seven clusters of HP35 after performing DIVINE using NANI as anchors and weighted_MSD as cluster selection strategy. Helices 1, 2, and 3 are colored green, cyan, and yellow, respectively.
Figure 4:
Figure 4:
Overlaps of best representative structures from the seven clusters of HP35 after performing DIVINE using NANI as anchors and weighted_MSD as cluster selection strategy. Helices 1, 2, and 3 are colored green, cyan, and yellow, respectively.

Similar articles

References

    1. Karplus M.; McCammon J. A. Molecular Dynamics Simulations of Biomolecules. Nat. Struct Biol. 2002, 9 (9), 646–652. 10.1038/nsb0902-646. - DOI - PubMed
    1. Arnittali M.; Rissanou A. N.; Harmandaris V. Structure Of Biomolecules Through Molecular Dynamics Simulations. Procedia Computer Science 2019, 156, 69–78. 10.1016/j.procs.2019.08.181. - DOI
    1. Pierce L. C. T.; Salomon-Ferrer R.; De Oliveira, C Augusto F..; McCammon J. A.; Walker R. C. Routine Access to Millisecond Time Scale Events with Accelerated Molecular Dynamics. J. Chem. Theory Comput. 2012, 8 (9), 2997–3002. 10.1021/ct300284c. - DOI - PMC - PubMed
    1. Shaw D. E.; Deneroff M. M.; Dror R. O.; Kuskin J. S.; Larson R. H.; Salmon J. K.; Young C.; Batson B.; Bowers K. J.; Chao J. C.; Eastwood M. P.; Gagliardo J.; Grossman J. P.; Ho C. R.; Ierardi D. J.; Kolossváry I.; Klepeis J. L.; Layman T.; McLeavey C.; Moraes M. A.; Mueller R.; Priest E. C.; Shan Y.; Spengler J.; Theobald M.; Towles B.; Wang S. C. Anton, a Special-Purpose Machine for Molecular Dynamics Simulation. Commun. ACM 2008, 51 (7), 91–97. 10.1145/1364782.1364802. - DOI
    1. Harvey M. J.; Giupponi G.; Fabritiis G. D. ACEMD: Accelerating Biomolecular Dynamics in the Microsecond Time Scale. J. Chem. Theory Comput. 2009, 5 (6), 1632–1639. 10.1021/ct9000685. - DOI - PubMed

Publication types

LinkOut - more resources