Nucleic Acids Res. 2016 Jan 4;44(D1):D272-8.
doi: 10.1093/nar/gkv1301. Epub 2015 Nov 26.

BIGNASim: a NoSQL database structure and analysis portal for nucleic acids simulation data


Adam Hospital et al. Nucleic Acids Res.

Abstract

Molecular dynamics (MD) simulation is, just behind genomics, the bioinformatics tool that generates the largest amounts of data and uses the largest amount of CPU time in supercomputing centres. MD trajectories are obtained after months of calculations, analysed in situ and, in practice, forgotten. Several projects to generate stable trajectory databases have been developed for proteins, but no equivalent exists in the nucleic acids world. We present here a novel database system to store MD trajectories and analyses of nucleic acids. The initial data set available consists mainly of the benchmark of the new molecular dynamics force field, parmBSC1. It contains 156 simulations, with over 120 μs of total simulation time. A deposition protocol is available to accept the submission of new trajectory data. The database is based on the combination of two NoSQL engines: Cassandra for storing trajectories, and MongoDB for storing analysis results and simulation metadata. The analyses available include backbone geometries, helical analysis, NMR observables and a variety of mechanical analyses. Individual trajectories and combined meta-trajectories can be downloaded from the portal. The system is accessible through http://mmb.irbbarcelona.org/BIGNASim/. Supplementary Material is also available online at http://mmb.irbbarcelona.org/BIGNASim/SuppMaterial/.
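The two-engine split described in the abstract can be illustrated with a minimal sketch. It uses plain in-memory dictionaries as stand-ins for Cassandra and MongoDB, so it talks to no real database. The class name, field names and keying scheme are all hypothetical and are not taken from the BIGNASim schema. The sketch only shows the general idea: bulky per-frame coordinates go into a store keyed by simulation and frame number, while small searchable documents hold metadata and analysis results.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# One frame = one (x, y, z) tuple per atom. Hypothetical layout for illustration.
Frame = List[Tuple[float, float, float]]


@dataclass
class SimulationStore:
    # Stand-in for the trajectory engine: rows keyed by (simulation id, frame number).
    frames: Dict[Tuple[str, int], Frame] = field(default_factory=dict)
    # Stand-in for the document engine: one metadata/analysis document per simulation.
    documents: Dict[str, dict] = field(default_factory=dict)

    def deposit(self, sim_id: str, metadata: dict, trajectory: List[Frame]) -> None:
        """Store metadata as a document and coordinates frame by frame."""
        self.documents[sim_id] = {**metadata, "n_frames": len(trajectory), "analyses": {}}
        for number, frame in enumerate(trajectory):
            self.frames[(sim_id, number)] = frame

    def add_analysis(self, sim_id: str, name: str, values: List[float]) -> None:
        """Attach a precomputed analysis result to the simulation document."""
        self.documents[sim_id]["analyses"][name] = values

    def search(self, **criteria) -> List[str]:
        """Return ids of simulations whose metadata matches every criterion."""
        return sorted(
            sim_id
            for sim_id, doc in self.documents.items()
            if all(doc.get(key) == value for key, value in criteria.items())
        )

    def fetch_frames(self, sim_id: str, start: int, stop: int) -> List[Frame]:
        """Read a contiguous slice of frames without touching the documents."""
        return [self.frames[(sim_id, number)] for number in range(start, stop)]
```

In this arrangement a search only reads the small documents, and coordinates are fetched afterwards for the selected simulations. That is one plausible motivation for keeping the two kinds of data in separate engines.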


Figures

Figure 1. Global outline of the database platform and data flow.
Figure 2. Details of screenshots of the BIGNASim portal. (A) Details of the three search options. (B) Browser table. Column selectors and top search box allow filtering contents. (C) Portal to available analyses, after trajectory selection. Also available for global database analyses. Each bullet leads to analyses of the indicated molecular fragment. Number in parentheses indicates the available data items on each option. Full screenshots are available in the Supplementary Material examples.
Figure 3. Screenshots of the BIGNASim portal. Example of navigation in the analysis structure for obtaining the twist parameter of CG bp-steps. (i) Selection of series of analysis based on curves. (ii) Selection of helical parameters. (iii) Selection of the twist parameter calculated for CG steps on all individual frames. Numbers in parentheses indicate the amount of available data items on each option. Raw histogram data are available for downloading. Full screenshots are available in the Supplementary Material examples.
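The selection shown in Figure 3 (twist values for CG steps, collected over all frames and summarised as a histogram) can be sketched as follows. This is an illustration under stated assumptions, not the portal's implementation. The function names, the input layout (one twist value per base-pair step per frame, in sequence order), the bin width and the choice to read steps along a single strand are all hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List


def step_names(sequence: str) -> List[str]:
    """Name each base-pair step by its two consecutive bases on one strand."""
    return [sequence[i:i + 2] for i in range(len(sequence) - 1)]


def twist_histogram(
    sequence: str,
    twist_per_frame: List[List[float]],
    step: str = "CG",
    bin_width: float = 2.0,
) -> Dict[float, int]:
    """Count twist values (degrees) of the chosen step type over all frames.

    twist_per_frame[f][i] is assumed to hold the twist of step i in frame f.
    Keys of the result are the lower edges of the histogram bins.
    """
    selected = [i for i, name in enumerate(step_names(sequence)) if name == step]
    counts: Counter = Counter()
    for frame in twist_per_frame:
        for i in selected:
            lower_edge = math.floor(frame[i] / bin_width) * bin_width
            counts[lower_edge] += 1
    return dict(sorted(counts.items()))
```

Calling `twist_histogram(sequence, twists)` with a sequence string and per-frame twist lists returns bin counts comparable in spirit to the raw histogram data the portal offers for download.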

