Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 1;39(3):btad100.
doi: 10.1093/bioinformatics/btad100.

StonPy: a tool to parse and query collections of SBGN maps in a graph database

Affiliations

StonPy: a tool to parse and query collections of SBGN maps in a graph database

Adrien Rougny et al. Bioinformatics. .

Abstract

Summary: The systems biology graphical notation (SBGN) has become the de facto standard for the graphical representation of molecular maps. Having rapid and easy access to the content of large collections of maps is necessary to perform semantic or graph-based analysis of these resources. To this end, we propose StonPy, a new tool to store and query SBGN maps in a Neo4j graph database. StonPy notably includes a data model that takes into account all three SBGN languages and a completion module to automatically build valid SBGN maps from query results. StonPy is built as a library that can be integrated into other software and offers a command-line interface that allows users to easily perform all operations.

Availability and implementation: StonPy is implemented in Python 3 under a GPLv3 license. Its code and complete documentation are freely available from https://github.com/adrienrougny/stonpy.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Overview of StonPy’s functionalities. An example of an input SBGN PD map is shown on the left. The corresponding Neo4j graph, built using StonPy, is shown on the right. All SBGN glyphs and arcs are modeled using Neo4j nodes; relationships between SBGN glyphs and sub-glyphs or complex attributes are modeled using Neo4j relationships. SBGN arcs are optionally modeled using additional Neo4j relationships (CATALYZES, HAS_REACTANT, and HAS_PRODUCT relationships) that mimic the structure of the SBGN map and facilitate writing queries on the represented biological concepts. For each Neo4j node, either one of its labels or one of its attributes is shown; for each label, its type is shown. Examples of labels and attributes are shown for some Neo4j nodes in the boxes on the right. Consumption (Cons) and Production (Prod) have been shortened for visualization

References

    1. Bergmann F.T. et al. (2020) Systems biology graphical notation markup language (SBGNML) version 0.3. J. Integr. Bioinform., 17, 20200016. - PMC - PubMed
    1. Fabregat A. et al. (2018) Reactome graph database: efficient access to complex pathway data. PLoS Comput. Biol., 14, e1005968. - PMC - PubMed
    1. König M. (2020) matthiaskoenig/libsbgn-python: 0.2.2 (0.2.2). Zenodo. 10.5281/zenodo.4171366. - DOI
    1. Kuperstein I. et al. (2015) Atlas of cancer signalling network: a systems biology resource for integrative analysis of cancer data with google maps. Oncogenesis, 4, e160. - PMC - PubMed
    1. Le Novere N. et al. (2009) The systems biology graphical notation. Nat. Biotechnol., 27, 735. 741. - PubMed

Publication types