Nucleic Acids Res. 2020 Jan 8;48(D1):D704-D715.
doi: 10.1093/nar/gkz997.

The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species

Kent A Shefchek et al. Nucleic Acids Res.

Abstract

In biology and biomedicine, relating phenotypic outcomes to genetic variation and environmental factors remains a challenge: patient phenotypes may not match known diseases, candidate variants may be in genes that have not been characterized, research organisms may not recapitulate human or veterinary diseases, environmental factors affecting disease outcomes are unknown or undocumented, and many resources must be queried to find potentially significant phenotypic associations. The Monarch Initiative (https://monarchinitiative.org) integrates information on genes, variants, genotypes, phenotypes and diseases in a variety of species, and allows powerful ontology-based search. We develop many widely adopted ontologies that together enable sophisticated computational analysis, mechanistic discovery and diagnostics of Mendelian diseases. Our algorithms and tools are widely used to identify animal models of human disease through phenotypic similarity, for differential diagnostics and to facilitate translational research. Launched in 2015, Monarch has grown with regard to data (new organisms, more sources, better modeling); new API and standards; ontologies (new Mondo unified disease ontology, improvements to ontologies such as HPO and uPheno); user interface (a redesigned website); and community development. Monarch data, algorithms and tools are being used and extended by resources such as GA4GH and NCATS Translator, among others, to aid mechanistic discovery and diagnostics.

Figures

Figure 1.
uPheno template-driven ontology development and harmonization. uPheno templates are used to define phenotypes according to agreed-upon design patterns. (A) Computable definitions specified using uPheno templates are used to automate classification of uPheno and parts of the Zebrafish Phenotype Ontology (ZP (13); dashed lines). (B) Computable definitions also drive automated classification of HPO and ZP classes under uPheno classes. For example, enlarged heart in ZP (defined using the zebrafish anatomy heart term) and enlarged heart in HPO are both classified under uPheno enlarged heart (defined using Uberon heart). Algorithms can use this classification under uPheno to predict that human orthologs of zebrafish genes annotated to enlarged heart may cause enlarged heart in humans.
Figure 2.
Decomposition of a Zebrafish Genotype. The left panel shows classes in the core genotype partonomy. The center panel shows an example instance of each class from the zebrafish genotype (see also https://zfin.org/ZDB-GENO-161227-1). The right panel shows a graphical depiction of the portion of the genome specified at each level (where the top panel shows a complete genome composed of two sets of homologous chromosomes).
Figure 3.
A workflow diagram of the Monarch architecture. Since our last report, we have developed the Monarch API (highlighted) for accessing associations between entities, performing computations on phenotype profiles, executing graph traversal queries, and performing text annotation (https://api.monarchinitiative.org/api).
Figure 4.
Monarch's data sources. The leftmost set of columns shows the types of data that the integrated data sources serve to Monarch. Note that these sources offer many additional data types that have not yet been integrated into Monarch. Each data source is annotated to specific ontologies and standards, which are, in turn, harmonized using the ontologies indicated in the rightmost panel. Those are used to create an integrated knowledge graph which drives the views and analytics on the Monarch website.
Figure 5.
The New Monarch User Interface. A beta version of the new website is available at https://beta.monarchinitiative.org. By entering information in the ‘Search’ bar, users can navigate directly to terms suggested via autocomplete, or explore more results through the results tables. In this example, a user enters only part of the name of a disease, ‘Pierpont syndrome’ (A). After selecting the term from the autocomplete menu, the user arrives at an overview page, which offers a summary of all available information in the integrated knowledge graph of the Monarch database (B). Users can explore all available data using a menu of options shown on a panel on the left (B-1), while the information is updated on the main panel on the right (B-2). In this example, the user learns that Pierpont syndrome, a rare subcutaneous tissue disorder, is characterized by phenotypes that include ‘prominent subcalcaneal fat pad’ (a term in HPO, with identifier HP:0032276), ‘deep plantar creases’ (HP:0001869) and ‘muscular hypotonia’ (HP:0001252), among many others (C). Information integrated from the OMIM and Orphanet databases, as well as a number of publications, also supports the association of a mutation in one gene, TBL1XR1, as the cause of Pierpont syndrome (D).
Figure 6.
Text annotation widget on the new Monarch website. Users can supply free text and retrieve the resulting marked-up text with links to terms in various ontologies. In this example, a user has entered text from a publication entitled ‘A specific mutation in TBL1XR1 causes Pierpont syndrome’ (51). The Text Annotator tool (in beta version) has highlighted terms identified in various ontologies, and hovering over each highlighted term shows details about the marked-up annotation, in this case, ‘abnormal fat distribution’.
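
The cross-species inference described in the Figure 1 caption can be illustrated with a small sketch. This is a toy example under stated assumptions, not Monarch's implementation: the term identifiers, gene names, orthology table and the function `predict_human_phenotypes` are all hypothetical placeholders. It only shows the idea that a zebrafish phenotype and a human phenotype classified under the same uPheno class let a zebrafish gene annotation suggest a candidate phenotype for the human ortholog.

```python
from collections import defaultdict

# Species-specific phenotype term -> species-neutral uPheno class.
# All identifiers here are hypothetical placeholders, not real ontology IDs.
UPHENO_PARENT = {
    "ZP:enlarged_heart": "UPHENO:enlarged_heart",
    "HP:enlarged_heart": "UPHENO:enlarged_heart",
    "ZP:small_eye": "UPHENO:small_eye",  # no human term under this class here
}

# Hypothetical zebrafish gene -> phenotype annotations.
ZEBRAFISH_ANNOTATIONS = {
    "zfgene1": ["ZP:enlarged_heart"],
    "zfgene2": ["ZP:small_eye"],
}

# Hypothetical zebrafish gene -> human ortholog table.
ORTHOLOGS = {"zfgene1": "HUMAN_GENE1", "zfgene2": "HUMAN_GENE2"}


def predict_human_phenotypes(annotations, orthologs, upheno_parent):
    """Suggest human phenotype terms for human orthologs of annotated genes."""
    # Index human (HP) terms by the uPheno class they are classified under.
    human_terms_by_class = defaultdict(list)
    for term, parent in upheno_parent.items():
        if term.startswith("HP:"):
            human_terms_by_class[parent].append(term)

    predictions = defaultdict(set)
    for gene, terms in annotations.items():
        human_gene = orthologs.get(gene)
        if human_gene is None:
            continue  # no known ortholog, so nothing to predict
        for term in terms:
            parent = upheno_parent.get(term)
            # Any human term sharing the uPheno parent is a candidate.
            for human_term in human_terms_by_class.get(parent, []):
                predictions[human_gene].add(human_term)
    return {gene: sorted(terms) for gene, terms in predictions.items()}


predictions = predict_human_phenotypes(
    ZEBRAFISH_ANNOTATIONS, ORTHOLOGS, UPHENO_PARENT
)
print(predictions)
```

In this toy data only `zfgene1` yields a prediction, because `ZP:small_eye` has no human term under the same uPheno class. A real analysis would rely on reasoner-derived classification and curated orthology rather than hand-written dictionaries.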

References

    1. Köhler S., Carmody L., Vasilevsky N., Jacobsen J.O.B., Danis D., Gourdine J.-P., Gargano M., Harris N.L., Matentzoglu N., McMurry J.A. et al. Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources. Nucleic Acids Res. 2019; 47:D1018–D1027.
    2. Vasilevsky N.A., Foster E.D., Engelstad M.E., Carmody L., Might M., Chambers C., Dawkins H.J.S., Lewis J., Della Rocca M.G., Snyder M. et al. Plain-language medical vocabulary for precision diagnosis. Nat. Genet. 2018; 50:474–476.
    3. Turnbull C., Scott R.H., Thomas E., Jones L., Murugaesu N., Pretty F.B., Halai D., Baple E., Craig C., Hamblin A. et al. The 100 000 Genomes Project: bringing whole genome sequencing to the NHS. BMJ. 2018; 361:k1687.
    4. Gall T., Valkanas E., Bello C., Markello T., Adams C., Bone W.P., Brandt A.J., Brazill J.M., Carmichael L., Davids M. et al. Defining disease, diagnosis, and translational medicine within a homeostatic perturbation paradigm: The national institutes of health undiagnosed diseases program experience. Front. Med. 2017; 4:62.
    5. Ramoni R.B., Mulvihill J.J., Adams D.R., Allard P., Ashley E.A., Bernstein J.A., Gahl W.A., Hamid R., Loscalzo J., McCray A.T. et al. The undiagnosed diseases network: accelerating discovery about health and disease. Am. J. Hum. Genet. 2017; 100:185–192.
