Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May 1;24(3):513-519.
doi: 10.1093/jamia/ocw148.

The cancer precision medicine knowledge base for structured clinical-grade mutations and interpretations

Affiliations

The cancer precision medicine knowledge base for structured clinical-grade mutations and interpretations

Linda Huang et al. J Am Med Inform Assoc. .

Abstract

Objective: This paper describes the Precision Medicine Knowledge Base (PMKB; https://pmkb.weill.cornell.edu ), an interactive online application for collaborative editing, maintenance, and sharing of structured clinical-grade cancer mutation interpretations.

Materials and methods: PMKB was built using the Ruby on Rails Web application framework. Leveraging existing standards such as the Human Genome Variation Society variant description format, we implemented a data model that links variants to tumor-specific and tissue-specific interpretations. Key features of PMKB include support for all major variant types, standardized authentication, distinct user roles including high-level approvers, and detailed activity history. A REpresentational State Transfer (REST) application-programming interface (API) was implemented to query the PMKB programmatically.

Results: At the time of writing, PMKB contains 457 variant descriptions with 281 clinical-grade interpretations. The EGFR, BRAF, KRAS, and KIT genes are associated with the largest numbers of interpretable variants. PMKB's interpretations have been used in over 1500 AmpliSeq tests and 750 whole-exome sequencing tests. The interpretations are accessed either directly via the Web interface or programmatically via the existing API.

Discussion: An accurate and up-to-date knowledge base of genomic alterations of clinical significance is critical to the success of precision medicine programs. The open-access, programmatically accessible PMKB represents an important attempt at creating such a resource in the field of oncology.

Conclusion: The PMKB was designed to help collect and maintain clinical-grade mutation interpretations and facilitate reporting for clinical cancer genomic testing. The PMKB was also designed to enable the creation of clinical cancer genomics automated reporting pipelines via an API.

Keywords: application-programming interface; cancer genomics; clinical reporting; database; pathology; precision medicine.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Diagram of the variant description data types.
Figure 2.
Figure 2.
Illustration of information returned by the PMKB API when querying a variant.
Figure 3.
Figure 3.
Screenshots of (a) the interface for entering an interpretation in PMKB, and (b) summary view of an interpretation after entry/edit. This also demonstrates how PubMed IDs in the citations entry were resolved to complete citations in the display.
Figure 4.
Figure 4.
Growth of the knowledge base over time. Entries created prior to December 2015 represent work that IPM pathologists had saved in Excel spreadsheets. Currently, a small team at the IPM makes contributions to PMKB, and expansion in PMKB’s user base could result in a much higher rate of growth.
Figure 5.
Figure 5.
(a) The top 10 genes in PMKB when ranked by number of variant descriptions in the database. (b) The top 10 genes in PMKB when ranked by number of interpretations in the database. The prevalence of certain genes in PMKB naturally corresponds to genes that have well-studied variants with possible diagnostic use. (c) The top 10 solid tumor types in PMKB when ranked by number of interpretations.

References

    1. Roychowdhury S, Iyer MK, Robinson DR et al. . Personalized oncology through integrative high-throughput sequencing: a pilot study. Sci Trans Med. 2011;3(111):111ra21. - PMC - PubMed
    1. Van Allen EM, Wagle N et al. . Whole-exome sequencing and clinical interpretation of formalin-fixed, paraffin-embedded tumor samples to guide precision cancer medicine. Nat Med. 2014;20(6):682–8. - PMC - PubMed
    1. Beltran H, Eng K, Mosquera JM et al. . Whole-Exome Sequencing of Metastatic Cancer and Biomarkers of Treatment Response. JAMA Onco.l 2015;1(4):466–74. - PMC - PubMed
    1. McLaren W, Gil L, Hunt SE et al. . The Ensembl Variant Effect Predictor. Genome Biol. 2016;17(1):122. - PMC - PubMed
    1. Cingolani P, Platts A, Wang le L et al. . A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6(2):80–92. - PMC - PubMed