Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 9;13(1):756.
doi: 10.1038/s41467-022-28348-y.

A platform for oncogenomic reporting and interpretation

Affiliations

A platform for oncogenomic reporting and interpretation

Caralyn Reisle et al. Nat Commun. .

Abstract

Manual interpretation of variants remains rate limiting in precision oncology. The increasing scale and complexity of molecular data generated from comprehensive sequencing of cancer samples requires advanced interpretative platforms as precision oncology expands beyond individual patients to entire populations. To address this unmet need, we introduce a Platform for Oncogenomic Reporting and Interpretation (PORI), comprising an analytic framework that facilitates the interpretation and reporting of somatic variants in cancer. PORI integrates reporting and graph knowledge base tools combined with support for manual curation at the reporting stage. PORI represents an open-source platform alternative to commercial reporting solutions suitable for comprehensive genomic data sets in precision oncology. We demonstrate the utility of PORI by matching 9,961 pan-cancer genome atlas tumours to the graph knowledge base, calculating therapeutically informative alterations, and making available reports describing select individual samples.

PubMed Disclaimer

Conflict of interest statement

Dr. Stephen Yip has received funding as a member of scientific advisory boards of Amgen, AstraZeneca, Bayer, Norvatis, and Roche. All other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Platform of Oncogenomic Reporting and Interpretation (PORI) overview.
PORI Design showing both the placement of PORI within a precision oncology workflow (a) and the process of generating a report (b). PORI is used for the interpretation and reporting of genomic findings from tumour sequencing. Sequencing Data is taken as input to a number of bioinformatic pipelines and analyses defined by the user. The results of these are loaded by the IPR report python adapter (ipr-python) and annotated with information from GraphKB. After annotation, the results are collated and prioritized based on matches for output into a report using the IPR interactive web platform. This is optionally manually reviewed by the case analyst who may add content to GraphKB as part of their literature review for the case and re-generate the report to include the newly added content. This report is shared with the molecular tumour board (MTB) to inform clinical decisions.
Fig. 2
Fig. 2. Integrated Pipeline Reports (IPR) web interface.
(a) The report front page of an example report displaying patient metadata and a summary of tumour characteristics and findings. (b) The annotations collected from GraphKB are listed in the knowledge base matches section of the generated report with links from each match back to its corresponding statement in GraphKB.
Fig. 3
Fig. 3. GraphKB Statement Schema.
Statements are composed of four main elements: conditions, subject, relevance, and evidence. A statement may be linked to any number of conditions but only one subject and relevance. The conclusion of a statement is considered to be composed only of the relevance and subject.
Fig. 4
Fig. 4. Clinically informative conclusion agreement across knowledge bases.
(a) The individual contribution of each source is shown as the number of unique conclusions which are given for both raw and normalized counts. The raw values represent the number of conclusions prior to normalization. (b) The amount of content which is shared between sources is shown as a fraction of the total number of unique conclusions. Source data are provided as a Source Data file.
Fig. 5
Fig. 5. Graph view of content in the GraphKB web application.
A subset of links between disease terms related to colorectal adenocarcinoma are shown from NCI thesaurus (NCIt) and OncoTree. For brevity, only a small number of links to clinical trials (Clinicaltrials.gov) are shown.
Fig. 6
Fig. 6. Coverage of clinical trial terminology in popular ontologies.
Proportion of matched drug (a) and disease (b) terms from clinical trials matched by ontology terms across multiple resources. A distinction was made between the primary/preferred terms for a given resource and the set of all terms (indicated with a+), which included synonyms, aliases, and commercial product names. The proportion of total clinical trials terms where an exact match was found in a given ontology is termed coverage. Coverage was calculated for trial terms at 3 frequencies (1+, 10+, or 100+) where the frequency is calculated as the number of clinical trials a given term was used in. Source data are provided as a Source Data file.
Fig. 7
Fig. 7. Division of the therapeutic matches to the TCGA samples (n = 9,961).
The total set of matches is further subdivided by a number of filters. Samples were considered disease matched when the diagnosis of the patient matched the disease listed by the annotation (Diagnosis Match) Samples were considered position matched (Position-Specific) when matches to non-specific gene-level small mutations were excluded. Other filters included: matches with AMP Tier I compatible evidence (AMP Tier I); matches excluding those obtained by second-pass or inferred matching (Direct Match); and finally only non-synonymous, at the protein level, mutations (Non-Synon). The union of all matches is given by the shaded portion and vertical dashed line. Source data are provided as a Source Data file.
Fig. 8
Fig. 8. Proportion of samples (n = 9,961) with therapeutic matches derived from each combination of variant types.
Upset plot of the number of samples with therapeutic matches from annotation of a given variant type. Sample variants are divided into four types: copy number variants (cnv); single nucleotide variants and indels (mut); gene fusions (fus); and gene expression (exp) variants. The left-hand bar plots are the total number of samples which have 1 or more therapeutic conclusions matched to the listed variant type. The union of all matches is given by the shaded portion and vertical dashed line. The upper bar plots show the number of samples in each of the intersection groups. These groups are mutually exclusive. Source data are provided as a Source Data file.

References

    1. Good BM, Ainscough BJ, McMichael JF, Su AI, Griffith OL. Organizing knowledge to enable personalization of medicine in cancer. Genome Biol. 2014;15:438. - PMC - PubMed
    1. Mardis ER. The 1,000 genome, the 100,000 analysis? Genome Med. 2010;2:84. - PMC - PubMed
    1. Chakravarty, D. et al. OncoKB: A Precision Oncology Knowledge Base. JCO Precis Oncol2017, PO.17.00011 (2017). - PMC - PubMed
    1. Griffith M, et al. CIViC is a community knowledgebase for expert crowdsourcing the clinical interpretation of variants in cancer. Nat. Genet. 2017;49:170–174. - PMC - PubMed
    1. Tamborero D, et al. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 2018;10:25. - PMC - PubMed

Publication types

Substances