Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct:3:1-12.
doi: 10.1200/CCI.19.00077.

Open-Sourced CIViC Annotation Pipeline to Identify and Annotate Clinically Relevant Variants Using Single-Molecule Molecular Inversion Probes

Affiliations

Open-Sourced CIViC Annotation Pipeline to Identify and Annotate Clinically Relevant Variants Using Single-Molecule Molecular Inversion Probes

Erica K Barnell et al. JCO Clin Cancer Inform. 2019 Oct.

Abstract

Purpose: Clinical targeted sequencing panels are important for identifying actionable variants for patients with cancer; however, existing approaches do not provide transparent and rationally designed clinical panels to accommodate the rapidly growing knowledge within oncology.

Materials and methods: We used the Clinical Interpretations of Variants in Cancer (CIViC) database to develop an Open-Sourced CIViC Annotation Pipeline (OpenCAP). OpenCAP provides methods to identify variants within the CIViC database, build probes for variant capture, use probes on prospective samples, and link somatic variants to CIViC clinical relevance statements. OpenCAP was tested using a single-molecule molecular inversion probe (smMIP) capture design on 27 cancer samples from 5 tumor types. In total, 2,027 smMIPs were designed to target 111 eligible CIViC variants (61.5 kb of genomic space).

Results: When compared with orthogonal sequencing, CIViC smMIP sequencing demonstrated a 95% sensitivity for variant detection (n = 61 of 64 variants). Variant allele frequencies for variants identified on both sequencing platforms were highly concordant (Pearson's r = 0.885; n = 61 variants). Moreover, for individuals with paired tumor and normal samples (n = 12), 182 clinically relevant variants missed by orthogonal sequencing were discovered by CIViC smMIP sequencing.

Conclusion: The OpenCAP design paradigm demonstrates the utility of an open-source and open-access database built on attendant community contributions with peer-reviewed interpretations. Use of a public repository for variant identification, probe development, and variant interpretation provides a transparent approach to build dynamic next-generation sequencing-based oncology panels.

PubMed Disclaimer

Conflict of interest statement

Erica K. Barnell

Employment: Geneoscopy

Stock and Other Ownership Interests: Geneoscopy

Patents, Royalties, Other Intellectual Property: Inventor on intellectual property in start-up company (Geneoscopy)

Travel, Accommodations, Expenses: Geneoscopy

Adam Waalkes

Research Funding: Gilead Sciences

Patents, Royalties, Other Intellectual Property: Provisional application filed: Genome-Scale Molecular Diagnostics for Microsatellite instability Using Targeted Molecular Counting Methods [IP: 48143.02US1]; provisional application filed: Ultrasensitive and Universal Detection of Genomic Chimerism by Single-Molecule Molecular Inversion Probe Capture and Methods of Use [IP: 48101.01US1]

Kelsi Penewit

Research Funding: Gilead Sciences

Katie M. Campbell

Employment: Geneoscopy

Stock and Other Ownership Interests: Geneoscopy

Consulting or Advisory Role: Geneoscopy

Kilannin Krysiak

Consulting or Advisory Role: Gerson Lehrman Group

Damian Rieke

Honoraria: Bristol-Myers Squibb

Consulting or Advisory Role: Alacris Theranostics

Zachary L. Skidmore

Stock and Other Ownership Interests: AIM ImmunoTech, Catalyst Pharmaceuticals

Todd A. Fehniger

Stock and Other Ownership Interests: Kiadis Pharma, Indapta, Orca Biosystems

Honoraria: CytoSen

Consulting or Advisory Role: NKarta, Nektar

Research Funding: Altor BioScience (Inst), Affimed (Inst), Compass Therapeutics (Inst)

Travel, Accommodations, Expenses: Miltenyi Biotec

Ravindra Uppaluri

Consulting or Advisory Role: Merck

Research Funding: Merck

Ramaswamy Govindan

Honoraria: Genentech, AbbVie

Consulting or Advisory Role: GlaxoSmithKline, Genentech, AbbVie, Celgene, AstraZeneca/MedImmune, Inivata, Merck Serono, Pfizer, Bristol-Myers Squibb, EMD Serono, Eli Lilly, Ignyta, Nektar, Phillips Gilmore Oncology, Jounce Therapeutics

Stephen J. Salipante

Research Funding: Bristol-Myers Squibb

No other potential conflicts of interest were reported.

Figures

FIG 1.
FIG 1.
Methods for Clinical Interpretations of Variants in Cancer (CIViC) single-molecule molecular inversion probe (smMIP) development and validation using the Open-Sourced CIViC Annotation Pipeline (OpenCAP). The first series describes CIViC smMIP development. Variants were selected using sequence ontology identification numbers (IDs) and the CIViC Variant Evidence Score. Subsequently, eligible variants were categorized based on length, and smMIP reagents were designed to target regions of interest. The second series describes sample selection and sequencing methods. In total, there were 22 tumor samples derived from 5 tumor subtypes. Of these 27 samples, 15 had tumor and paired normal samples, and 7 were tumor-only samples. The third series shows the analysis used to validate the CIViC smMIP design. Variants were called using the pipeline described in Materials and Methods, and accuracy was attained by comparing variants observed on original sequencing to variants observed using the CIViC smMIP capture panel. Variant allele frequencies (VAFs) across both platforms were also compared. AML, acute myeloid leukemia; CRC, colorectal cancer; HL, Hodgkin lymphoma; HNSCC, head and neck squamous cell carcinoma; SCLC, small-cell lung cancer.
FIG 2.
FIG 2.
Regions targeted by the Clinical Interpretations of Variants in Cancer (CIViC) single-molecule molecular inversion probes (smMIPs) are, by design, supported by extensive clinical relevance according to the CIViC database. Variants that were eligible for CIViC smMIP development were divided into various coverage methods based on sequence ontology identification number and length. The bar graph shows the total number of evidence items used for each of the groups parsed by the evidence type.
FIG 3.
FIG 3.
Waterfall plot showing extensive overlap between variants observed using original exome or whole-genome sequencing with variants observed using Clinical Interpretations of Variants in Cancer (CIViC) single-molecule molecular inversion probe (smMIP) sequencing. Each column represents a sample that had original exome or whole-genome sequencing with subsequent orthogonal validation using the CIViC smMIP sequencing. Rows represent mutated genes across all samples. Numbers within each box represent the variant allele frequency (VAF) observed on original exome or whole-genome sequencing. Red boxes indicate that a variant was observed by CIViC smMIPs and validated with original exome or whole-genome sequencing. Blue boxes indicate that the variant was observed on original exome or whole-genome sequencing but not identified via the CIViC smMIP capture panel. The left panel indicates the number of samples containing a mutation in the indicated gene. AML, acute myeloid leukemia; CRC, colorectal cancer; HL, Hodgkin lymphoma; HNSCC, head and neck squamous cell carcinoma; SCLC, small-cell lung cancer.
FIG 4.
FIG 4.
Variant allele frequencies (VAFs) observed using original exome or whole-genome sequencing compared with VAFs observed using Clinical Interpretations of Variants in Cancer (CIViC) single-molecule molecular inversion probe (smMIP) sequencing. (A) Correlation of VAF with original sequencing parsed by sequencing status (ie, passed sequencing if total sequencing counts were > 1 standard deviation from the mean and tag complexity was > 600,000 unique captured smMIPs). (B) Correlation of VAF with validation status (ie, true if the variant identified using exome or genome sequencing was identified on CIViC smMIP sequencing). (C) Correlation of VAF parsed by coverage at variant loci. (D) Correlation of VAF parsed by DNA mass input for library construction. (E) Correlation of VAF parsed by presence or absence of matched normal tissue. (F) Correlation of VAF parsed by tumor type. AML, acute myeloid leukemia; CRC, colorectal cancer; HL, Hodgkin lymphoma; OSCC, oral squamous cell carcinoma; SCLC, small-cell lung cancer.
FIG 5.
FIG 5.
Analysis of variants rescued by Clinical Interpretations of Variants in Cancer (CIViC) single-molecule molecular inversion probe (smMIP) sequencing for samples with both tumor and matched normal. There were 217 variants called as somatic by CIViC smMIP sequencing that were not identified by the original sequencing. All variants were manually reviewed using both CIViC smMIP sequencing data and original sequencing data. (A) During manual review, 55 variants were identified as germline. A histogram shows that the distribution of the smMIP variant allele frequencies (VAFs) for these germline variants was observed at 50% and 100% VAF, indicating heterozygosity and homozygosity, respectively. (B) An additional 36 variants were identified as sequencing artifacts. Most artifacts were either mononucleotide repeats (MN), dinucleotide repeats (DN), or tandem repeats (TR). Other artifacts include multiple mismatches (MM) or multiple variants (MV). (C) During manual review, 162 variants did not show any support in the original sequencing data. Most unsupported variants did not have sufficient coverage to be detected based on a binomial probability of ≤ 3 variant-supporting reads (see Materials and Methods). (D) The remaining 11 variants had variant support in original sequencing but were not called as somatic in final original annotation. The scatter plot shows correlation between original VAF and CIViC smMIP VAF for these variants.

References

    1. Nunes RA, Harris LN. The HER2 extracellular domain as a prognostic and predictive factor in breast cancer. Clin Breast Cancer. 2002;3:125–135. - PubMed
    1. Griffith M, Griffith OL, Smith SM, et al. Genome Modeling System: A knowledge management platform for genomics. PLOS Comput Biol. 2015;11:e1004274. - PMC - PubMed
    1. Mardis ER. The $1,000 genome, the $100,000 analysis? Genome Med. 2010;2:84. - PMC - PubMed
    1. Collins FS, Varmus H. A new initiative on precision medicine. N Engl J Med. 2015;372:793–795. - PMC - PubMed
    1. Gray SW, Hicks-Courant K, Cronin A, et al. Physicians’ attitudes about multiplex tumor genomic testing. J Clin Oncol. 2014;32:1317–1323. - PMC - PubMed

MeSH terms