Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Nov 1;77(21):e31-e34.
doi: 10.1158/0008-5472.CAN-17-0337.

Variant Review with the Integrative Genomics Viewer

Affiliations

Variant Review with the Integrative Genomics Viewer

James T Robinson et al. Cancer Res. .

Abstract

Manual review of aligned reads for confirmation and interpretation of variant calls is an important step in many variant calling pipelines for next-generation sequencing (NGS) data. Visual inspection can greatly increase the confidence in calls, reduce the risk of false positives, and help characterize complex events. The Integrative Genomics Viewer (IGV) was one of the first tools to provide NGS data visualization, and it currently provides a rich set of tools for inspection, validation, and interpretation of NGS datasets, as well as other types of genomic data. Here, we present a short overview of IGV's variant review features for both single-nucleotide variants and structural variants, with examples from both cancer and germline datasets. IGV is freely available at https://www.igv.org Cancer Res; 77(21); e31-34. ©2017 AACR.

PubMed Disclaimer

Conflict of interest statement

Disclosure of Potential Conflicts of Interest

A.M.W. is an employee and shareholder of Pacific Biosciences, a company commercializing DNA sequencing technologies. The other authors declare no potential conflicts of interest.

Figures

Figure 1
Figure 1
This figure illustrates five examples of how IGV was used to visually highlight anomalies in aligned sequencing data to aid the investigator in their interpretation. A. Example of a false positive in a GC-rich region. Bases that do not match the reference genome are drawn with the letter of the called base (A, C, G, or T). Sorting and coloring alignments by strand reveal a strong strand bias indicative of a sequencing artifact. Pink and blue reads were aligned to the forward and reverse strand, respectively. B. A germline mutation mischaracterized as a somatic NOTCH2 mutation in a clinical cancer sample. The mutation caller used a mapping quality filter with threshold set to 20, removing most alignments that support the variant from the normal sample and resulting in a reference call. The short horizontal black lines in the alignments represent 2-nucleotide deletions, and the colored bars in the grey alignments represent bases that did not match the reference sequence. In particular, the green bars represent bases that were called as A’s. C. A clinically actionable multi-nucleotide deletion-insertion event (L747_A750delinsP) initially misclassified as two independent events. The horizontal black lines in the grey alignments represent deletions in alignments that otherwise match the reference sequence. The blue column of C’s indicates a large number of mismatches against the G at that locus of the reference sequence. D. Comparison of NGS paired-end short reads (top panel) and third-generation long reads (bottom panel) spanning a 500 base pair inversion. The short reads are colored by pair orientation. The two reads of each pair were expected to point inward toward each other when aligned to the reference sequence. The teal color represents pairs where both reads unexpectedly aligned in the “right” direction, and the dark blue represents pairs where both aligned in the “left” direction. Breakpoints can be inferred from the short reads by careful examination of the outer read extents and dip in coverage. The long reads are split into multiple alignments and each alignment color-coded by strand. Pink and blue segments were aligned to the forward and reverse strand, respectively. The breakpoints are directly visible as the boundaries between alternating strands. E. PacBio long reads support a 4.3 kb translocation from chr19 to chr16 in the SK-BR-3 breast cancer cell line. A black line represents a deletion, and a purple box is an insertion. Deletions and insertions are labeled with the event size.

References

    1. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303. - PMC - PubMed
    1. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6. - PMC - PubMed
    1. Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14:178–92. - PMC - PubMed
    1. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9. - PMC - PubMed
    1. Strom SP. Current practices and guidelines for clinical next-generation sequencing oncology testing. Cancer Biol Med. 2016;13:3–11. - PMC - PubMed

Publication types