Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 22;16(9):e0250915.
doi: 10.1371/journal.pone.0250915. eCollection 2021.

Exogene: A performant workflow for detecting viral integrations from paired-end next-generation sequencing data

Affiliations

Exogene: A performant workflow for detecting viral integrations from paired-end next-generation sequencing data

Zachary Stephens et al. PLoS One. .

Abstract

The integration of viruses into the human genome is known to be associated with tumorigenesis in many cancers, but the accurate detection of integration breakpoints from short read sequencing data is made difficult by human-viral homologies, viral genome heterogeneity, coverage limitations, and other factors. To address this, we present Exogene, a sensitive and efficient workflow for detecting viral integrations from paired-end next generation sequencing data. Exogene's read filtering and breakpoint detection strategies yield integration coordinates that are highly concordant with long read validation. We demonstrate this concordance across 6 TCGA Hepatocellular carcinoma (HCC) tumor samples, identifying integrations of hepatitis B virus that are also supported by long reads. Additionally, we applied Exogene to targeted capture data from 426 previously studied HCC samples, achieving 98.9% concordance with existing methods and identifying 238 high-confidence integrations that were not previously reported. Exogene is applicable to multiple types of paired-end sequence data, including genome, exome, RNA-Seq and targeted capture.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Overview of Exogene workflow.
Fig 2
Fig 2. Comparison of evidence for HBV integration at chr7:72,027,703 from Exogene and HGT-ID.
Shaded regions indicate breakpoint ranges as inferred from read fragment lengths and orientations, darker shades indicate greater support.
Fig 3
Fig 3. Concordance rate of Exogene and HIVID calls as a function of minimum coverage.
Fig 4
Fig 4. Concordance rate of Exogene and HIVID calls as a function of minimum allowable percentage of reads aligned with mapping quality 0.

References

    1. White MK, Pagano JS, Khalili K. Viruses and human cancers: a long road of discovery of molecular paradigms. Clinical microbiology reviews. 2014;27(3):463–481. doi: 10.1128/CMR.00124-13 - DOI - PMC - PubMed
    1. Pagano JS, Blaser M, Buendia MA, Damania B, Khalili K, Raab-Traub N, et al..; Elsevier. Infectious agents and cancer: criteria for a causal relation. Seminars in cancer biology. 2004;14(6):453–471. doi: 10.1016/j.semcancer.2004.06.009 - DOI - PubMed
    1. Henle G, Henle W, Clifford P, Diehl V, Kafuko GW, Kirya BG, et al.. Antibodies to Epstein-Barr virus in Burkitt’s lymphoma and control groups. Journal of the National Cancer Institute. 1969;43(5):1147–1157. - PubMed
    1. Nonoyama M, Kawai Y, Pagano J. Detection of Epstein-Barr virus DNA in human tumors. Bibliotheca Haematologica. 1975;40:577–583. - PubMed
    1. Mincheva A, Gissmann L, Zur Hausen H. Chromosomal integration sites of human papillomavirus DNA in three cervical cancer cell lines mapped by in situ hybridization. Medical microbiology and immunology. 1987;176(5):245–256. doi: 10.1007/BF00190531 - DOI - PubMed

Publication types