Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 13:5:11534.
doi: 10.1038/srep11534.

Vy-PER: eliminating false positive detection of virus integration events in next generation sequencing data

Collaborators, Affiliations

Vy-PER: eliminating false positive detection of virus integration events in next generation sequencing data

Michael Forster et al. Sci Rep. .

Abstract

Several pathogenic viruses such as hepatitis B and human immunodeficiency viruses may integrate into the host genome. These virus/host integrations are detectable using paired-end next generation sequencing. However, the low number of expected true virus integrations may be difficult to distinguish from the noise of many false positive candidates. Here, we propose a novel filtering approach that increases specificity without compromising sensitivity for virus/host chimera detection. Our detection pipeline termed Vy-PER (Virus integration detection bY Paired End Reads) outperforms existing similar tools in speed and accuracy. We analysed whole genome data from childhood acute lymphoblastic leukemia (ALL), which is characterised by genomic rearrangements and usually associated with radiation exposure. This analysis was motivated by the recently reported virus integrations at genomic rearrangement sites and association with chromosomal instability in liver cancer. However, as expected, our analysis of 20 tumour and matched germline genomes from ALL patients finds no significant evidence for integrations by known viruses. Nevertheless, our method eliminates 12,800 false positives per genome (80× coverage) and only our method detects singleton human-phiX174-chimeras caused by optical errors of the Illumina HiSeq platform. This high accuracy is useful for detecting low virus integration levels as well as non-integrated viruses.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Detection of virus integrations into the human genome using paired-end sequencing.
After the computationally expensive classical alignment of all paired-end reads to the human genome, the pipeline splits into classical variant calling and virus integration detection.
Figure 2
Figure 2. Vy-PER ideogram summary plot.
Negative example without true virus integrations: Patient genome sequenced with 40× coverage and analysed with highest sensitivity, eliminating 6400 false positives and leaving three unsupported singletons (pseudomonas phage, caviid herpes, cercopithecine herpes) which we manually eliminated as alignment artefacts. PhiX and M13 originated from the 1% Illumina sequencing library control spike-in. The plot shows integration sites down to single chimeras.
Figure 3
Figure 3. Vy-PER ideogram summary plot.
Positive example with true virus integrations: Publicly available whole transcriptome liver cancer data analysed with default sensitivity, showing HBV candidate loci on chromosomes 4, 11 and 16. The plot only shows integrations supported by 10 or more chimeras.
Figure 4
Figure 4. Vy-PER ideogram summary plot.
HBV integration loci into the liver cancer genome detected at low stringency (threshold: 1 chimera), also showing detected phiX singletons.

Similar articles

Cited by

References

    1. Bertino P. et al.. Fowlpox-based survivin vaccination for malignant mesothelioma therapy. Int. J. Cancer 133, 612–623 (2013). - PMC - PubMed
    1. Baum C., Schambach A., Bohne J. & Galla M. Retrovirus vectors: toward the plentivirus? Mol. Ther. 13, 1050–1063 (2006). - PubMed
    1. Soliman H. Developing an effective breast cancer vaccine. Cancer Control 17, 183–190 (2010). - PubMed
    1. Bertino P. et al.. Vaccination with a piggyBac plasmid with transgene integration potential leads to sustained antigen expression and CD8(+) T cell responses. Vaccine 32, 1670–1677 (2014). - PMC - PubMed
    1. Watanabe T. Current status of HTLV-1 infection. Int. J. Hematol. 94, 430–434 (2011). - PubMed

Publication types

MeSH terms