Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Oct 1:6:1069.
doi: 10.3389/fmicb.2015.01069. eCollection 2015.

Recovering full-length viral genomes from metagenomes

Affiliations
Review

Recovering full-length viral genomes from metagenomes

Saskia L Smits et al. Front Microbiol. .

Abstract

Infectious disease metagenomics is driven by the question: "what is causing the disease?" in contrast to classical metagenome studies which are guided by "what is out there?" In case of a novel virus, a first step to eventually establishing etiology can be to recover a full-length viral genome from a metagenomic sample. However, retrieval of a full-length genome of a divergent virus is technically challenging and can be time-consuming and costly. Here we discuss different assembly and fragment linkage strategies such as iterative assembly, motif searches, k-mer frequency profiling, coverage profile binning, and other strategies used to recover genomes of potential viral pathogens in a timely and cost-effective manner.

Keywords: assembly; coverage analysis; k-mer analysis; metagenomics; motif discovery; virus discovery; viruses; zoonotic pathogens.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Metagenome composition before and after genome finishing. Red bars: Percentage of reads of the metagenome assigned to target genome after similarity searches and before genome finishing. Blue bars: Percentage of reads of the metagenome assigned to target genome after genome finishing strategies were applied to the target genome. DRV, dolphin rhabdovirus; RFFRV, red fox fecal rhabdovirus; PNV, python nidovirus.
Figure 2
Figure 2
Genome completeness before and after genome finishing. (A) Genome of dolphin rhabdovirus (DRV). (B) Genome of red fox fecal rhabdovirus (RFFRV). (C) Genome of python nidovirus (PNV). Indicated is contig size after initial genome assembly (red) and after genome finishing (blue). Striped line at the 5′ end of PNV indicates putative unresolved 5′ end.
Figure 3
Figure 3
Steps in recovery of full-length viral genomes from metagenomes. For description see article text.

References

    1. Aguirre de Cárcer D., Angly F. E., Alcamí A. (2014). Evaluation of viral genome assembly and diversity estimation in deep metagenomes. BMC Genomics 15:989. 10.1186/1471-2164-15-989 - DOI - PMC - PubMed
    1. Albertsen M., Hugenholtz P., Skarshewski A., Nielsen K. L., Tyson G. W., Nielsen P. H. (2013). Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nat. Biotechnol. 31, 533–538. 10.1038/nbt.2579 - DOI - PubMed
    1. Alneberg J., Bjarnason B. S., de Bruijn I., Schirmer M., Quick J., Ijaz U. Z., et al. . (2014). Binning metagenomic contigs by coverage and composition. Nat. Methods 11, 1144–1146. 10.1038/nmeth.3103 - DOI - PubMed
    1. Altschul S. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. 10.1093/nar/25.17.3389 - DOI - PMC - PubMed
    1. Bailey T. L., Johnson J., Grant C. E., Noble W. S. (2015). The MEME Suite. Nucleic Acids Res. 43, W39–W49. 10.1093/nar/gkv416 - DOI - PMC - PubMed

LinkOut - more resources