Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2016 Sep 29;3(1):173-195.
doi: 10.1146/annurev-virology-110615-035747. Epub 2016 Aug 3.

Genomic Analysis of Viral Outbreaks

Affiliations
Review

Genomic Analysis of Viral Outbreaks

Shirlee Wohl et al. Annu Rev Virol. .

Abstract

Genomic analysis is a powerful tool for understanding viral disease outbreaks. Sequencing of viral samples is now easier and cheaper than ever before and can supplement epidemiological methods by providing nucleotide-level resolution of outbreak-causing pathogens. In this review, we describe methods used to answer crucial questions about outbreaks, such as how they began and how a disease is transmitted. More specifically, we explain current techniques for viral sequencing, phylogenetic analysis, transmission reconstruction, and evolutionary investigation of viral pathogens. By detailing the ways in which genomic data can help us understand viral disease outbreaks, we aim to provide a resource that will facilitate the response to future outbreaks.

Keywords: evolution; infectious disease; phylogeny; sequencing; transmission.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Assembly and alignment pipeline for viral reads in heterogeneous samples. High-throughput sequencing reads are ❶ demultiplexed and filtered for high-quality reads, and ❷ depleted of host reads (130) and mapped to a database of possible viruses. ❸ Reads from each sample are de novo assembled, and ❹ all reads from each sample are mapped onto their own assembly. ❺ The consensus sequence is determined for each sample and then ❻ aligned to all other samples using multiple sequence alignment. See Supplemental Table 1 for available software for each step.
FIGURE 2
FIGURE 2
Rooting phylogenetic trees. Ebola virus (EBOV) sequences illustrate the importance of correctly rooting trees. Each point represents one sequence from the outbreak indicated by its color (scale bar = nucleotide substitutions per site). (a) Maximum likelihood tree rooted on the Zaire 1976 branch (shown to be the more likely root in (25, 53)). (b) The same tree rooted on the Guinea 2014 branch. Interpretation of the ancestral relationships of a single set of samples changes dramatically with root selection.
FIGURE 3
FIGURE 3
Tree topology illuminates the nature of an outbreak. (a) EBOV tree. Sequences from previous outbreaks are colored in distinct shades of green (see Figure 2). Selected sequences from the Sierra Leone outbreak highlight the low diversity within the outbreak, and the development of new clades (SL1–4, defined in (18, 25)) from a single recent ancestor. This topology suggests that each outbreak began with a single zoonotic transmission but was subsequently sustained by human-to-human transmission. (b) Lassa virus (LASV) tree containing S segment sequences from both human (circle nodes) and M. natalensis (rodent nodes) hosts. Samples are from Sierra Leone (109), where LASV is endemic. Sequences do not cluster by time or by host, indicating frequent animal-to-human transmission and a lack of discrete outbreaks.
FIGURE 4
FIGURE 4
Determining virus transmission. Transmission during the 2003 SARS outbreak in Singapore, for which both sequence and contact tracing data are available. (a) Maximum likelihood tree rooted on TOR2, the earliest reported case. The four major branches (blue boxes) roughly correspond to geographic origin (top to bottom: China, Taiwan, Singapore, Singapore). (b) Transmission tree reconstruction from sequences and sample collection dates (created using outbreaker (85); generation time: gamma distribution with mean = 8.4 and sd = 3.8, based on values from (1)). Red and yellow circles correspond to the two Singapore clades identified in (a); lined red circles are samples with only sequence data (no contact tracing). Arrows are labeled with (number of SNPs between samples) / (posterior probability of transmission). (c) Transmission tree created during the SARS outbreak by contact tracing, as reported by (131). Gray circles are unreported cases assumed to be part of the transmission chain. Comparison of panels (a-c) shows that the three methods generate similar relationships between samples.
FIGURE 5
FIGURE 5
Rate-based tests for selection in viruses. Various tests identify signals of selection at different genomic scales. (a) dN-dS scores for every codon in the hemagglutinin (HA) gene of H5N1 influenza A (calculated using (104)). The highest dN-dS scores (red) indicate codons most likely under positive selection. (b) log(dN/dS) for each EBOV gene and for the mucin-like region of the glycoprotein (GP). (values from (18)). (c) Synonymous constraint for every codon position in the West Nile virus genome (sliding window = 20 nucleotides) (110). Red bars mark regions of excess constraint. Asterisks mark two known RNA structural elements (orange = structural proteins, yellow = non-structural proteins), a hairpin in the capsid gene and a pseudoknot element within non-structural protein 2A.

References

    1. Lipsitch M, Cohen T, Cooper B, Robins JM, Ma S, et al. Transmission dynamics and control of severe acute respiratory syndrome. Science. 2003;300(5627):1966–70. - PMC - PubMed
    1. Kiss IZ, Green DM, Kao RR. Disease contact tracing in random and clustered networks. Proc Biol Sci. 2005;272(1570):1407–14. - PMC - PubMed
    1. Porco TC, Holbrook KA, Fernyak SE, Portnoy DL, Reiter R, Aragón TJ. Logistics of community smallpox control through contact tracing and ring vaccination: a stochastic network model. BMC Public Health. 2004;4:34. - PMC - PubMed
    1. Kretzschmar M, van den Hof S, Jacco Wallinga, van Wijngaarden J. Ring vaccination and smallpox control. Emerg Infect Dis. 2004;10(5):832–41. - PMC - PubMed
    1. Klinkenberg D, Fraser C, Heesterbeek H. The effectiveness of contact tracing in emerging epidemics. PLoS ONE. 2006;1:e12. - PMC - PubMed