Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2012 Nov;7(11):1283-1296.
doi: 10.2217/fmb.12.108.

Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences

Affiliations
Review

Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences

Josephine Bryant et al. Future Microbiol. 2012 Nov.

Abstract

Evolution of bacterial pathogen populations has been detected in a variety of ways including phenotypic tests, such as metabolic activity, reaction to antisera and drug resistance and genotypic tests that measure variation in chromosome structure, repetitive loci and individual gene sequences. While informative, these methods only capture a small subset of the total variation and, therefore, have limited resolution. Advances in sequencing technologies have made it feasible to capture whole-genome sequence variation for each sample under study, providing the potential to detect all changes at all positions in the genome from single nucleotide changes to large-scale insertions and deletions. In this review, we focus on recent work that has applied this powerful new approach and summarize some of the advances that this has brought in our understanding of the details of how bacterial pathogens evolve.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Mapping and assembly with sequencing reads
Both mapping to a high-quality reference and de novo assembly of sequencing reads can provide us with evolutionary information. However, the resolution of this information and their possible applications vary. Mapping enables the rapid identification of high-quality SNPs, which can be used to build phylogenies for both evolutionary and epidemiological inference. However, de novo assembly is required to study larger variants, such as insertion/deletions, mobile elements and rearrangements. SNP: Single-nucleotide polymorphism.
Figure 2
Figure 2. Detecting single nucleotide variation and large-scale insertions and deletions from read mapping
An example looking at Chlamydia trachomatis chromosome and plasmid sequences. The pale blue bar highlights a deletion site observed from read data which has significant implications for the effective typing of this pathogen (see [99] for further details). (A) Depth of coverage for reads from a test genome mapped against a reference genome sequence represented by the orange bar (chromosome) and brown bar (plasmid). The depth of coverage steps up in the plasmid region due to its higher copy number per cell relative to the chromosome. Towards the right of the plasmid region the plot drops to zero, indicating a deletion in the test sample relative to the reference. (B) Stack view of the mapped reads showing those that are exactly equivalent in sequence and length to each other (green), those that are unique in sequence and length (blue) and positions where the reads disagree with the reference (red). Sporadic red marks likely represent errors, while true variants are indicated by vertical red lines where all reads mapped to that position indicate the variant. As with (A), the deletion region shows no mapped reads. (C) Insertion/deletion size can be estimated by plotting read pair inferred size on a log scale. The (log) mapped insert size is plotted on the y-axis to show that, where there is an insertion/deletion, the mapped insert size is increased relative to the mean insert size, which is the true fragment length. In essence, by mapping to a reference that does not have the deletion, we artificially introduce sequence length between the reads, thus increasing the mapped insert size. This reveals the presence of the deletion by the increase in inferred size and drop in coverage. The inferred insert size calculated from this subset of reads is far bigger than the normal size range of read pairs in this region. In addition, the absence of lines linking paired reads within the normal size range across this region is indicative of deletion. Image provided courtesy of S Harris and T Carver (Wellcome Trust Sanger Institute, UK).

References

    1. Fleischmann RD, Adams MD, White O, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269(5223):496–512. - PubMed
    1. Fraser CM, Gocayne JD, White O, et al. The minimal gene complement of Mycoplasma genitalium. Science. 1995;270(5235):397–403. - PubMed
    1. Wassenaar TM, Gaastra W. Bacterial virulence: can we draw the line? FEMS Microbiol. Lett. 2001;201(1):1–7. - PubMed
    1. Nowrousian M, Teichert I, Masloff S, Kück U. Whole-genome sequencing of Sordaria macrospora mutants identifies developmental genes. G3. 2012;2(2):261–270. - PMC - PubMed
    1. Renzoni A, Andrey DO, Jousselin A, et al. Whole genome sequencing and complete genetic analysis reveals novel pathways to glycopeptide resistance in Staphylococcus aureus. PLoS One. 2011;6(6):e21577. - PMC - PubMed

Substances