Phylogenetic understanding of clonal populations in an era of whole genome sequencing
- PMID: 19477301
- DOI: 10.1016/j.meegid.2009.05.014
Abstract
Phylogenetic hypotheses using whole genome sequences have the potential for unprecedented accuracy, yet a failure to understand issues associated with discovery bias, character sampling, and strain sampling can lead to highly erroneous conclusions. For microbial pathogens, phylogenies derived from whole genome sequences are becoming more common, as large numbers of characters distributed across entire genomes can yield extremely accurate phylogenies, particularly for strictly clonal populations. The availability of whole genomes is increasing as new sequencing technologies reduce the cost and time required for genome sequencing. Until entire sample collections can be fully sequenced, harnessing the phylogenetic power from whole genome sequences in more than a small subset of fully sequenced strains requires the integration of whole genome and partial genome genotyping data. Such integration involves discovering evolutionarily stable polymorphic characters by whole genome comparisons, then determining allelic states across a wide panel of isolates using high-throughput genotyping technologies. Here, we demonstrate how such an approach using single nucleotide polymorphisms (SNPs) yields highly accurate, but biased phylogenetic reconstructions and how the accuracy of the resulting tree is compromised by incomplete taxon and character sampling. Despite recent phylogenetic work detailing the strengths and biases of integrating whole genome and partial genome genotype data, these issues are relatively new and remain poorly understood by many researchers. Here, we revisit these biases and provide strategies for maximizing phylogenetic accuracy. Although we write this review with bacterial pathogens in mind, these concepts apply to any clonally reproducing population or indeed to any evolutionarily stable marker that is inherited in a strictly clonal manner. 
Current and emerging technologies can maximize phylogenetic knowledge only when the strengths and weaknesses of these methods are fully understood.
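The discovery bias described in the abstract can be illustrated with a small sketch. It is not the authors' method or data: the four strains, the SNP identifiers, and the helper names (discover_snps, genotype_panel, pairwise_distances) are all hypothetical, and the population is assumed to be strictly clonal with no homoplasy, so each strain is simply the set of derived SNPs it carries. SNPs are "discovered" by comparing only the fully sequenced strains and are then "genotyped" across every strain.

```python
from itertools import combinations

# Hypothetical derived-allele sets for a strictly clonal population.
# Assumed true tree: A branches off first, then B, then C and D are sisters.
TRUE_GENOTYPES = {
    "A": frozenset({1, 2, 3, 4}),
    "B": frozenset({1, 2, 5, 6, 7}),
    "C": frozenset({1, 2, 5, 8, 9, 10}),
    "D": frozenset({1, 2, 5, 8, 9, 11, 12}),
}


def discover_snps(genotypes, sequenced):
    """Return SNPs that are polymorphic among the fully sequenced strains."""
    sets = [genotypes[s] for s in sequenced]
    return frozenset.union(*sets) - frozenset.intersection(*sets)


def genotype_panel(genotypes, snps):
    """Score every strain at the discovered SNPs only (partial genotyping)."""
    return {strain: alleles & snps for strain, alleles in genotypes.items()}


def pairwise_distances(profiles):
    """Count SNP differences between every pair of strains."""
    return {
        (a, b): len(profiles[a] ^ profiles[b])
        for a, b in combinations(sorted(profiles), 2)
    }


# Distances if every strain were fully sequenced.
true_dist = pairwise_distances(TRUE_GENOTYPES)

# Discovery from only two genomes (A and B), then genotyping all four strains.
snps_ab = discover_snps(TRUE_GENOTYPES, ["A", "B"])
biased_dist = pairwise_distances(genotype_panel(TRUE_GENOTYPES, snps_ab))

# Adding D to the discovery panel recovers part of the missing structure.
snps_abd = discover_snps(TRUE_GENOTYPES, ["A", "B", "D"])
improved_dist = pairwise_distances(genotype_panel(TRUE_GENOTYPES, snps_abd))

print(sorted(snps_ab))                   # [3, 4, 5, 6, 7]
print(true_dist[("A", "B")], biased_dist[("A", "B")])  # 5 5
print(true_dist[("C", "D")], biased_dist[("C", "D")])  # 3 0
print(improved_dist[("C", "D")])         # 2
```

In this toy case the path between the two discovery genomes is resolved exactly (A and B differ by 5 SNPs in both analyses), while C and D, which truly differ by 3 SNPs, collapse onto a single point on that path. Sequencing D as well restores the D-specific branch, but C still sits on an internal node because its private SNP was never discovered. This is the kind of effect the abstract attributes to incomplete taxon and character sampling.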