Population genomics based on low coverage sequencing: how low should we go?
- PMID: 23174005
- DOI: 10.1111/mec.12105
Population genomics based on low coverage sequencing: how low should we go?
Abstract
Research in molecular ecology is now often based on large numbers of DNA sequence reads. Given a time and financial budget for DNA sequencing, the question arises as to how to allocate the finite number of sequence reads among three dimensions: (i) sequencing individual nucleotide positions repeatedly and achieving high confidence in the true genotype of individuals, (ii) sampling larger numbers of individuals from a population, and (iii) sampling a larger fraction of the genome. Leaving aside the question of what fraction of the genome to sample, we analyze the trade-off between repeatedly sequencing the same nucleotide position (coverage depth) and the number of individuals in the sample. We review simple Bayesian models for allele frequencies and utilize these in the analysis of how to obtain maximal information about population genetic parameters. The models indicate that sampling larger numbers of individuals, at the expense of coverage depth per nucleotide position, provides more information about population parameters. Dividing the sequencing effort maximally among individuals and obtaining approximately one read per locus and individual (1 × coverage) yields the most information about a population. Some analyses require genetic parameters for individuals, in which case Bayesian population models also support inference from lower coverage sequence data than are required for simple likelihood models. Low coverage sequencing is not only sufficient to support inference, but it is optimal to design studies to utilize low coverage because they will yield highly accurate and precise parameter estimates based on more individuals or sites in the genome.
© 2012 John Wiley & Sons Ltd.
Similar articles
-
Estimation of population allele frequencies from next-generation sequencing data: pool-versus individual-based genotyping.Mol Ecol. 2013 Jul;22(14):3766-79. doi: 10.1111/mec.12360. Epub 2013 Jun 4. Mol Ecol. 2013. PMID: 23730833
-
How to infer reliable diploid genotypes from NGS or traditional sequence data: from basic probability to experimental optimization.J Evol Biol. 2012 May;25(5):949-60. doi: 10.1111/j.1420-9101.2012.02488.x. Epub 2012 Mar 16. J Evol Biol. 2012. PMID: 22420488
-
Statistical modeling of coverage in high-throughput data.Methods Mol Biol. 2013;1038:61-79. doi: 10.1007/978-1-62703-514-9_4. Methods Mol Biol. 2013. PMID: 23872969
-
Massive parallel sequencing in animal genetics: wherefroms and wheretos.Anim Genet. 2010 Dec;41(6):561-9. doi: 10.1111/j.1365-2052.2010.02057.x. Anim Genet. 2010. PMID: 20477787 Review.
-
Population genomics: genome-wide sampling of insect populations.Annu Rev Entomol. 2001;46:441-69. doi: 10.1146/annurev.ento.46.1.441. Annu Rev Entomol. 2001. PMID: 11112176 Review.
Cited by
-
Standing genetic variation in laboratory populations of insecticide-susceptible Phlebotomus papatasi and Lutzomyia longipalpis (Diptera: Psychodidae: Phlebotominae) for the evolution of resistance.Evol Appl. 2021 Feb 9;14(5):1248-1262. doi: 10.1111/eva.13194. eCollection 2021 May. Evol Appl. 2021. PMID: 34025765 Free PMC article.
-
The impact of sequencing depth and relatedness of the reference genome in population genomic studies: A case study with two caddisfly species (Trichoptera, Rhyacophilidae, Himalopsyche).Ecol Evol. 2022 Dec 12;12(12):e9583. doi: 10.1002/ece3.9583. eCollection 2022 Dec. Ecol Evol. 2022. PMID: 36523526 Free PMC article.
-
Improving genomic predictions by correction of genotypes from genotyping by sequencing in livestock populations.J Anim Sci Biotechnol. 2019 Jan 24;10:8. doi: 10.1186/s40104-019-0315-z. eCollection 2019. J Anim Sci Biotechnol. 2019. PMID: 30719286 Free PMC article.
-
Ecological genomics of local adaptation in Cornus florida L. by genotyping by sequencing.Ecol Evol. 2016 Dec 20;7(1):441-465. doi: 10.1002/ece3.2623. eCollection 2017 Jan. Ecol Evol. 2016. PMID: 28070306 Free PMC article.
-
Genotype Calling from Population-Genomic Sequencing Data.G3 (Bethesda). 2017 May 5;7(5):1393-1404. doi: 10.1534/g3.117.039008. G3 (Bethesda). 2017. PMID: 28108551 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources