Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Apr;199(4):1229-41.
doi: 10.1534/genetics.115.174664. Epub 2015 Jan 27.

The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population

Affiliations

The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population

Justin B Lack et al. Genetics. 2015 Apr.

Abstract

Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.

Keywords: Drosophila Genome Nexus; Drosophila melanogaster; genome assembly; population genomics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of genomic coverage and error rate for various genome assembly pipeline variations, based on resequencing of the D. melanogaster reference. (A) Heat maps illustrating variation in coverage (left) and error rate (right) at the Q75 threshold chosen to optimize the trade-off between coverage and error rate. (B) Evaluation of the trade-off between genomic coverage and error rate for the haploid caller of the Unified Genotyper; quality values ranged from 10 to 100. Resequenced genomes from the reference strain (y1 cn1 bw1 sp1) were modified to simulate realistic levels of variation. We chose a cutoff of Q75 (red) to maximize coverage and minimize error.
Figure 2
Figure 2
Relationship between the number of indels and the number of sites called by our two-round pipeline but not called in a single-round pipeline for two RG genomes (RG5 and RG33). Site counts (y-axis) and indel counts (x-axis) were determined in 100-kb windows across each genome.
Figure 3
Figure 3
Enrichment of each of nine annotation classes in the sites added by our two-round pipeline, but not called by a single-round pipeline, relative to genome-wide frequencies. We examined two RG genomes (RG5 and RG33) with an ∼30× mean depth and comparable coverage.
Figure 4
Figure 4
Mean sequencing depth vs. genetic distance (A) from the Zambia population and depth vs. coverage (B) for the AGES dataset genomes with high coverage on all chromosome arms (listed in Table S1). Circles indicate comparisons utilizing all windows with called sites, while triangles indicate comparisons including only sites called for all of the AGES and ZI genomes. Comparisons illustrate the effect of depth on genetic distance (A) and coverage (B) for genomes assembled using a single-round pipeline (red) vs. our two-round pipeline (blue). The two-round pipeline appears to alleviate the potential downward bias present in the single-round pipeline for depths below ∼20×, and the greater impact of depth on coverage for the single-round pipeline (B) suggests that the sites added by the two-round pipeline are driving the differences in distance to ZI.
Figure 5
Figure 5
A histogram of the proportions of each autosomal chromosome arm called heterozygous from the 205 DGRP genomes. Based on the cytological analysis of Huang et al. (2014), red arms were reported to be free of inversion polymorphism, while blue arms contained polymorphic inversions. The greatly increased heterozygosity of the latter category illustrates the effects of inversion polymorphism on inbreeding efficacy.
Figure 6
Figure 6
Heterogeneity in estimated cosmopolitan admixture proportions among individuals for each Sub-Saharan African population.

References

    1. Alkan C., Sajjadian S., Eichler E. E., 2011. Limitations of next-generation genome assembly. Nat. Methods 8: 61–65. - PMC - PubMed
    1. Aulard S., David J. R., Lemeunier F., 2002. Chromosomal inversion polymorphism in Afrotropical populations of Drosophila melanogaster. Genet. Res. 79: 49–63. - PubMed
    1. Baudry E., Vinier B., Veuille M., 2004. Non-African populations of Drosophila melanogaster have a unique origin. Mol. Biol. Evol. 21: 1482–1491. - PubMed
    1. Begun D. J., Aquadro C. F., 1993. African and North American populations of Drosophila melanogaster are very different at the DNA level. Nature 365: 548–550. - PubMed
    1. Bhatia G., Tandon A., Patterson N., Aldrich M. C., Ambrosone C. B., et al. , 2014. Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture. Am. J. Hum. Genet. 95: 1–8. - PMC - PubMed

Publication types