Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Apr 16;4(4):669-79.
doi: 10.1534/g3.114.010264.

Sequencing, assembling, and correcting draft genomes using recombinant populations

Affiliations

Sequencing, assembling, and correcting draft genomes using recombinant populations

Matthew W Hahn et al. G3 (Bethesda). .

Abstract

Current de novo whole-genome sequencing approaches often are inadequate for organisms lacking substantial preexisting genetic data. Problems with these methods are manifest as: large numbers of scaffolds that are not ordered within chromosomes or assigned to individual chromosomes, misassembly of allelic sequences as separate loci when the individual(s) being sequenced are heterozygous, and the collapse of recently duplicated sequences into a single locus, regardless of levels of heterozygosity. Here we propose a new approach for producing de novo whole-genome sequences-which we call recombinant population genome construction-that solves many of the problems encountered in standard genome assembly and that can be applied in model and nonmodel organisms. Our approach takes advantage of next-generation sequencing technologies to simultaneously barcode and sequence a large number of individuals from a recombinant population. The sequences of all recombinants can be combined to create an initial de novo assembly, followed by the use of individual recombinant genotypes to correct assembly splitting/collapsing and to order and orient scaffolds within linkage groups. Recombinant population genome construction can rapidly accelerate the transformation of nonmodel species into genome-enabled systems by simultaneously producing a high-quality genome assembly and providing genomic tools (e.g., high-confidence single-nucleotide polymorphisms) for immediate applications. In populations segregating for important functional traits, this approach also enables simultaneous mapping of quantitative trait loci. We demonstrate our method using simulated Illumina data from a recombinant population of Caenorhabditis elegans and show that the method can produce a high-fidelity, high-quality genome assembly for both parents of the cross.

Keywords: assembly; duplication; genetics; genome; next-generation sequencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
General outline of the steps involved in recombinant population genome construction. Each individually numbered step is described in detail in the text.
Figure 2
Figure 2
The pattern of segregation of collapsed duplicates through an F2 cross. The top panel shows the physical reality of genes arranged on chromosomes, with two duplicates present in each parent, F1, and F2. The bottom panel shows how, when duplicates are collapsed into a single locus, all individuals appear to be heterozygous at sites that differentiate the two copies.
Figure 3
Figure 3
The pattern of segregation of split alleles through an F2 cross. The top panel shows the physical reality of a single gene that differs in allelic sequence between the parents. The F1 and half the F2s are heterozygous. The bottom panel shows how, when alleles are split into two loci, each parent appears to be missing a locus, whereas the F1 and half the F2s have both loci present.

References

    1. Adams M. D., Celniker S. E., Holt R. A., Evans C. A., Gocayne J. D., et al. , 2000. The genome sequence of Drosophila melanogaster. Science 287: 2185–2195. - PubMed
    1. Alkan C., Sajjadian S., Eichler E. E., 2011. Limitations of next-generation genome sequence assembly. Nat. Methods 8: 61–65. - PMC - PubMed
    1. Amores A., Catchen J., Ferrara A., Fontenot Q., Postlethwait J. H., 2011. Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication. Genetics 188: 799–808. - PMC - PubMed
    1. Bailey J. A., Yavor A. M., Massa H. F., Trask B. J., Eichler E. E., 2001. Segmental duplications: organization and impact within the current Human Genome Project assembly. Genome Res. 11: 1005–1017. - PMC - PubMed
    1. Bailey J. A., Gu Z. P., Clark R. A., Reinert K., Samonte R. V., et al. , 2002. Recent segmental duplications in the human genome. Science 297: 1003–1007. - PubMed

Publication types

LinkOut - more resources