Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan 20:12:2.
doi: 10.1186/s13007-016-0107-9. eCollection 2016.

An efficient approach to BAC based assembly of complex genomes

Affiliations

An efficient approach to BAC based assembly of complex genomes

Paul Visendi et al. Plant Methods. .

Abstract

Background: There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success.

Results: We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes.

Conclusions: We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.

Keywords: 7DS; Assembly; BAC; Next-generation sequencing; SASSY; Saccharum spp; Triticum aestivum.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Optimal coverage for assembly. Assembly sizes vs coverage for each of the 11 sugarcane BACs. Assembly sizes peak at 450x and level off despite increase in coverage beyond 1500x
Fig. 2
Fig. 2
Mummer plot of assemblies of single BACs A, B, C, E against pooled BACs of ABCE
Fig. 3
Fig. 3
BES mappings on contigs of simulated pool (ABCE). Clones A, C and E have forward (M13_For) and reverse (SP6_Rev) BES (A01_M13_For, A01_SP6_Rev, C01_M13_For, C01_SP6_Rev, E01_M13_For, E01_SP6_Rev) respectively correctly mapped. Clone B had no BES available but 120 bp sequences from cloning vector ends (FOR and REV) were used to identify contig ends of clone B
Fig. 4
Fig. 4
Distribution of no of contigs and scaffolds per BAC for 96 BAC pools
Fig. 5
Fig. 5
Distribution and orientation of MP insert sizes on E coli (a), contigs (b) and scaffolds (c) of 96 wheat BAC pools. Y axis (MP read counts in log scale), X axis (insert sizes). Correctly orientated MP reads with orientation RF (< –, – >) are shown in green, shadow library MP reads mapping with orientation FR (– >, < –) are shown in orange and chimeric MP reads mapping with orientation FF (– >, – >) and RR (< –, < –) are shown in blue

References

    1. Ruperao P, Chan C-KK, Azam S, Karafiátová M, Hayashi S, Čížková J, Saxena RK, Šimková H, Song C, Vrána J, Chitikineni A, Visendi P, Gaur PM, Millán T, Singh KB, Tara B, Wang J, Batley J, Doležel J, Varshney RK, Edwards D. A chromosomal genomics approach to assess and validate the desi and kabuli draft chickpea genome assemblies. Plant Biotechnol J. 2014;12:778–786. doi: 10.1111/pbi.12182. - DOI - PubMed
    1. Kelley DR, Salzberg SL. Detection and correction of false segmental duplications caused by genome mis-assembly. Genome Biol. 2010;11:R28. doi: 10.1186/gb-2010-11-3-r28. - DOI - PMC - PubMed
    1. Alkan C, Sajjadian S, Eichler EE. Limitations of next-generation genome sequence assembly. Nat Methods. 2010;8:61–65. doi: 10.1038/nmeth.1527. - DOI - PMC - PubMed
    1. Berkman PJ, Visendi P, Lee HC, Stiller J, Manoli S, Lorenc MT, Lai K, Batley J, Fleury D, Šimková H, Kubaláková M, Weining S, Doležel J, Edwards D. Dispersion and domestication shaped the genome of bread wheat. Plant Biotechnol J. 2013;11:564–571. doi: 10.1111/pbi.12044. - DOI - PubMed
    1. Kajitani R, Toshimoto K, Noguchi H, Toyoda A, Ogura Y, Okuno M, Yabana M, Harada M, Nagayasu E, Maruyama H, Kohara Y, Fujiyama A, Hayashi T, Itoh T. Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res. 2014;24:1384–1395. doi: 10.1101/gr.170720.113. - DOI - PMC - PubMed

LinkOut - more resources