Assembly reconciliation
- PMID: 18057021
- DOI: 10.1093/bioinformatics/btm542
Abstract
Motivation: Many genomes are sequenced by a collaboration of several centers, and each center then produces an assembly using its own assembly software. The collaborators then pick the draft assembly that they judge to be the best, and the information contained in the other assemblies is usually not used.
Methods: We have developed a technique that we call assembly reconciliation that can merge draft genome assemblies. It takes one draft assembly, detects apparent errors, and, when possible, patches the problem areas using pieces from alternative draft assemblies. It also closes gaps in places where one of the alternative assemblies has spanned the gap correctly.
Results: Using the assembly reconciliation technique, we produced reconciled assemblies of six Drosophila species in collaboration with Agencourt Bioscience and The J. Craig Venter Institute. These assemblies are now the official (CAF1) assemblies used for analysis. We also produced a reconciled assembly of the rhesus macaque genome, and this assembly is available from our website at http://www.genome.umd.edu.
Availability: The reconciliation software is available for download from http://www.genome.umd.edu/software.htm
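The gap-closing step described under Methods can be illustrated with a toy sketch. This is not the authors' software or algorithm: it assumes the assemblies are plain strings, that gaps are runs of `N`, and that a gap may be filled only when both flanking sequences occur exactly once in the alternative assembly. The function name `close_gaps` and the `flank` parameter are hypothetical, chosen for illustration only; a real reconciliation tool would work from alignments rather than exact string matches.

```python
import re


def close_gaps(primary: str, alternative: str, flank: int = 8) -> str:
    """Toy gap closing (illustrative assumption, not the published method).

    Each run of N in `primary` is replaced by the sequence that lies between
    its left and right flanks in `alternative`, provided both flanks are
    found exactly once and in the right order. Otherwise the gap is kept.
    """
    pieces = []
    pos = 0
    for gap in re.finditer(r"N+", primary):
        left = primary[max(0, gap.start() - flank):gap.start()]
        right = primary[gap.end():gap.end() + flank]
        fill = None
        usable = (
            len(left) == flank
            and len(right) == flank
            and "N" not in left + right
        )
        # Require unique flank matches so the patch is unambiguous.
        if usable and alternative.count(left) == 1 and alternative.count(right) == 1:
            i = alternative.find(left)
            j = alternative.find(right)
            if j >= i + flank:
                fill = alternative[i + flank:j]
        pieces.append(primary[pos:gap.start()])
        pieces.append(gap.group() if fill is None else fill)
        pos = gap.end()
    pieces.append(primary[pos:])
    return "".join(pieces)


primary = "ACGTACGGTTCA" + "NNNNN" + "GGATCCTTAGCA"
alternative = "TTACGTACGGTTCA" + "CCGA" + "GGATCCTTAGCATT"
patched = close_gaps(primary, alternative)
```

In this example the alternative assembly spans the gap, so the five `N` characters are replaced by the four bases it places between the two flanks. When the flanks are missing or ambiguous in the alternative, the gap is left untouched.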