Toward almost closed genomes with GapFiller
- PMID: 22731987
- PMCID: PMC3446322
- DOI: 10.1186/gb-2012-13-6-r56
Abstract
De novo assembly is a commonly used application of next-generation sequencing experiments. The ultimate goal is to piece millions of reads together into one complete genome, but draft assemblies usually consist of a number of gapped scaffold sequences. In this paper we propose an automated strategy, called GapFiller, to reliably close gaps within scaffolds using paired reads. The method shows good results on both bacterial and eukaryotic datasets, introducing only a few errors. As a consequence, the amount of additional wet-lab work needed to close a genome is drastically reduced. The software is available at http://www.baseclear.com/bioinformatics-tools/.
Similar articles
- GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads. BMC Genomics. 2019 Jun 6;20(Suppl 5):426. doi: 10.1186/s12864-019-5703-4. PMID: 31167639.
- GFinisher: a new strategy to refine and finish bacterial genome assemblies. Sci Rep. 2016 Oct 10;6:34963. doi: 10.1038/srep34963. PMID: 27721396.
- GapFiller: a de novo assembly approach to fill the gap within paired reads. BMC Bioinformatics. 2012;13 Suppl 14(Suppl 14):S8. doi: 10.1186/1471-2105-13-S14-S8. Epub 2012 Sep 7. PMID: 23095524.
- The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing. Brief Funct Genomics. 2019 Feb 14;18(1):1-12. doi: 10.1093/bfgp/ely037. PMID: 30462154. Review.
- Assembly, Annotation, and Comparative Genomics in PATRIC, the All Bacterial Bioinformatics Resource Center. Methods Mol Biol. 2018;1704:79-101. doi: 10.1007/978-1-4939-7463-4_4. PMID: 29277864. Review.
Cited by
- Genomic characteristics of antimicrobial resistance and virulence factors of carbapenem-resistant Stutzerimonas nitrititolerans isolated from the clinical specimen. BMC Microbiol. 2024 Oct 3;24(1):386. doi: 10.1186/s12866-024-03546-4. PMID: 39358682.
- Genome sequence of Enterobacter sp. strain SP1, an endophytic nitrogen-fixing bacterium isolated from sugarcane. J Bacteriol. 2012 Dec;194(24):6963-4. doi: 10.1128/JB.01933-12. PMID: 23209221.
- Sealer: a scalable gap-closing application for finishing draft genomes. BMC Bioinformatics. 2015 Jul 25;16(1):230. doi: 10.1186/s12859-015-0663-4. PMID: 26209068.
- The Genome and Methylome of a Subsocial Small Carpenter Bee, Ceratina calcarata. Genome Biol Evol. 2016 May 13;8(5):1401-10. doi: 10.1093/gbe/evw079. PMID: 27048475.
- Contig-Layout-Authenticator (CLA): A Combinatorial Approach to Ordering and Scaffolding of Bacterial Contigs for Comparative Genomics and Molecular Epidemiology. PLoS One. 2016 Jun 1;11(6):e0155459. doi: 10.1371/journal.pone.0155459. eCollection 2016. PMID: 27248146.