CGAL: computing genome assembly likelihoods
- PMID: 23360652
- PMCID: PMC3663106
- DOI: 10.1186/gb-2013-14-1-r8
Abstract
Assembly algorithms have been extensively benchmarked using simulated data so that results can be compared to ground truth. However, in de novo assembly, only crude metrics such as contig number and size are typically used to evaluate assembly quality. We present CGAL, a novel likelihood-based approach to assembly assessment in the absence of a ground truth. We show that likelihood is more accurate than other metrics currently used for evaluating assemblies, and describe its application to the optimization and comparison of assembly algorithms. Our methods are implemented in software that is freely available at http://bio.math.berkeley.edu/cgal/.
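The idea of scoring an assembly by likelihood, without a ground truth, can be illustrated with a toy example. The sketch below is not CGAL's model or implementation, which this abstract does not describe. It assumes a deliberately simple generative model: each read starts at a uniformly chosen position in the assembly, and each base is independently substituted with a fixed probability. The function names and the `error_rate` parameter are hypothetical.

```python
import math


def read_log_likelihood(read, assembly, error_rate=0.01):
    """log P(read | assembly) under the toy model (assumption, not CGAL's model).

    The read is assumed to start at a uniformly random position, and each
    base is miscalled independently with probability error_rate, choosing
    one of the three other bases uniformly.
    """
    n_positions = len(assembly) - len(read) + 1
    if n_positions <= 0:
        return float("-inf")  # the read cannot be placed on the assembly

    log_match = math.log(1.0 - error_rate)
    log_mismatch = math.log(error_rate / 3.0)

    # Log-probability of producing the read from each possible start position.
    terms = []
    for start in range(n_positions):
        window = assembly[start:start + len(read)]
        mismatches = sum(a != b for a, b in zip(read, window))
        matches = len(read) - mismatches
        terms.append(matches * log_match + mismatches * log_mismatch)

    # Sum over start positions in log space (log-sum-exp), then apply the
    # uniform prior 1 / n_positions over start positions.
    peak = max(terms)
    log_sum = peak + math.log(sum(math.exp(t - peak) for t in terms))
    return log_sum - math.log(n_positions)


def assembly_log_likelihood(reads, assembly, error_rate=0.01):
    """Sum of per-read log-likelihoods, treating reads as independent."""
    return sum(read_log_likelihood(r, assembly, error_rate) for r in reads)


if __name__ == "__main__":
    genome = "ACGTACGGTCAATGC"           # made-up sequence for illustration
    reads = [genome[i:i + 6] for i in range(0, len(genome) - 5, 3)]
    candidates = {
        "complete": genome,
        "truncated": genome[:10],        # assembly missing the last 5 bases
    }
    for name, assembly in candidates.items():
        print(name, assembly_log_likelihood(reads, assembly))
```

Under this toy model, the complete assembly explains every read without mismatches, while the truncated one must explain the reads from the missing region as sequencing errors, so its log-likelihood is lower. No reference genome is consulted at any point; only the reads and the candidate assemblies are used.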
Similar articles
- De novo likelihood-based measures for comparing genome assemblies. BMC Res Notes. 2013 Aug 22;6:334. doi: 10.1186/1756-0500-6-334. PMID: 23965294. Free PMC article.
- SWALO: scaffolding with assembly likelihood optimization. Nucleic Acids Res. 2021 Nov 18;49(20):e117. doi: 10.1093/nar/gkab717. PMID: 34417615. Free PMC article.
- Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. doi: 10.1093/bioinformatics/btm542. Epub 2007 Dec 5. PMID: 18057021.
- SuRankCo: supervised ranking of contigs in de novo assemblies. BMC Bioinformatics. 2015 Jul 30;16:240. doi: 10.1186/s12859-015-0644-7. PMID: 26224355. Free PMC article.
- OSLay: optimal syntenic layout of unfinished assemblies. Bioinformatics. 2007 Jul 1;23(13):1573-9. doi: 10.1093/bioinformatics/btm153. Epub 2007 Apr 26. PMID: 17463020.
Cited by
- GAML: genome assembly by maximum likelihood. Algorithms Mol Biol. 2015 Jun 3;10:18. doi: 10.1186/s13015-015-0052-6. eCollection 2015. PMID: 26042154. Free PMC article.
- Host-Associated Genomic Features of the Novel Uncultured Intracellular Pathogen Ca. Ichthyocystis Revealed by Direct Sequencing of Epitheliocysts. Genome Biol Evol. 2016 Jun 13;8(6):1672-89. doi: 10.1093/gbe/evw111. PMID: 27190004. Free PMC article.
- De novo likelihood-based measures for comparing genome assemblies. BMC Res Notes. 2013 Aug 22;6:334. doi: 10.1186/1756-0500-6-334. PMID: 23965294. Free PMC article.
- A molecular portrait of maternal sepsis from Byzantine Troy. Elife. 2017 Jan 10;6:e20983. doi: 10.7554/eLife.20983. PMID: 28072390. Free PMC article.
- Theoretical Analysis of Sequencing Bioinformatics Algorithms and Beyond. Commun ACM. 2023 Jul;66(7):118-125. doi: 10.1145/3571723. Epub 2023 Jun 22. PMID: 38736702. Free PMC article. No abstract available.