Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014;15 Suppl 9(Suppl 9):S10.
doi: 10.1186/1471-2164-15-S9-S10. Epub 2014 Dec 8.

Assessment of de novo assemblers for draft genomes: a case study with fungal genomes

Assessment of de novo assemblers for draft genomes: a case study with fungal genomes

Mostafa M Abbas et al. BMC Genomics. 2014.

Abstract

Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task.

Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated.

Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison for N50 size metric for the studied assemblers at contigs level. (a): BcDw1, (b): UCRNP2, (c): UCRPA7, (d): UCREL1, (e): PST21.
Figure 2
Figure 2
Percentage of CEGs in four conservation groups for all assemblies at contigs level. CEGs Mapping results of the seven assemblers outputs and the current draft genome(df_1) at contigs level for the studied datasets in the four groups of core genes. (a): BcDw1, (b): UCRNP2, (c): UCRPA7, (d): UCREL1, (e): PST21.
Figure 3
Figure 3
Percentage of CEGs in four conservation groups for all assemblies at the scaffolds level. CEGs Mapping results of all assemblers outputs (except Minia) and the current draft genome(df_1) at scaffolds level for the studied datasets in the four groups of core genes. (a): BcDw1, (b): UCRNP2, (c): UCRPA7, (d): UCREL1, (e): PST21. In PST21 dataset, the df_1 represents the assemblies at the contigs level because we do not have scaffolds level.

References

    1. Wu Y, Gao B, Zhu S. Fungal defensins, an emerging source of anti-infective drugs. Chinese Science Bulletin. 2014;59(10):931–935.
    1. Galagan JE, Henn MR, Ma LJ, Cuomo CA, Birren B. Genomics of the fungal kingdom: insights into eukaryotic biology. Genome Res. 2005;15(12):1620–1631. - PubMed
    1. Birren B, Fink G, Lander E. Fungal genome initiative: a white paper for fungal comparative genomics. Center for Genome Research, Cambridge. 2003.
    1. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z. et al.Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005;437(7057):376–380. - PMC - PubMed
    1. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu SM, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam TW, Wang J. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Giga Science. 2012;1(1):18. - PMC - PubMed

LinkOut - more resources