Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 5;51(W1):W601-W606.
doi: 10.1093/nar/gkad406.

WebQUAST: online evaluation of genome assemblies

Affiliations

WebQUAST: online evaluation of genome assemblies

Alla Mikheenko et al. Nucleic Acids Res. .

Abstract

Selecting proper genome assembly is key for downstream analysis in genomics studies. However, the availability of many genome assembly tools and the huge variety of their running parameters challenge this task. The existing online evaluation tools are limited to specific taxa or provide just a one-sided view on the assembly quality. We present WebQUAST, a web server for multifaceted quality assessment and comparison of genome assemblies based on the state-of-the-art QUAST tool. The server is freely available at https://www.ccb.uni-saarland.de/quast/. WebQUAST can handle an unlimited number of genome assemblies and evaluate them against a user-provided or pre-loaded reference genome or in a completely reference-free fashion. We demonstrate key WebQUAST features in three common evaluation scenarios: assembly of an unknown species, a model organism, and a close variant of it.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
WebQUAST text reports for E. coli assemblies in the (A) reference-free and (B) reference-based evaluation mode. Unless otherwise noted, all statistics are based on contigs of size ≥ 500 bp (the default cut-off). Heatmap highlights the best value in each row which could be the largest or the smallest number depending on the quality metric. Heatmap is not used for # contigs and GC (%) due to the ambiguity of these metrics trends.
Figure 2.
Figure 2.
Icarus viewers for E. coli assemblies aligned against (A) the reference genome matching the dataset and (B) a close reference. The reference regions between 0.5 Mb and 0.7 Mb are shown. mis: X + Y stands for the total number of extensive (X) and local (Y) misassemblies per assembly. Correctly assembled contigs are colored green and aquamarine (if longer than 10 kb and similar in at least three assemblies), and fragments of misassembled contigs are colored pink and orange (if similar in at least three assemblies). Red triangles designate the sides of alignment breakpoints for misassembled contigs. Contig names are shown for contigs of sufficient size.

References

    1. Van Dijk E.L., Jaszczyszyn Y., Naquin D., Thermes C.. The third revolution in sequencing technology. Trends Genet. 2018; 34:666–681. - PubMed
    1. Sohn J.-i., Nam J.-W.. The present and future of de novo whole-genome assembly. Brief. Bioinform. 2018; 19:23–40. - PubMed
    1. Lloret-Villas A., Bhati M., Kadri N.K., Fries R., Pausch H.. Investigating the impact of reference assembly choice on genomic analyses in a cattle breed. BMC Genomics. 2021; 22:1–17. - PMC - PubMed
    1. Salzberg S.L., Phillippy A.M., Zimin A., Puiu D., Magoc T., Koren S., Treangen T.J., Schatz M.C., Delcher A.L., Roberts M.et al. .. GAGE: a critical evaluation of genome assemblies and assembly algorithms. Genome Res. 2012; 22:557–567. - PMC - PubMed
    1. Hunt M., Kikuchi T., Sanders M., Newbold C., Berriman M., Otto T.D.. REAPR: a universal tool for genome assembly evaluation. Genome Biol. 2013; 14:1–10. - PMC - PubMed

Publication types