Nat Biotechnol. 2024 Mar;42(3):367-370.
doi: 10.1038/s41587-023-02100-3.

Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy

Delphine Larivière et al. Nat Biotechnol. 2024 Mar.
No abstract available

Conflict of interest statement

Competing interests

The authors declare no competing interests.

Figures

Fig. 1 | VGP–Galaxy assembly pipeline (version 2.1) consists of 10 workflows that can be combined into 8 analysis trajectories depending on the combination of input data.
A decision on whether to invoke workflow 6 is based on the analysis of QC output of workflows 3, 4 or 5 (see Supplementary Information for full explanation). Thicker lines connecting workflows 7, 8 and 9 reflect the fact that these workflows are invoked separately for each phased assembly (once for maternal and once for paternal).
Fig. 2 | Phylogenetic tree and assembly statistics of genomes assembled using the VGP–Galaxy assembly pipeline.
From the innermost circle to the outermost circle: (i) repeat content; (ii) heterozygosity; (iii) heterogamy: individuals with two identical sex chromosomes (white) or two different sex chromosomes (blue); (iv) assembly size as a percentage of the genome size estimated by GenomeScope; (v) scaffold NG50 as a percentage of the estimated genome size; (vi) Merqury completeness of both haplotypes; (vii) BUSCO completeness: orthologous genes present and complete relative to the set expected in vertebrates; (viii) mitogenome assembled and available (black); (ix) genome size in gigabases, with lines at 1, 2, 3, 4, 6 and 8 Gb; (x) number of scaffolds on a log10 scale, with lines at 1 (10 scaffolds), 2 (100 scaffolds), 3 (1,000 scaffolds) and 4 (10,000 scaffolds).
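
The scaffold NG50 statistic in ring (v) can be illustrated with a minimal Python sketch. It uses the standard definition of NG50 (the scaffold length at which the cumulative length of scaffolds, sorted from longest to shortest, first reaches half of the estimated genome size) and then expresses it as a percentage of that estimate. The function names and the toy scaffold lengths are hypothetical and are not taken from the VGP–Galaxy workflows, which compute this statistic with their own tools.

```python
def ng50(scaffold_lengths, estimated_genome_size):
    """Return the scaffold NG50, or None if the assembly never covers
    half of the estimated genome size."""
    half = estimated_genome_size / 2
    running_total = 0
    # Walk scaffolds from longest to shortest, accumulating length.
    for length in sorted(scaffold_lengths, reverse=True):
        running_total += length
        if running_total >= half:
            return length
    return None


def ng50_percent(scaffold_lengths, estimated_genome_size):
    """Express NG50 as a percentage of the estimated genome size."""
    value = ng50(scaffold_lengths, estimated_genome_size)
    if value is None:
        return None
    return 100 * value / estimated_genome_size


# Hypothetical toy assembly: four scaffolds, estimated genome size 1,200 bp.
toy_lengths = [400, 300, 200, 100]
print(ng50(toy_lengths, 1200))          # 300
print(ng50_percent(toy_lengths, 1200))  # 25.0
```

Unlike N50, which is computed against the total assembly length, NG50 is computed against the estimated genome size, so values remain comparable between assemblies of different completeness.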

References

    1. Hotaling S, Kelley JL & Frandsen PB Proc. Natl Acad. Sci. USA 118, e2109019118 (2021).
    2. Formenti G et al. Trends Ecol. Evol. 37, 197–202 (2022).
    3. Theissinger K et al. Trends Genet. 39, 545–559 (2023).
    4. Lewin HA et al. Proc. Natl Acad. Sci. USA 119, e2115635118 (2022).
    5. Rhie A, Walenz BP, Koren S & Phillippy AM Genome Biol. 21, 245 (2020).