AssemblyQC: a Nextflow pipeline for reproducible reporting of assembly quality
- PMID: 39078114
- PMCID: PMC11333564
- DOI: 10.1093/bioinformatics/btae477
AssemblyQC: a Nextflow pipeline for reproducible reporting of assembly quality
Abstract
Summary: Genome assembly projects have grown exponentially due to breakthroughs in sequencing technologies and assembly algorithms. Evaluating the quality of genome assemblies is critical to ensure the reliability of downstream analysis and interpretation. To fulfil this task, we have developed the AssemblyQC pipeline that performs file-format validation, contaminant checking, contiguity measurement, gene- and repeat-space completeness quantification, telomere inspection, taxonomic assignment, synteny alignment, scaffold examination through Hi-C contact-map visualization, and assessments of completeness, consensus quality and phasing through k-mer analysis. It produces a comprehensive HTML report with method descriptions, tables, and visualizations.
Availability and implementation: The pipeline uses Nextflow for workflow orchestration and adheres to the best-practice established by the nf-core community. This pipeline offers a reproducible, scalable, and portable method to assess the quality of genome assemblies-the code is available online at GitHub: https://github.com/Plant-Food-Research-Open/assemblyqc.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
None declared.
References
-
- Agarwal T, Suravajhala R, Bhushan M. et al. Recent Advances in Gene and Genome Assembly: Challenges and Implications. Advances in Synthetic Biology2020:199–220.
-
- Andrews S. FastQC: A Quality Control Tool for High Throughput Sequence Data. Cambridge, United Kingdom: Babraham Bioinformatics, Babraham Institute. 2010.
-
- Brown M, González De la Rosa PM, Mark B.. A Telomere Identification Toolkit. Zenodo (2023). 10.5281/zenodo.10091385. Code repository: https://github.com/tolkit/telomeric-identifier. - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
