Sequali: efficient and comprehensive quality control of short- and long-read sequencing data
- PMID: 39927290
- PMCID: PMC11802474
- DOI: 10.1093/bioadv/vbaf010
Sequali: efficient and comprehensive quality control of short- and long-read sequencing data
Abstract
Motivation: Quality control of sequencing data is the first step in many sequencing workflows. Short- and long-read sequencing technologies have many commonalities with regard to quality control. Several quality control programs exist; however, none possess a feature set that is adequate for both technologies. Quality control programs aimed at Oxford Nanopore Technologies sequencing lack vital features, such as adapter searching, overrepresented sequence analysis, and duplication analysis.
Results: Sequali was developed to provide sequencing quality control for both short- and long-read sequencing technologies. It features adapter search, overrepresented sequence analysis, and duplication analysis and supports FASTQ and uBAM inputs. It is significantly faster than comparable sequencing quality control programs for both short- and long-read sequencing technologies.
Availability and implementation: Sequali is an open-source Python application using C extensions and is freely available under the AGPL-3.0 license at https://github.com/rhpvorderman/sequali. The source code for each release is archived at zenodo: https://zenodo.org/doi/10.5281/zenodo.10822485.
© The Author(s) 2025. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures
References
-
- Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (12 August 2024, date last accessed).
LinkOut - more resources
Full Text Sources
