Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 14;17(1):308.
doi: 10.1186/s13104-024-06951-0.

Sequence-matching adapter trimmers generate consistent quality and assembly metrics for Illumina sequencing of RNA viruses

Affiliations

Sequence-matching adapter trimmers generate consistent quality and assembly metrics for Illumina sequencing of RNA viruses

Grace Nabakooza et al. BMC Res Notes. .

Abstract

Trimming adapters and low-quality bases from next-generation sequencing (NGS) data is crucial for optimal analysis. We evaluated six trimming programs, implementing five different algorithms, for their effectiveness in trimming adapters and improving quality, contig assembly, and single-nucleotide polymorphism (SNP) quality and concordance for poliovirus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and norovirus paired data sequenced on Illumina iSeq and MiSeq platforms. Trimmomatic and BBDuk effectively removed adapters from all datasets, unlike FastP, AdapterRemoval, SeqPurge, and Skewer. All trimmers improved read quality (Q ≥ 30, 87.8 - 96.1%) compared to raw reads (83.6 - 93.2%). Trimmers implementing traditional sequence-matching (Trimmomatic and AdapterRemoval) and overlapping algorithm (FastP) retained the highest-quality reads. While all trimmers improved the maximum contig length and genome coverage for iSeq and MiSeq viral assemblies, BBDuk-trimmed reads assembled the shortest contigs. SNP concordance was consistently high (> 97.7 - 100%) across trimmers. However, BBDuk-trimmed reads had the lowest quality SNPs. Overall, the two adapter trimmers that utilized the traditional sequence-matching algorithm performed consistently across the viral datasets analyzed. Our findings guide software selection and inform future versatile trimmer development for viral genome analysis.

Keywords: Adapter trimming; De novo assembly; Illumina; Next-generation sequencing; Quality control; RNA viruses.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

The authors declare no conflicting interests.

Figures

Fig. 1
Fig. 1
Differences in percentage residual adapters between iSeq vs. MiSeq poliovirus, severe acute respiratory syndrome coronavirus 2, and norovirus reads, grouped by the trimmer. The median is shown as black bars, while the gray bars show the mean and standard error interval. Only statistically significant differences are indicated with corresponding p-values. Abbreviations: Polio is for poliovirus, Noro for norovirus, and SC2 for severe acute respiratory syndrome coronavirus 2
Fig. 2
Fig. 2
Mean (average) read lengths of raw and trimmed iSeq (red) and MiSeq (blue) reads for poliovirus (A), severe acute respiratory syndrome coronavirus 2 (B), and norovirus (C) datasets. Abbreviations: SC2 for severe acute respiratory syndrome coronavirus 2

References

    1. Gargis AS, Kalman L, Lubin IM. Assuring the quality of Next-Generation Sequencing in Clinical Microbiology and Public Health Laboratories. J Clin Microbiol. 2016;54(12):2857–65. - PMC - PubMed
    1. Kanzi AM, San JE, Chimukangara B, Wilkinson E, Fish M, Ramsuran V et al. Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance. Front Genet [Internet]. 2020 [cited 2023 Jul 17];11. https://www.frontiersin.org/articles/10.3389/fgene.2020.544162 - PMC - PubMed
    1. Maljkovic Berry I, Melendrez MC, Bishop-Lilly KA, Rutvisuttinunt W, Pollett S, Talundzic E, et al. Next generation sequencing and Bioinformatics Methodologies for Infectious Disease Research and Public Health: approaches, applications, and considerations for development of Laboratory Capacity. J Infect Dis. 2020;221(Supplement3):S292–307. - PubMed
    1. Nabakooza G, Owuor DC, de Laurent ZR, Galiwango R, Owor N, Kayiwa JT, et al. Phylogenomic analysis uncovers a 9-year variation of Uganda influenza type-A strains from the WHO-recommended vaccines and other Africa strains. Sci Rep. 2023;13(1):5516. - PMC - PubMed
    1. Liu L, Li Y, Li S, Hu N, He Y, Pong R, et al. Comparison of next-generation sequencing systems. BioMed Res Int. 2012;2012:e251364. - PMC - PubMed

LinkOut - more resources