Flexible parsing, interpretation, and editing of technical sequences with splitcode
- PMID: 38876979
- PMCID: PMC11193061
- DOI: 10.1093/bioinformatics/btae331
Flexible parsing, interpretation, and editing of technical sequences with splitcode
Abstract
Motivation: Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed.
Results: We present a tool called splitcode, that enables flexible and efficient parsing, interpreting, and editing of sequencing reads. This versatile tool facilitates simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays.
Availability and implementation: The splitcode program is available at http://github.com/pachterlab/splitcode.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures


Update of
-
Flexible parsing, interpretation, and editing of technical sequences with splitcode.bioRxiv [Preprint]. 2023 Dec 9:2023.03.20.533521. doi: 10.1101/2023.03.20.533521. bioRxiv. 2023. Update in: Bioinformatics. 2024 Jun 3;40(6):btae331. doi: 10.1093/bioinformatics/btae331. PMID: 36993532 Free PMC article. Updated. Preprint.
Similar articles
-
Flexible parsing, interpretation, and editing of technical sequences with splitcode.bioRxiv [Preprint]. 2023 Dec 9:2023.03.20.533521. doi: 10.1101/2023.03.20.533521. bioRxiv. 2023. Update in: Bioinformatics. 2024 Jun 3;40(6):btae331. doi: 10.1093/bioinformatics/btae331. PMID: 36993532 Free PMC article. Updated. Preprint.
-
TagDust2: a generic method to extract reads from sequencing data.BMC Bioinformatics. 2015 Jan 28;16:24. doi: 10.1186/s12859-015-0454-y. BMC Bioinformatics. 2015. PMID: 25627334 Free PMC article.
-
Btrim: a fast, lightweight adapter and quality trimming program for next-generation sequencing technologies.Genomics. 2011 Aug;98(2):152-3. doi: 10.1016/j.ygeno.2011.05.009. Epub 2011 May 30. Genomics. 2011. PMID: 21651976
-
Next-generation sequencing fragment library construction.Curr Protoc Mol Biol. 2014 Jul 1;107:7.17.1-7.17.16. doi: 10.1002/0471142727.mb0717s107. Curr Protoc Mol Biol. 2014. PMID: 24984855 Review.
-
Library preparation methods for next-generation sequencing: tone down the bias.Exp Cell Res. 2014 Mar 10;322(1):12-20. doi: 10.1016/j.yexcr.2014.01.008. Epub 2014 Jan 15. Exp Cell Res. 2014. PMID: 24440557 Review.
Cited by
-
Long-read sequencing transcriptome quantification with lr-kallisto.bioRxiv [Preprint]. 2025 Jan 29:2024.07.19.604364. doi: 10.1101/2024.07.19.604364. bioRxiv. 2025. PMID: 39071335 Free PMC article. Preprint.
-
kallisto, bustools, and kb-python for quantifying bulk, single-cell, and single-nucleus RNA-seq.bioRxiv [Preprint]. 2024 Jan 23:2023.11.21.568164. doi: 10.1101/2023.11.21.568164. bioRxiv. 2024. Update in: Nat Protoc. 2025 Mar;20(3):587-607. doi: 10.1038/s41596-024-01057-0. PMID: 38045414 Free PMC article. Updated. Preprint.
-
Flexiplex: a versatile demultiplexer and search tool for omics data.Bioinformatics. 2024 Mar 4;40(3):btae102. doi: 10.1093/bioinformatics/btae102. Bioinformatics. 2024. PMID: 38379414 Free PMC article.
-
simpleaf: a simple, flexible, and scalable framework for single-cell data processing using alevin-fry.Bioinformatics. 2023 Oct 3;39(10):btad614. doi: 10.1093/bioinformatics/btad614. Bioinformatics. 2023. PMID: 37802884 Free PMC article.
-
Accurate quantification of nascent and mature RNAs from single-cell and single-nucleus RNA-seq.Nucleic Acids Res. 2025 Jan 7;53(1):gkae1137. doi: 10.1093/nar/gkae1137. Nucleic Acids Res. 2025. PMID: 39657125 Free PMC article.