SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery
- PMID: 38065078
- PMCID: PMC10861363
- DOI: 10.1016/j.cell.2023.10.028
SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery
Abstract
Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), which directly analyzes raw sequencing data, using a statistical test to detect a signature of regulation: sample-specific sequence variation. SPLASH detects many types of variation and can be efficiently run at scale. We show that SPLASH identifies complex mutation patterns in SARS-CoV-2, discovers regulated RNA isoforms at the single-cell level, detects the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes: geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a unifying approach to genomic analysis that enables expansive discovery without metadata or references.
Keywords: RNA-seq; computational biology; genetics; genomics; reference-free; single-cell RNA-seq; splicing; statistics.
Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests K.C., T.Z.B., and J.S. are inventors on provisional patents related to this work.
Figures







Update of
-
SPLASH: a statistical, reference-free genomic algorithm unifies biological discovery.bioRxiv [Preprint]. 2023 Jul 31:2022.06.24.497555. doi: 10.1101/2022.06.24.497555. bioRxiv. 2023. Update in: Cell. 2023 Dec 7;186(25):5440-5456.e26. doi: 10.1016/j.cell.2023.10.028. PMID: 35794890 Free PMC article. Updated. Preprint.
-
[WITHDRAWN] SPLASH: a statistical, reference-free genomic algorithm unifies biological discovery.bioRxiv [Preprint]. 2023 Jul 31:2023.07.17.549408. doi: 10.1101/2023.07.17.549408. bioRxiv. 2023. Update in: Cell. 2023 Dec 7;186(25):5440-5456.e26. doi: 10.1016/j.cell.2023.10.028. PMID: 37503014 Free PMC article. Updated. Preprint.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources