Identification of Somatic Mutations From Bulk and Single-Cell Sequencing Data
- PMID: 35822012
- PMCID: PMC9261417
- DOI: 10.3389/fragi.2021.800380
Identification of Somatic Mutations From Bulk and Single-Cell Sequencing Data
Abstract
Somatic mutations are DNA variants that occur after the fertilization of zygotes and accumulate during the developmental and aging processes in the human lifespan. Somatic mutations have long been known to cause cancer, and more recently have been implicated in a variety of non-cancer diseases. The patterns of somatic mutations, or mutational signatures, also shed light on the underlying mechanisms of the mutational process. Advances in next-generation sequencing over the decades have enabled genome-wide profiling of DNA variants in a high-throughput manner; however, unlike germline mutations, somatic mutations are carried only by a subset of the cell population. Thus, sensitive bioinformatic methods are required to distinguish mutant alleles from sequencing and base calling errors in bulk tissue samples. An alternative way to study somatic mutations, especially those present in an extremely small number of cells or even in a single cell, is to sequence single-cell genomes after whole-genome amplification (WGA); however, it is critical and technically challenging to exclude numerous technical artifacts arising during error-prone and uneven genome amplification in current WGA methods. To address these challenges, multiple bioinformatic tools have been developed. In this review, we summarize the latest progress in methods for identification of somatic mutations and the challenges that remain to be addressed in the future.
Keywords: bioinformatic tool; bulk sequencing; single-cell sequencing; single-nucleotide variant; somatic mutation.
Copyright © 2022 Huang and Lee.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures

Similar articles
-
Somatic variant calling from single-cell DNA sequencing data.Comput Struct Biotechnol J. 2022 Jun 14;20:2978-2985. doi: 10.1016/j.csbj.2022.06.013. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 35782734 Free PMC article. Review.
-
Inspecting Targeted Deep Sequencing of Whole Genome Amplified DNA Versus Fresh DNA for Somatic Mutation Detection: A Genetic Study in Myelodysplastic Syndrome Patients.Biopreserv Biobank. 2017 Aug;15(4):360-365. doi: 10.1089/bio.2016.0094. Epub 2017 Jun 6. Biopreserv Biobank. 2017. PMID: 28586236
-
FIREVAT: finding reliable variants without artifacts in human cancer samples using etiologically relevant mutational signatures.Genome Med. 2019 Dec 17;11(1):81. doi: 10.1186/s13073-019-0695-x. Genome Med. 2019. PMID: 31847917 Free PMC article.
-
Somatic and Germline Variant Calling from Next-Generation Sequencing Data.Adv Exp Med Biol. 2022;1361:37-54. doi: 10.1007/978-3-030-91836-1_3. Adv Exp Med Biol. 2022. PMID: 35230682
-
Direct mutation analysis by high-throughput sequencing: from germline to low-abundant, somatic variants.Mutat Res. 2012 Jan 3;729(1-2):1-15. doi: 10.1016/j.mrfmmm.2011.10.001. Epub 2011 Oct 12. Mutat Res. 2012. PMID: 22016070 Free PMC article. Review.
Cited by
-
Somatic cancer driver mutations are enriched and associated with inflammatory states in Alzheimer's disease microglia.bioRxiv [Preprint]. 2024 Jan 4:2024.01.03.574078. doi: 10.1101/2024.01.03.574078. bioRxiv. 2024. PMID: 38260600 Free PMC article. Preprint.
-
Detecting somatic variants in purified brain DNA obtained from surgically implanted depth electrodes in epilepsy.Epilepsia. 2025 Apr;66(4):1234-1249. doi: 10.1111/epi.18251. Epub 2025 Jan 3. Epilepsia. 2025. PMID: 39751777 Free PMC article.
-
Chasing Sequencing Perfection: Marching Toward Higher Accuracy and Lower Costs.Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2):qzae024. doi: 10.1093/gpbjnl/qzae024. Genomics Proteomics Bioinformatics. 2024. PMID: 38991976 Free PMC article. Review.
-
Cell-by-Cell: Unlocking Lung Cancer Pathogenesis.Cancers (Basel). 2022 Jul 14;14(14):3424. doi: 10.3390/cancers14143424. Cancers (Basel). 2022. PMID: 35884485 Free PMC article. Review.
-
MAST: a hybrid Multi-Agent Spatio-Temporal model of tumor microenvironment informed using a data-driven approach.Bioinform Adv. 2022 Dec 5;2(1):vbac092. doi: 10.1093/bioadv/vbac092. eCollection 2022. Bioinform Adv. 2022. PMID: 36699399 Free PMC article.
References
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources