Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 15;35(14):i398-i407.
doi: 10.1093/bioinformatics/btz392.

Inference of clonal selection in cancer populations using single-cell sequencing data

Affiliations

Inference of clonal selection in cancer populations using single-cell sequencing data

Pavel Skums et al. Bioinformatics. .

Abstract

Summary: Intra-tumor heterogeneity is one of the major factors influencing cancer progression and treatment outcome. However, evolutionary dynamics of cancer clone populations remain poorly understood. Quantification of clonal selection and inference of fitness landscapes of tumors is a key step to understanding evolutionary mechanisms driving cancer. These problems could be addressed using single-cell sequencing (scSeq), which provides an unprecedented insight into intra-tumor heterogeneity allowing to study and quantify selective advantages of individual clones. Here, we present Single Cell Inference of FItness Landscape (SCIFIL), a computational tool for inference of fitness landscapes of heterogeneous cancer clone populations from scSeq data. SCIFIL allows to estimate maximum likelihood fitnesses of clone variants, measure their selective advantages and order of appearance by fitting an evolutionary model into the tumor phylogeny. We demonstrate the accuracy our approach, and show how it could be applied to experimental tumor data to study clonal selection and infer evolutionary history. SCIFIL can be used to provide new insight into the evolutionary dynamics of cancer.

Availability and implementation: Its source code is available at https://github.com/compbel/SCIFIL.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Mutation tree
Fig. 2.
Fig. 2.
Depiction of the evolutionary model. Tree nodes represent mutation events whose times are marked on the time axis. Leafs represent the sampling event. For each node, the distribution of clone abundances after the corresponding event is shown
Fig. 3.
Fig. 3.
Example of simulated mutation tree
Fig. 4.
Fig. 4.
Performance of SCIFIL on simulated data with m mutations and fixed standard deviation of mutation rate. (Left): Mean relative accuracy of fitness estimation. (Right): Spearman correlation between true and inferred fitness vectors
Fig. 5.
Fig. 5.
Performance of SCIFIL on simulated data with m =50 mutations and different standard deviations of mutations rates. (Left) Mean relative accuracy of fitness estimation. (Right) Spearman correlation between true and inferred fitness vectors
Fig. 6.
Fig. 6.
Performance of SCIFIL on simulated data with different false negative error rates α and with mutation trees reconstructed by SCITE (Jahn et al., 2016). (Left) Mean relative accuracy of fitness estimation. (Right) Spearman correlation between true and inferred fitness vectors
Fig. 7.
Fig. 7.
(Left) Spearman correlation between true and inferred fitness vectors for QuasiFit and SCIFIL. (Right) Running time of SCIFIL
Fig. 8.
Fig. 8.
Fitness landscape and mutation tree for JAK2-negative myeloproliferative neoplasm (Hou et al., 2012) (left) and colorectal cancer (right) (Leung et al., 2017) inferred by SCIFIL. Colors represent fitness values and distance from each tree node to the root is approximately proportional to its time of appearance
Fig. 9.
Fig. 9.
Log-likelihoods of trees with and without recurrent mutations. (Left) Log-likelihoods produced by infSCITE. (Right) Evolutionary likelihoods produced by SCIFIL. Likelihoods of perfect phylogeny are shown in green. Purple and red: trees with the evolutionary likelihoods higher than for the perfect phylogeny

References

    1. Baron M. et al. (2016) A single-cell transcriptomic map of the human and mouse pancreas reveals inter-and intra-cell population structure. Cell Syst., 3, 346–360. - PMC - PubMed
    1. Beerenwinkel N. et al. (2007) Genetic progression and the waiting time to cancer. PLoS Comput. Biol., 3, e225.. - PMC - PubMed
    1. Bonavia R. et al. (2011) Heterogeneity maintenance in glioblastoma: a social network. Cancer Res., 71, 4055–4060. - PMC - PubMed
    1. Bozic I. et al. (2010) Accumulation of driver and passenger mutations during tumor progression. Proc. Natl. Acad. Sci. USA, 107, 18545–18550. - PMC - PubMed
    1. Crona K. et al. (2017) Inferring genetic interactions from comparative fitness data. eLife, 6, e28629. - PMC - PubMed

Publication types