Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Apr;11(4):396-8.
doi: 10.1038/nmeth.2883. Epub 2014 Mar 16.

PyClone: statistical inference of clonal population structure in cancer

Affiliations

PyClone: statistical inference of clonal population structure in cancer

Andrew Roth et al. Nat Methods. 2014 Apr.

Abstract

We introduce PyClone, a statistical model for inference of clonal population structures in cancers. PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced by segmental copy-number changes and normal-cell contamination. Single-cell sequencing validation demonstrates PyClone's accuracy.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of clustering performance for the mixture of normal tissues dataset | We compare the infinite Binomial mixture model (IBMM); infinite Beta-Binomial mixture model (IBBMM); PyClone using binomial emission densities and total copy number (Bin-TCN) or parental copy number (Bin-PCN) prior; PyClone using Beta-Binomial emissions and the parental (BeBin-PCN) or total (BeBin-TCN) copy number prior. (a) Comparison of methods when analysing each mixture experiment separately and (b) analysing all four mixtures jointly. (c) Expected cellular prevalence of each cluster across the four mixture experiments. (d) Inferred cellular prevalences and clustering using the IBBMM model and (e) PyClone BeBin-PCN model to jointly analyse all four mixtures. Solid lines (d, e) indicate clusters for which SNVs are predominatly homozygous (BB) and dashed lines indicate clusters for which SNVs are predominatly heterozygous (AB), in the event an equal number of both types of SNVs is present the cluster is drawn as a solid line. (f) Variant allelic prevalence for mutations assigned to cluster 1 by PyClone BeBin-PCN model. Dashed lines represent heterozygous SNVs and solid lines represent homozygous SNVs. (a, b) Whiskers indicate 1.5 the interquartile range, red bars indicate the median, and boxes represent the interquartile range. (d, e) Error bars indicate the mean standard deviation of MCMC cellular prevalences estimates for mutations in a cluster. (d, e, f) The number of mutations n in each cluster is shown in the legend in parentheses.
Figure 2
Figure 2
Joint analysis of multiple samples from high grade serous ovarian cancer (HGSOC) 2 | The variant allelic prevalence for each mutation color coded by predicted cluster using the (a) IBBMM and (c) PyClone with BeBin-PCN model to jointly analyse the four samples. The inferred cellular prevalence for each cluster using the (b) IBBMM and (d) BeBin-PCN methods. As in Fig. 1 the cellular prevalence of the cluster is the mean value of the cellular prevalence of mutations in the cluster. (e) Presence or absence of variant allele at target loci in single cells from sample B. Loci with less than 40 reads covering them are coloured gray. Predicted clusters for each method are show on the left, with white cells indicating non-somatic control positions. (f) Presence or absence of IBBMM clusters in single cells from sample B. Clusters were deemed present if any mutation in the cluster was present. (b, d) Error bars indicate the mean standard deviation of MCMC cellular prevalences estimates for mutations in a cluster. The number of mutations n in each cluster is shown in the legend in parentheses.

References

    1. Nowell PC. Science. 1976;194:23–8. - PubMed
    1. Aparicio S, Caldas C. The New England journal of medicine. 2013;368:842–851. - PubMed
    1. Greaves M, Maley CC. Nature. 2012;481:306–313. - PMC - PubMed
    1. Shah SP, et al. Nature. 2012;486:395–9. - PMC - PubMed
    1. Ding L, et al. Nature. 2012;481:506–10. - PMC - PubMed

Publication types