A stochastic Markov chain model to describe lung cancer growth and metastasis
- PMID: 22558094
- PMCID: PMC3338733
- DOI: 10.1371/journal.pone.0034637
A stochastic Markov chain model to describe lung cancer growth and metastasis
Abstract
A stochastic Markov chain model for metastatic progression is developed for primary lung cancer based on a network construction of metastatic sites with dynamics modeled as an ensemble of random walkers on the network. We calculate a transition matrix, with entries (transition probabilities) interpreted as random variables, and use it to construct a circular bi-directional network of primary and metastatic locations based on postmortem tissue analysis of 3827 autopsies on untreated patients documenting all primary tumor locations and metastatic sites from this population. The resulting 50 potential metastatic sites are connected by directed edges with distributed weightings, where the site connections and weightings are obtained by calculating the entries of an ensemble of transition matrices so that the steady-state distribution obtained from the long-time limit of the Markov chain dynamical system corresponds to the ensemble metastatic distribution obtained from the autopsy data set. We condition our search for a transition matrix on an initial distribution of metastatic tumors obtained from the data set. Through an iterative numerical search procedure, we adjust the entries of a sequence of approximations until a transition matrix with the correct steady-state is found (up to a numerical threshold). Since this constrained linear optimization problem is underdetermined, we characterize the statistical variance of the ensemble of transition matrices calculated using the means and variances of their singular value distributions as a diagnostic tool. We interpret the ensemble averaged transition probabilities as (approximately) normally distributed random variables. The model allows us to simulate and quantify disease progression pathways and timescales of progression from the lung position to other sites and we highlight several key findings based on the model.
Conflict of interest statement
Figures

Normal distribution with sample mean (0.15115) and variance (0.01821) is shown as overlay.
Normal distribution with sample mean (0.13165) and variance (0.01953) is shown as overlay.
each conditioned on the same initial matrix
The average convergence curve is shown, along with standard deviations marked along each decade showing the spread associated with the convergence rates.
through
showing linear decrease with exponent
The 27 non-zero singular values reflect the fact that there are 27 entries in the steady-state target distribution for primary lung cancer. The two diamond shaped data points are the two singular values associated with the initial matrix
The 27 ‘asterix’ data points are those obtained from a trained matrix using a perturbed
with Rank 2 perturbation. See text for details.References
-
- Ashworth T. A case of cancer in which cells similar to those in the tumors were seen in the blood after death. Australian Medical Journal. 1869;14:146.
-
- Fidler I. The pathogenesis of cancer metastasis: the ‘seed and soil’ hypothesis revisited. Nat Rev Cancer. 2003;3:453–458. - PubMed
-
- Paget S. The distribution of secondary growths in cancer of the breast. Lancet. 1889;1:571–573. - PubMed
-
- Weinberg R. Garland Science; 2006. The Biology of Cancer.
-
- Ewing J. W.B. Saunders, 6th Ed; 1929. Neoplastic Diseases: A Textbook on Tumors.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
