Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Mar;71(1):90-101.
doi: 10.1111/biom.12252. Epub 2014 Oct 15.

A joint model for multistate disease processes and random informative observation times, with applications to electronic medical records data

Affiliations

A joint model for multistate disease processes and random informative observation times, with applications to electronic medical records data

Jane M Lange et al. Biometrics. 2015 Mar.

Abstract

Multistate models are used to characterize individuals' natural histories through diseases with discrete states. Observational data resources based on electronic medical records pose new opportunities for studying such diseases. However, these data consist of observations of the process at discrete sampling times, which may either be pre-scheduled and non-informative, or symptom-driven and informative about an individual's underlying disease status. We have developed a novel joint observation and disease transition model for this setting. The disease process is modeled according to a latent continuous-time Markov chain; and the observation process, according to a Markov-modulated Poisson process with observation rates that depend on the individual's underlying disease status. The disease process is observed at a combination of informative and non-informative sampling times, with possible misclassification error. We demonstrate that the model is computationally tractable and devise an expectation-maximization algorithm for parameter estimation. Using simulated data, we show how estimates from our joint observation and disease transition model lead to less biased and more precise estimates of the disease rate parameters. We apply the model to a study of secondary breast cancer events, utilizing mammography and biopsy records from a sample of women with a history of primary breast cancer.

Keywords: Disease process; Electronic medical records; Informative observations; Markov-modulated Poisson process; Multistate model; Panel data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A. Example of a joint informative observation and disease process, Y (t) = (X (t), N(t)). B. The informative observation time process and the disease process observed at DDO and scheduled times. C. Same as B, with misclassification error.
Figure 2
Figure 2
Simulation results demonstrating bias that occurs when informative visit times are ignored. Data were simulated from discretely observed 2-state standard and latent CTMC multistate-DDO models on the interval t=[0,8] at DDO times or a combination of DDO and scheduled visits (See Web Appendix Figure D-1 and Table D-1 for simulation details). Data were fit with correctly specified multistate-DDO models and incorrectly specified panel models. Box plots/functional box plots are shown for hazard estimates of H →D and D →H transitions from both DDO and panel models. The different DDO rates in the model states varied across simulations, with more discrepant rates inducing more bias under model misspecification. A. DDO rates are qD = 2, qH = .25; data also included fixed observation times t = (0, 2, 4, 6, 8). B. DDO rates are qD = 2, qH = .25. C. DDO rates are qD = .35, qH = .25. D. DDO rates are qH = .25 and qD = 2.
Figure 3
Figure 3
SBCE competing risks disease models. A. Standard CTMC, where H=healthy, C=contralateral SBCE, I=ipsilateral SBCE, and D=death before SBCE. B. Latent CTMC with Coxian structure. States H1 and H2 map to the healthy state.
Figure 4
Figure 4
Estimated cumulative incidence for ipsilateral and contralateral SBCEs and death, via empirical estimates of the diagnosis times or using the BIC-selected multistate-DDO model (Web Appendix Table E-2, model 6). The bands are point-wise standard errors. Abbreviations: Dx empirical=empirical estimate of cumulative incidence of diagnosed SBCE events; SE=standard error.

Similar articles

Cited by

References

    1. Aalen OO. Phase type distributions in survival analysis. Scandinavian Journal of Statistics. 1995;22:447–463.
    1. Andersen PK, Keiding N. Multi-state models for event history analysis. Statistical Methods in Medical Research. 2002;11:91–115. - PubMed
    1. Andreetta C, Smith I. Adjuvant endocrine therapy for early breast cancer. Cancer letters. 2007;251:17–27. - PubMed
    1. Baum L, Petrie T, Soules G, Weiss N. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics. 1970;41:164–171.
    1. Boer R, Plevritis S, Clarke L. Diversity of model approaches for breast cancer screening: a review of model assumptions by the Cancer Intervention and Surveillance Network (CISNET) Breast Cancer Groups. Statistical Methods in Medical Research. 2004;13:525–538. - PubMed

Publication types

MeSH terms