Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Apr 1;35(7):1204-1212.
doi: 10.1093/bioinformatics/bty769.

Mixed graphical models for integrative causal analysis with application to chronic lung disease diagnosis and prognosis

Affiliations

Mixed graphical models for integrative causal analysis with application to chronic lung disease diagnosis and prognosis

Andrew J Sedgewick et al. Bioinformatics. .

Abstract

Motivation: Integration of data from different modalities is a necessary step for multi-scale data analysis in many fields, including biomedical research and systems biology. Directed graphical models offer an attractive tool for this problem because they can represent both the complex, multivariate probability distributions and the causal pathways influencing the system. Graphical models learned from biomedical data can be used for classification, biomarker selection and functional analysis, while revealing the underlying network structure and thus allowing for arbitrary likelihood queries over the data.

Results: In this paper, we present and test new methods for finding directed graphs over mixed data types (continuous and discrete variables). We used this new algorithm, CausalMGM, to identify variables directly linked to disease diagnosis and progression in various multi-modal datasets, including clinical datasets from chronic obstructive pulmonary disease (COPD). COPD is the third leading cause of death and a major cause of disability and thus determining the factors that cause longitudinal lung function decline is very important. Applied on a COPD dataset, mixed graphical models were able to confirm and extend previously described causal effects and provide new insights on the factors that potentially affect the longitudinal lung function decline of COPD patients.

Availability and implementation: The CausalMGM package is available on http://www.causalmgm.org.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Schematic view of CausalMGM and its applications
Fig. 2.
Fig. 2.
Precision-Recall curves of edge direction recovery on high-dimensional dataset. (A) Full range of algorithms and edge types. (B) Detail view of CPC-stable and MGM-CPC-stable performance averaged over all edge types. Parameter range: 0.2 ≤ λ ≤ 0.8; 0.01 ≤ α ≤ 0.1. Bars correspond to one standard error
Fig. 3.
Fig. 3.
Structural Hamming Distance on high dimensional dataset for CPC-stable and MGM-CPCS. The lower the SHD, the closer the predicted graph is to the true graph. Parameter range: 0.2 ≤ λ ≤ 0.8 and 0.01 ≤ α ≤ 0.1
Fig. 4.
Fig. 4.
Average running times with 95% confidence interval error bars of search algorithms on high dimensional data. Each row of columns corresponds to a different setting of α and each column corresponds to a different setting of λ. Directed search steps were run in parallel on a 4-core laptop
Fig. 5.
Fig. 5.
First and second neighbors of 2-year lung function decline, measured as FEV1 Progression. The variables that most influence the FEV1 progression are smoking status, creatinine and TNFα blood levels, pulmonary artery enlargement, history of GERD, systolic BP after exercise and four spirometry variables (% change in FEV1 before and after bronchodilators, best percent predicted FVC, best percent predicted FRC, and PIF)

Similar articles

Cited by

References

    1. Agusti A. et al. (2011) Addressing the complexity of chronic obstructive pulmonary disease: from phenotypes and biomarkers to scale-free networks, systems biology, and P4 medicine. Am. J. Respir. Crit. Care Med., 183, 1129–1137. - PubMed
    1. Anthonisen N.R. et al. (2002) Smoking and lung function of Lung Health Study participants after 11 years. Am. J. Respir. Crit. Care Med., 166, 675–679. - PubMed
    1. Anttila S. et al. (2001) CYP1A1 levels in lung tissue of tobacco smokers and polymorphisms of CYP1A1 and aromatic hydrocarbon receptor. Pharmacogenetics, 11, 501–509. - PubMed
    1. Baumgartner K.B. et al. (1997) Cigarette smoking: a risk factor for idiopathic pulmonary fibrosis. Am. J. Respir. Crit. Care Med., 155, 242–248. - PubMed
    1. Bøttcher S.G. (2001) Learning Bayesian networks with mixed variables. In: Eighth International Workshop on Artificial Intelligence and Statistics. Key West, Florida, 149–156.

Publication types