Mixed graphical models for integrative causal analysis with application to chronic lung disease diagnosis and prognosis
- PMID: 30192904
- PMCID: PMC6449754
- DOI: 10.1093/bioinformatics/bty769
Abstract
Motivation: Integration of data from different modalities is a necessary step for multi-scale data analysis in many fields, including biomedical research and systems biology. Directed graphical models offer an attractive tool for this problem because they can represent both the complex, multivariate probability distributions and the causal pathways influencing the system. Graphical models learned from biomedical data can be used for classification, biomarker selection and functional analysis, while revealing the underlying network structure and thus allowing for arbitrary likelihood queries over the data.
Results: In this paper, we present and test new methods for finding directed graphs over mixed data types (continuous and discrete variables). We used the resulting algorithm, CausalMGM, to identify variables directly linked to disease diagnosis and progression in various multi-modal datasets, including clinical datasets from chronic obstructive pulmonary disease (COPD). COPD is the third leading cause of death and a major cause of disability, so determining the factors that cause longitudinal lung function decline is important. Applied to a COPD dataset, mixed graphical models confirmed and extended previously described causal effects and provided new insights into the factors that potentially affect the longitudinal lung function decline of COPD patients.
Availability and implementation: The CausalMGM package is available at http://www.causalmgm.org.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.