Bayesian approach to transforming public gene expression repositories into disease diagnosis databases
- PMID: 20360561
- PMCID: PMC2872390
- DOI: 10.1073/pnas.0912043107
Bayesian approach to transforming public gene expression repositories into disease diagnosis databases
Abstract
The rapid accumulation of gene expression data has offered unprecedented opportunities to study human diseases. The National Center for Biotechnology Information Gene Expression Omnibus is currently the largest database that systematically documents the genome-wide molecular basis of diseases. However, thus far, this resource has been far from fully utilized. This paper describes the first study to transform public gene expression repositories into an automated disease diagnosis database. Particularly, we have developed a systematic framework, including a two-stage Bayesian learning approach, to achieve the diagnosis of one or multiple diseases for a query expression profile along a hierarchical disease taxonomy. Our approach, including standardizing cross-platform gene expression data and heterogeneous disease annotations, allows analyzing both sources of information in a unified probabilistic system. A high level of overall diagnostic accuracy was shown by cross validation. It was also demonstrated that the power of our method can increase significantly with the continued growth of public gene expression repositories. Finally, we showed how our disease diagnosis system can be used to characterize complex phenotypes and to construct a disease-drug connectivity map.
Conflict of interest statement
The authors declare no conflict of interest.
Figures




Similar articles
-
Mandatory submission of microarray data to public repositories: how is it working?Physiol Genomics. 2005 Jan 20;20(2):153-6. doi: 10.1152/physiolgenomics.00264.2004. Physiol Genomics. 2005. PMID: 15661852 No abstract available.
-
Computational method for temporal pattern discovery in biomedical genomic databases.Proc IEEE Comput Syst Bioinform Conf. 2005:362-5. doi: 10.1109/csb.2005.25. Proc IEEE Comput Syst Bioinform Conf. 2005. PMID: 16447993
-
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.BMC Genomics. 2009 Sep 3;10:411. doi: 10.1186/1471-2164-10-411. BMC Genomics. 2009. PMID: 19728865 Free PMC article.
-
Bayesian methods in bioinformatics and computational systems biology.Brief Bioinform. 2007 Mar;8(2):109-16. doi: 10.1093/bib/bbm007. Epub 2007 Apr 12. Brief Bioinform. 2007. PMID: 17430978 Review.
-
Navigation and discovery in 3D CAD repositories.IEEE Comput Graph Appl. 2007 Jul-Aug;27(4):38-47. doi: 10.1109/mcg.2007.87. IEEE Comput Graph Appl. 2007. PMID: 17713233 Review. No abstract available.
Cited by
-
Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets.Nucleic Acids Res. 2015 Sep 18;43(16):7779-89. doi: 10.1093/nar/gkv810. Epub 2015 Aug 10. Nucleic Acids Res. 2015. PMID: 26261215 Free PMC article.
-
Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies.Bioinformatics. 2013 Dec 1;29(23):3036-44. doi: 10.1093/bioinformatics/btt529. Epub 2013 Sep 12. Bioinformatics. 2013. PMID: 24037214 Free PMC article.
-
Omics Profiling in Precision Oncology.Mol Cell Proteomics. 2016 Aug;15(8):2525-36. doi: 10.1074/mcp.O116.059253. Epub 2016 Apr 20. Mol Cell Proteomics. 2016. PMID: 27099341 Free PMC article. Review.
-
A Computational Framework for Genome-wide Characterization of the Human Disease Landscape.Cell Syst. 2019 Feb 27;8(2):152-162.e6. doi: 10.1016/j.cels.2018.12.010. Epub 2019 Jan 23. Cell Syst. 2019. PMID: 30685436 Free PMC article.
-
Enhancing systems medicine beyond genotype data by dynamic patient signatures: having information and using it too.Front Genet. 2013 Nov 19;4:241. doi: 10.3389/fgene.2013.00241. eCollection 2013. Front Genet. 2013. PMID: 24312119 Free PMC article.
References
-
- Horton PB, Kiseleva L, Fujibuchi W. RaPiDS: an algorithm for rapid expression profile database search. Genome Inform Ser. 2006;17(2):67–76. - PubMed
-
- Hibbs MA, et al. Exploring the functional landscape of gene expression: directed search of large microarray compendia. Bioinformatics. 2007;23(20):2692–2699. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources