Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jan;13(2):213-22.
doi: 10.2217/pgs.11.145.

Integrating heterogeneous high-throughput data for meta-dimensional pharmacogenomics and disease-related studies

Affiliations

Integrating heterogeneous high-throughput data for meta-dimensional pharmacogenomics and disease-related studies

Emily R Holzinger et al. Pharmacogenomics. 2012 Jan.

Abstract

The current paradigm of human genetics research is to analyze variation of a single data type (i.e., DNA sequence or RNA levels) to detect genes and pathways that underlie complex traits such as disease state or drug response. While these studies have detected thousands of variations that associate with hundreds of complex phenotypes, much of the estimated heritability, or trait variability due to genetic factors, remain unexplained. We may be able to account for a portion of the missing heritability if we incorporate a systems biology approach into these analyses. Rapid technological advances will make it possible for scientists to explore this hypothesis via the generation of high-throughput omics data - transcriptomic, proteomic and methylomic to name a few. Analyzing this 'meta-dimensional' data will require clever statistical techniques that allow for the integration of qualitative and quantitative predictor variables. For this article, we examine two major categories of approaches for integrated data analysis, give examples of their use in experimental and in silico datasets, and assess the limitations of each method.

PubMed Disclaimer

Conflict of interest statement

Financial & competing interests disclosure

The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.

No writing assistance was utilized in the production of this manuscript.

Figures

Figure 1
Figure 1. Variations of the triangle method
eQTL: Expression quantitative trait loci
Figure 2
Figure 2. Decision tree example
For the SNP variables, the genotypes are represented as: 0: no minor alleles; 1: one minor allele; and 2: two minor alleles. The up and down dashed arrows indicate increased and decreased gene expression, respectively. EXP: Gene expression
Figure 3
Figure 3. Bayesian network example with direct and indirect effects
EXP: Gene expression; PHENO: Phenotype.
Figure 4
Figure 4. Single-layer artificial neural network

Similar articles

Cited by

References

    1. Maher B. Personal genomes: the case of the missing heritability. Nature. 2008;456(7218):18–21. - PubMed
    1. Grant GR, Manduchi E, Stoeckert CJ., Jr Analysis and management of microarray gene expression data. Curr. Protoc. Mol. Biol. 2007;Chapter 19(Unit 19.6) - PubMed
    1. Ozsolak F, Platt AR, Jones DR, et al. Direct RNA sequencing. Nature. 2009;461(7265):814–818. - PubMed
    1. Nilsson T, Mann M, Aebersold R, et al. Mass spectrometry in high-throughput proteomics: ready for the big time. Nat. Methods. 2010;7(9):681–685. - PubMed
    1. Carless M. Investigation of genomic methylation status using methylation-specific and bisulfite sequencing polymerase chain reaction. Methods Mol. Biol. 2009;523:217–234. - PubMed

Websites

    1. Hindorff LA, Junkins HA, Hall PN, Mehta JP, Manolio TA. A catalog of published genome-wide association studies. www.genome.gov/gwastudies.
    1. Coriell Institute for Medical Research. Camden, NJ, USA: http://ccr.coriell.org.

Publication types