Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005;6(10):R88.
doi: 10.1186/gb-2005-6-10-r88. Epub 2005 Sep 19.

Searching for differentially expressed gene combinations

Affiliations

Searching for differentially expressed gene combinations

Marcel Dettling et al. Genome Biol. 2005.

Abstract

We propose 'CorScor', a novel approach for identifying gene pairs with joint differential expression. This is defined as a situation with good phenotype discrimination in the bivariate, but not in the two marginal distributions. CorScor can be used to detect phenotype-related dependencies and interactions among genes. Our easily interpretable approach is scalable to current microarray dimensions and yields promising results on several cancer-gene-expression datasets.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Two artificial examples of joint differential gene expression. The units of the x-axis and y-axis are gene expression; blue triangles and red circles represent samples of two different phenotypes. The inner panels reflect the joint distribution; the outer margins display the univariate marginal distributions. The dashed lines represent the first principal components, conditional on the phenotype.
Figure 2
Figure 2
Six examples of joint differential gene expression of the gap/substitution type, obtained from the colon and BRCA1 datasets. The inner panels show the joint distribution; the outer margins display the univariate distributions. Blue triangles stand for cancers in colon and BRCA1 mutants in breast; the red circles stand for normal samples in colon and sporadic cancers in breast. The dashed lines represent the conditional first principal components.
Figure 3
Figure 3
Symmetric heat map of CorScor values from Equation (1), for the colon and BRCA1 data. Columns and rows are rearranged according to a hierarchical clustering. Displayed are the 50 genes that are involved in the pairs with the highest scores. Black stands for low, grey for intermediate, and white for high score.
Figure 4
Figure 4
Six examples of joint differential gene expression according to the on/off-scenario, obtained from the colon and BRCA1 data. The inner panels show the joint distribution; the outer margins display the univariate distributions. Blue triangles stand for cancerous and BRCA1 mutants, the red circles for normal and BRCA1 wild types, respectively. The dashed lines represent the direction of the conditional first principal components.
Figure 5
Figure 5
Histograms displaying the right tail of the permutation distributions of CorScor in the colon and BRCA1 data. The dashed vertical lines indicate the score values of the top three gene pairs from Figures 2 and 4. Also reported is the fraction of null scores (tail.p) that exceed each of observed values.
Figure 6
Figure 6
Comparison of CorScor and predictive modeling scores. Density plots for a comparison of the gap/substitution scoring function from correlation scoring defined in Equation (1) and predictive modeling (Equation (4)), as well as the on/off objective measures defined in Equations (2) and (5). Each panel is divided into a 50-×-50-cell grid. The darker the color of a cell, the more instances are therein. In the figure header, cor is the Pearson correlation coefficient between the CorScor and the respective predictive modeling scores.
Figure 7
Figure 7
Power analysis for detecting joint differential expression. Receiver operating characteristic (ROC) curves that display the fraction of false positives, or discriminatory ability, in our simulation study to detect joint differential expression. The left panel summarizes information about the gap/substitution scenario; the right panel is about the on/off scenario. The solid lines correspond to CorScor, and the dashed ones, to logistic regression. Finally, the strength of joint differential expression was set at five different levels in our simulation experiment. The yellow lines are for the weakest amount of joint differential expression and the black lines, for the strongest amount.

References

    1. Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. - DOI - PMC - PubMed
    1. Efron B, Tibshirani R, Storey J, Tusher V. Empirical Bayes analysis of a microarray experiment. J Am Stat Assoc. 2001;96:1151–1160. doi: 10.1198/016214501753382129. - DOI
    1. Pan W. A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments. Bioinformatics. 2002;18:546–554. doi: 10.1093/bioinformatics/18.4.546. - DOI - PubMed
    1. Dudoit S, Fridlyand J. Classification in microarray experiments. In: Speed T, editor. Statistical Analysis of Gene Expression Data. New York: Chapman and Hall; 2003. pp. 93–158.
    1. Dettling M, Bühlmann P. Finding predictive gene groups from microarray data. J Multivariate Anal. 2004;90:106–131. doi: 10.1016/j.jmva.2004.02.012. - DOI

Publication types

LinkOut - more resources