Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Sep 15;24(18):2015-22.
doi: 10.1093/bioinformatics/btn373. Epub 2008 Jul 17.

Considering dependence among genes and markers for false discovery control in eQTL mapping

Affiliations

Considering dependence among genes and markers for false discovery control in eQTL mapping

Liang Chen et al. Bioinformatics. .

Abstract

Motivation: Multiple comparison adjustment is a significant and challenging statistical issue in large-scale biological studies. In previous studies, dependence among genes is largely ignored. However, such dependence may be strong for some genomic-scale studies such as genetical genomics [also called expression quantitative trait loci (eQTL) mapping] in which thousands of genes are treated as quantitative traits and mapped to different genetical markers. Besides the dependence among markers, the dependence among the expression levels of genes can also have a significant impact on data analysis and interpretation.

Results: In this article, we propose to consider both the mean as well as the variance of false discovery number for multiple comparison adjustment to handle dependence among hypotheses. This is achieved by developing a variance estimator for false discovery number, and using the upper bound of false discovery proportion (uFDP) for false discovery control. More importantly, we introduce a weighted version of uFDP (wuFDP) control to improve the statistical power of eQTL identification. In addition, the wuFDP approach can better control false positives than false discovery rate (FDR) and uFDP approaches when markers are in linkage disequilibrium. The relative performance of uFDP control and wuFDP control is illustrated through simulation studies and real data analysis.

Supplementary information: Supplementary figures, tables and appendices are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Boxplots of FDP or weighted FDP among 1000 simulations for FDR, uFDP and wuFDP controls. The differential signal β=0.5. The threshold for FDR, uFDP and wuFDP is 0.1. z1−α is 1.65. The number of correlated non-differentially expressed genes (nc) varies from 0 to 200. The correlation among these genes (c) varies from 0.6 to 0.9.
Fig. 2.
Fig. 2.
Statistical power for different differential signal (0.3–0.9). The number of correlated non-differentially expressed genes varies from 50 to 200. The correlation among these genes varies from 0.6 to 0.9. The threshold for uFDP and wuFDP is 0.1. z1−α is 1.65. Triangle symbol line is for wuFDP control and cross symbol line is for uFDP control.
Fig. 3.
Fig. 3.
ROC curves for FDR, uFDP and wuFDP controls. The differential signal β=0.4. Two hundred non-differentially expressed gene are correlated with correlation 0.7. z1−α is 1.65 to control Pr(V/RuFDP) and Pr(∑wivi/∑wiriwuFDP) less than or equal to 0.05. Triangle symbol line is for wuFDP control, cross symbol line is for uFDP control and circle symbol line is for FDR.

References

    1. Benjamini Y, Hochberg Y. Controlling the false discovery rate - a practical and powerful appraoch to multiple testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 1995;57:289–300.
    1. Brem R, et al. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. - PubMed
    1. Broman K, et al. R/qtl: Qtl mapping in experimental crosses. Bioinformatics. 2003;19:889–890. - PubMed
    1. Bystrykh L, et al. Uncovering regulatory pathways that affect hematopoietic stem cell function using “genetical genomics”. Nat. Genet. 2005;37:225–232. - PubMed
    1. Chen L, Storey J. Relaxed significance criteria for linkage analysis. Genetics. 2006;173:2371–2381. - PMC - PubMed

Publication types

Substances