Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jul;33(5):406-18.
doi: 10.1002/gepi.20394.

Unbiased estimation of odds ratios: combining genomewide association scans with replication studies

Affiliations
Free PMC article

Unbiased estimation of odds ratios: combining genomewide association scans with replication studies

Jack Bowden et al. Genet Epidemiol. 2009 Jul.
Free PMC article

Abstract

Odds ratios or other effect sizes estimated from genome scans are upwardly biased, because only the top-ranking associations are reported, and moreover only if they reach a defined level of significance. No unbiased estimate exists based on data selected in this fashion, but replication studies are routinely performed that allow unbiased estimation of the effect sizes. Estimation based on replication data alone is inefficient in the sense that the initial scan could, in principle, contribute information on the effect size. We propose an unbiased estimator combining information from both the initial scan and the replication study, which is more efficient than that based just on the replication. Specifically, we adjust the standard combined estimate to allow for selection by rank and significance in the initial scan. Our approach explicitly allows for multiple associations arising from a scan, and is robust to mis-specification of a significance threshold. We require replication data to be available but argue that, in most applications, estimates of effect sizes are only useful when associations have been replicated. We illustrate our approach on some recently completed scans and explore its efficiency by simulation.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Top: Difference between the MLE and the corrected MLE of Zhong and Prenctice [2008]. Bottom: Difference between the MLE and the UMVCUE. MLE, maximum likelihood estimator.
Fig. 2
Fig. 2
Difference between the stage 2 estimates and the UMVCUE.
Fig. 3
Fig. 3
Top: Bias and MSE for T1D data. Bottom: bias and MSE for CD data. Numbers calculated using method A. TID, type-1 diabetes; CD, Crohn's disease.
Fig. 4
Fig. 4
Top: Bias and MSE for T1D data. Bottom: bias and MSE for CD data. Numbers calculated using method B. TID, type-1 diabetes; CD, Crohn's disease.
Fig. 5
Fig. 5
Top: Bias and MSE for T1D data. Bottom: bias and MSE for CD data. Numbers calculated using simulation method A, but with an assumed stage 1 of size 20,000 and stage 2 size 2,000. TID, type-1 diabetes; CD, Crohn's disease.
Fig. 6
Fig. 6
Point estimates and bootstrap confidence intervals for the MLE and UMVCUE for rs17696736, varying the stage 2 standard error relative to stage 1. The point estimates and bootstrap confidence intervals are calculated using method A.

References

    1. Bowden J, Glimm E. Unbiased estimation of selected treatment means in two-stage trials. Biometrical J. 2008;50:515–527. - PubMed
    1. Chanock SJ, Manolio T, Boehnke M, Boerwinkle E, Hunter DJ, Thomas G, Hirschhorn JN, Abecasis G, Altshuler D, Bailey-Wilson JE, Brooks LD, Cardon LR, Daly M, Donnelly P, Fraumeni JF, Jr, Freimer NB, Gerhard DS, Gunter C, Guttmacher AE, Guyer MS, Harris EL, Hoh J, Hoover R, Kong CA, Merikangas Kr, Morton CC, Palmer LF, Phimister EG, Rice JP, Robert J, Rotimi C, Tucker Ma, Vogan KJ, Wacholder S, Wijsman EM, Winn DM, Collins FS, NCI-NHGRI Working Group on Replication in Association Studies Replicating genotype–phenotype associations. Nature. 2007;447:655–660. - PubMed
    1. Cohen A, Sackrowitz H. Two stage conditionally unbiased estimators of the selected mean. Stat Probability Lett. 1989;8:273–278.
    1. Copas J. What works? Selectivity models in meta-analysis. J R Stat Soc Ser A. 1999;162:95–109.
    1. Easton DF, Pooley KA, Dunning AM, Pharoah PDP, Thompson D, Ballinger DG, Struewing JP, Morrison J, Field H, Luben R, Wareham N, Ahmed S, Healey CS, Bowman R SEARCH collaborators; Meyer KB, Haiman CA, Kolonel LK, Henderson BE, Le Marchand L, Brennan P, Sangrajrang S, Gaborieau V, Odefrey F, Shen CY, Wu PE, Wang HC, Eccles D, Evans DG, Peto J, Fletcher O, Johnson N, Seal S, Stratton MR, Rahman N, Chenevix-Trench G, Bojesen SE, Nordestgaard BG, Axelsson CK, Garcia-Closas M, Brinton L, Chanock S, Lissowska J, Peplonska B, Nevanlinna H, Pagerholm R, Eerola H, Kang D, Yoo KY, Noh DY, Ahn SH, Hunter DJ, Hankinson SE, Cox DG, Hall P, Wedren S, Liu J, Low YL, Bogdanova N, Schurmann P, Dork T, Tollenaar RA, Jacobi CE, Devilee P, Klijn JG, Sigurdson AJ, Doody MM, Alexander BH, Zhang J, Cox A, Brock IW, MacPherson G, Reed MW, Couch FJ, Goode EL, Olson JE, Meijers-Heijboer H, van den Puweland A, Uitterlinden A, Rivadeneira F, Milne RL, Ribas G, Gonzalez-Neira A, Benitez J, Hopper JL, McCredie MM, Southey M, Giles GG, Schroen C, Justenhoven C, Brauch H, Hamann U, Ko YD, Spurdle AB, Beesley J, Chen X kConFab; AOCS Management Group. Mannermaa A, Kosma VM, Kataja V, Hartikainen J, Day NE, Cox DR, Ponder BA. Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007;447:1087–1093. - PMC - PubMed

Publication types

LinkOut - more resources