Phenotypic extremes in rare variant study designs
- PMID: 26350511
- PMCID: PMC4867440
- DOI: 10.1038/ejhg.2015.197
Phenotypic extremes in rare variant study designs
Abstract
Currently, next-generation sequencing studies aim to identify rare and low-frequency variation that may contribute to disease. For a given effect size, as the allele frequency decreases, the power to detect genes or variants of interest also decreases. Although many methods have been proposed for the analysis of such data, study design and analytic issues still persist in data interpretation. In this study we present sequencing data for ABCA1 that has known rare variants associated with high-density lipoprotein cholesterol (HDL-C). We contrast empirical findings from two study designs: a phenotypic extreme sample and a population-based random sample. We found differing strengths of association with HDL-C across the two study designs (P=0.0006 with n=701 phenotypic extremes vs P=0.03 with n=1600 randomly sampled individuals). To explore this apparent difference in evidence for association, we performed a simulation study focused on the impact of phenotypic selection on power. We demonstrate that the power gain for an extreme phenotypic selection study design is much greater in rare variant studies than for studies of common variants. Our study confirms that studying phenotypic extremes is critical in rare variant studies because it boosts power in two ways: the typical increases from extreme sampling and increasing the proportion of relevant functional variants ascertained and thereby tested for association. Furthermore, we show that when combining statistical evidence through meta-analysis from an extreme-selected sample and a second separate population-based random sample, power is lower when a traditional sample size weighting is used compared with weighting by the noncentrality parameter.
Figures



Similar articles
-
Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants.Genet Epidemiol. 2011 May;35(4):236-46. doi: 10.1002/gepi.20572. Genet Epidemiol. 2011. PMID: 21308769
-
Detecting the Common and Individual Effects of Rare Variants on Quantitative Traits by Using Extreme Phenotype Sampling.Genes (Basel). 2016 Jan 14;7(1):2. doi: 10.3390/genes7010002. Genes (Basel). 2016. PMID: 26784232 Free PMC article.
-
Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico.BMC Bioinformatics. 2019 Jan 22;20(1):46. doi: 10.1186/s12859-018-2591-6. BMC Bioinformatics. 2019. PMID: 30669967 Free PMC article.
-
Strategies for identifying the genetic basis of dyslipidemia: genome-wide association studies vs. the resequencing of extremes.Curr Opin Lipidol. 2010 Apr;21(2):123-7. doi: 10.1097/MOL.0b013e328336eae9. Curr Opin Lipidol. 2010. PMID: 20125008 Review.
-
Multistage designs in the genomic era: providing balance in complex disease studies.Genet Epidemiol. 2007;31 Suppl 1(Suppl 1):S118-23. doi: 10.1002/gepi.20288. Genet Epidemiol. 2007. PMID: 18046769 Free PMC article. Review.
Cited by
-
Extreme phenotypes approach to investigate host genetics and COVID-19 outcomes.Genet Mol Biol. 2021 Mar 1;44(1 Suppl 1):e20200302. doi: 10.1590/1678-4685-GMB-2020-0302. eCollection 2021. Genet Mol Biol. 2021. PMID: 33651876 Free PMC article.
-
Emerging roles of rare and low-frequency genetic variants in type 1 diabetes mellitus.J Med Genet. 2021 May;58(5):289-296. doi: 10.1136/jmedgenet-2020-107350. Epub 2021 Mar 22. J Med Genet. 2021. PMID: 33753534 Free PMC article. Review.
-
Association of variants in HTRA1 and NOTCH3 with MRI-defined extremes of cerebral small vessel disease in older subjects.Brain. 2019 Apr 1;142(4):1009-1023. doi: 10.1093/brain/awz024. Brain. 2019. PMID: 30859180 Free PMC article.
-
EPS-LASSO: test for high-dimensional regression under extreme phenotype sampling of continuous traits.Bioinformatics. 2018 Jun 15;34(12):1996-2003. doi: 10.1093/bioinformatics/bty042. Bioinformatics. 2018. PMID: 29385408 Free PMC article.
-
Targeted exonic sequencing of GWAS loci in the high extremes of the plasma lipids distribution.Atherosclerosis. 2016 Jul;250:63-8. doi: 10.1016/j.atherosclerosis.2016.04.011. Epub 2016 Apr 23. Atherosclerosis. 2016. PMID: 27182959 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources