Twin Res Hum Genet. 2017 Apr;20(2):108-118.
doi: 10.1017/thg.2017.7. Epub 2017 Feb 27.

The Weighting is the Hardest Part: On the Behavior of the Likelihood Ratio Test and the Score Test Under a Data-Driven Weighting Scheme in Sequenced Samples

Camelia C Minică et al. Twin Res Hum Genet. 2017 Apr.

Abstract

Sequence-based association studies are at a critical inflexion point with the increasing availability of exome-sequencing data. A popular test of association is the sequence kernel association test (SKAT). Weights are embedded within SKAT to reflect the hypothesized contribution of the variants to the trait variance. Because the true weights are generally unknown, and so are subject to misspecification, we examined the efficiency of a data-driven weighting scheme. We propose using a set of theoretically defensible weighting schemes and assume that the one yielding the largest test statistic is likely to best capture the relationship between allele frequency and functional effect. We show that the use of alternative weights obviates the need to impose arbitrary frequency thresholds. As both the score test and the likelihood ratio test (LRT) may be used in this context, and may differ in power, we characterize the behavior of both tests. The two tests have equal power if the set includes weights resembling the correct ones. However, if the weights are badly specified, the LRT shows superior power (due to its robustness to misspecification). With this data-driven weighting procedure, the LRT detected significant signal in genes located in regions already confirmed as associated with schizophrenia, PRRC2A (p = 1.020e-06) and VARS2 (p = 2.383e-06), in the Swedish schizophrenia case-control cohort of 11,040 individuals with exome-sequencing data. The score test is currently preferred for its computational efficiency and power. Indeed, assuming correct specification, in some circumstances, the score test is the most powerful test. However, the LRT has the advantageous properties of being generally more robust and more powerful under weight misspecification. This is an important result given that, arguably, misspecified models are likely to be the rule rather than the exception in weighting-based approaches.
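The idea of trying several weighting schemes and keeping the one with the largest test statistic can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a SKAT-type score statistic without covariates, Beta-density weights evaluated at the sample minor allele frequency, weights rescaled to unit length so that statistics are roughly comparable across schemes, and a permutation null for the maximum statistic. The function names, the `SCHEMES` set, and the permutation step are all hypothetical choices made for illustration.

```python
import math
import numpy as np

# Illustrative candidate set (an assumption, not the paper's set):
# Beta(1, 25) up-weights rare variants, Beta(0.5, 0.5) is U-shaped, Beta(1, 1) is flat.
SCHEMES = {"beta(1,25)": (1.0, 25.0), "beta(0.5,0.5)": (0.5, 0.5), "flat": (1.0, 1.0)}

def beta_density(x, a, b):
    """Beta(a, b) density at x in (0, 1), the analogue of R's dbeta(x, a, b)."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return np.exp(log_norm + (a - 1) * np.log(x) + (b - 1) * np.log1p(-x))

def skat_type_statistic(y, G, w):
    """Q = sum_j (w_j * S_j)^2 with S_j = G_j' (y - mean(y)); no covariates."""
    w = w / np.linalg.norm(w)              # common scale across schemes (assumption)
    scores = G.T @ (y - y.mean())          # per-variant score contributions
    return float(np.sum((w * scores) ** 2))

def max_over_weights(y, G, schemes=SCHEMES):
    """Return the scheme with the largest statistic and all per-scheme statistics."""
    n = G.shape[0]
    # Sample MAF from 0/1/2 genotypes, clipped away from 0 and 1 to keep densities finite.
    maf = np.clip(G.mean(axis=0) / 2.0, 1.0 / (2 * n), 1.0 - 1.0 / (2 * n))
    stats = {name: skat_type_statistic(y, G, beta_density(maf, a, b))
             for name, (a, b) in schemes.items()}
    return max(stats, key=stats.get), stats

def permutation_p_value(y, G, schemes=SCHEMES, n_perm=200, seed=0):
    """Permutation p-value for the maximum statistic, so selection is accounted for."""
    rng = np.random.default_rng(seed)
    _, observed = max_over_weights(y, G, schemes)
    obs_max = max(observed.values())
    exceed = 0
    for _ in range(n_perm):
        _, perm = max_over_weights(rng.permutation(y), G, schemes)
        exceed += max(perm.values()) >= obs_max
    return (exceed + 1) / (n_perm + 1)
```

Because the winning scheme is chosen after looking at the data, the sketch recomputes the maximum in every permuted dataset rather than referring the winning statistic to its usual null distribution. How the paper itself calibrates the selected statistic is not stated in the abstract.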

Keywords: MAF thresholding; SKAT; power; robustness; schizophrenia; variable weighting.

Figures

Fig 1
The power of the likelihood ratio test (LRT) and the score test to detect a gene harboring 50 functional variants, jointly explaining 1% of the phenotypic variance (minor allele frequency 0.5%–5%). Data were simulated according to weights dbeta(.5,.5). Power was evaluated in 1000 datasets consisting of 10,000 individuals each.
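One dataset of the kind described in this caption could be generated roughly as follows. This is a sketch under stated assumptions, not the authors' simulation code: variants are drawn independently (no linkage disequilibrium), minor allele frequencies are uniform on 0.5% to 5%, all effects are positive and proportional to the Beta(0.5, 0.5) density at the variant's frequency, and the phenotype has unit variance. The function name `simulate_gene` and its parameters are hypothetical.

```python
import numpy as np

def simulate_gene(n=10_000, m=50, h2=0.01, seed=0):
    """Simulate n individuals, m causal variants jointly explaining a fraction h2."""
    rng = np.random.default_rng(seed)
    maf = rng.uniform(0.005, 0.05, size=m)               # MAF between 0.5% and 5%
    G = rng.binomial(2, maf, size=(n, m)).astype(float)  # 0/1/2 genotype counts
    # Beta(0.5, 0.5) density, i.e. dbeta(maf, .5, .5), in closed form.
    w = 1.0 / (np.pi * np.sqrt(maf * (1.0 - maf)))
    g = G @ w                                            # effects proportional to weights
    g = (g - g.mean()) / g.std() * np.sqrt(h2)           # rescale gene effect to variance h2
    y = g + rng.normal(scale=np.sqrt(1.0 - h2), size=n)  # residual fills the remaining variance
    return G, y, g
```

A power estimate would repeat this over many seeds (the caption reports 1000 datasets) and record how often each test rejects at the chosen significance level.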
