Type I Error Rates and Parameter Bias in Multivariate Behavioral Genetic Models
- PMID: 30569348
- PMCID: PMC6345547
- DOI: 10.1007/s10519-018-9942-y
Type I Error Rates and Parameter Bias in Multivariate Behavioral Genetic Models
Abstract
For many multivariate twin models, the numerical Type I error rates are lower than theoretically expected rates using a likelihood ratio test (LRT), which implies that the significance threshold for statistical hypothesis tests is more conservative than most twin researchers realize. This makes the numerical Type II error rates higher than theoretically expected. Furthermore, the discrepancy between the observed and expected error rates increases as more variables are included in the analysis and can have profound implications for hypothesis testing and statistical inference. In two simulation studies, we examine the Type I error rates for the Cholesky decomposition and Correlated Factors models. Both show markedly lower than nominal Type I error rates under the null hypothesis, a discrepancy that increases with the number of variables in the model. In addition, we observe slightly biased parameter estimates for the Cholesky decomposition and Correlated Factors models. By contrast, if the variance-covariance matrices for variance components are estimated directly (without constraints), the numerical Type I error rates are consistent with theoretical expectations and there is no bias in the parameter estimates regardless of the number of variables analyzed. We call this the direct symmetric approach. It appears that each model-implied boundary, whether explicit or implicit, increases the discrepancy between the numerical and theoretical Type I error rates by truncating the sampling distributions of the variance components and inducing bias in the parameters. The direct symmetric approach has several advantages over other multivariate twin models as it corrects the Type I error rate and parameter bias issues, is easy to implement in current software, and has fewer optimization problems. Implications for past and future research, and potential limitations associated with direct estimation of genetic and environmental covariance matrices are discussed.
Keywords: Cholesky decomposition; Correlated factors model; Direct symmetrical matrix; Twin models; Type I error.
Conflict of interest statement
Figures





References
-
- Boker SM, Neale MC, Maes HH, Wilde MJ, Spiegel M, Brick TR, Estabrook R, Bates TC, Mehta P,von Oertzen T, Gore RJ, Hunter MD, Hackett DC, Karch J, Brandmaier A, Pritikin JM, Zahery M, Kirkpatrick RM, Wang Y, Driver C, Johnson SG, Kraft D, Wilhelm S, & Manjunath BG (2017) OpenMx 2.7.17–23 User Guide
-
- Bulik-Sullivan BK (2015) Relationship between LD Score and Haseman-Elston, bioRxiv doi 10.1101/018283. - DOI
-
- Coventry WL, & Keller MC (2005) Estimating the extent of parameter bias in the classical twin design: a comparison of parameter estimates from extended twin-family and classical twin designs. Twin Research and Human Genetics 8(3):214–23. - PubMed
-
- Dominicus A, Skrondal A, Gjessing HK, Pedersen NL, Palmgren J (2006) Likelihood ratio tests in behavioral genetics: problems and solutions. Behavior Genetics 36(2):331–340. - PubMed
-
- Falconer DS (1960) Introduction to Quantitative Genetics Oliver and Boyd, London.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Miscellaneous