A likelihood-based approach to mixed modeling with ambiguity in cluster identifiers
- PMID: 18343883
- PMCID: PMC2536727
- DOI: 10.1093/biostatistics/kxm055
A likelihood-based approach to mixed modeling with ambiguity in cluster identifiers
Abstract
This manuscript describes a novel, linear mixed-effects model-fitting technique for the setting in which correlated data indicators are not completely observed. Mixed modeling is a useful analytical tool for characterizing genotype-phenotype associations among multiple potentially informative genetic loci. This approach involves grouping individuals into genetic clusters, where individuals in the same cluster have similar or identical multilocus genotypes. In haplotype-based investigations of unrelated individuals, corresponding cluster assignments are unobservable since the alignment of alleles within chromosomal copies is not generally observed. We derive an expectation conditional maximization approach to estimation in the mixed modeling setting, where cluster assignments are ambiguous. The approach has broad relevance to the analysis of data with missing correlated data identifiers. An example is provided based on data arising from a cohort of human immunodeficiency virus type-1-infected individuals at risk for antiretroviral therapy-associated dyslipidemia.
Keywords: Expectation conditional maximization; Genotype; HIV-1; Haplotype; Lipids; Missing identifiers; Mixed-effects models; Phenotype; Population-based genetic association studies.
Figures
References
-
- Chiu WF, Yucel RM, Zanutto E, Zaslavsky AM. Using matched substitutes to improve imputations for geographically linked databases. Survey Methodology. 2005;31:69–72.
-
- Demidenko E. Mixed Models: Theory and Applications. Hoboken, NJ: John Wiley & Sons; 2004.
-
- Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm (C/R: p22-37) Journal of the Royal Statistical Society, Series B, Methodological. 1977;39:1–22.
-
- Diggle P, Liang K-Y, Zeger SL. Analysis of Longitudinal Data. New York: Oxford University Press; 1994.
-
- Excoffier L, Slatkin M. Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Molecular Biology and Evolution. 1995;12:921–927. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
