Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009;38(18):3445-3460.
doi: 10.1080/03610920802696588.

Modeling Informatively Missing Genotypes in Haplotype Analysis

Affiliations

Modeling Informatively Missing Genotypes in Haplotype Analysis

Nianjun Liu et al. Commun Stat Theory Methods. 2009.

Abstract

It is common to have missing genotypes in practical genetic studies. The majority of the existing statistical methods, including those on haplotype analysis, assume that genotypes are missing at random-that is, at a given marker, different genotypes and different alleles are missing with the same probability. In our previous work, we have demonstrated that the violation of this assumption may lead to serious bias in haplotype frequency estimates and haplotype association analysis. We have proposed a general missing data model to simultaneously characterize missing data patterns across a set of two or more biallelic markers. We have proved that haplotype frequencies and missing data probabilities are identifiable if and only if there is linkage disequilibrium between these markers under the general missing data model. In this study, we extend our work to multi-allelic markers and observe a similar finding. Simulation studies on the analysis of haplotypes consisting of two markers illustrate that our proposed model can reduce the bias for haplotype frequency estimates due to incorrect assumptions on the missing data mechanism. Finally, we illustrate the utilities of our method through its application to a real data set from a study of scleroderma.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Akey J, Jin L, Xiong M. Haplotypes vs single marker linkage disequilibrium tests: What do we gain? Eur J Hum Genet. 2001;9(4):291–300. - PubMed
    1. Allen AS, Rathouz PJ, Satten GA. Informative missingness in genetic association studies: Case-parent designs. Am J Hum Genet. 2003;72(3):671–680. - PMC - PubMed
    1. Arnett FC, Cho M, Chatterjee S, Aguilar MB, Reveille JD, Mayes MD. Familial occurrence frequencies and relative risks for systemic sclerosis (scleroderma) in three United States cohorts. Arthritis Rheum. 2001;44(6):1359–1362. - PubMed
    1. Assassi S, Tan FK. Genetics of scleroderma: Update on single nucleotide polymorphism analysis and microarrays. Curr Opin Rheumatol. 2005;17(6):761–767. - PubMed
    1. Baugh JA, Chitnis S, Donnelly SC, Monteiro J, Lin X, Plant BJ, Wolfe F, Gregersen PK, Bucala R. A functional promoter polymorphism in the macrophage migration inhibitory factor (MIF) gene associated with disease severity in rheumatoid arthritis. Genes Immun. 2002;3(3):170–176. - PubMed

LinkOut - more resources