Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2013 Mar;23(3):509-18.
doi: 10.1101/gr.145821.112. Epub 2012 Dec 10.

Genotype imputation via matrix completion

Affiliations
Comparative Study

Genotype imputation via matrix completion

Eric C Chi et al. Genome Res. 2013 Mar.

Abstract

Most current genotype imputation methods are model-based and computationally intensive, taking days to impute one chromosome pair on 1000 people. We describe an efficient genotype imputation method based on matrix completion. Our matrix completion method is implemented in MATLAB and tested on real data from HapMap 3, simulated pedigree data, and simulated low-coverage sequencing data derived from the 1000 Genomes Project. Compared with leading imputation programs, the matrix completion algorithm embodied in our program MENDEL-IMPUTE achieves comparable imputation accuracy while reducing run times significantly. Implementation in a lower-level language such as Fortran or C is apt to further improve computational efficiency.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The negative logarithm of P-values for association when the true signal depends on SNP 40,938. (MI) MENDEL-IMPUTE.
Figure 2.
Figure 2.
Accuracy versus time trade-off for the Nesterov algorithm on chromosome 4 from the Chinese Han group in HapMap 3. The numbers indicate the subwindow size w. (Dashed line) Error rate for MaCH on the same data set.

References

    1. The 1000 Genomes Project Consortium 2010. A map of human genome variation from population-scale sequencing. Nature 467: 1061–1073 - PMC - PubMed
    1. Ayers KL, Lange K 2008. Penalized estimation of haplotype frequencies. Bioinformatics 24: 1596–1602 - PubMed
    1. Beck A, Teboulle M 2009. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J Imaging Sci 2: 183–202
    1. Browning SR, Browning BL 2007. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet 81: 1084–1097 - PMC - PubMed
    1. Browning BL, Browning SR 2009. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 84: 210–223 - PMC - PubMed

Publication types