Whole genome identity-by-descent determination
- PMID: 23600820
- DOI: 10.1142/S0219720013500029
Abstract
High-throughput single nucleotide polymorphism (SNP) genotyping assays conveniently produce genotype data for genome-wide genetic linkage and association studies. For pedigree datasets, the unphased genotype data are used to infer the haplotypes of individuals according to Mendelian inheritance rules. Linkage studies can then locate putative chromosomal regions based on the haplotype allele sharing among the pedigree members and their disease status. Most existing haplotyping programs require rather strict pedigree structures and return a single inferred solution for downstream analysis. In this research, we relax the pedigree structure to allow ungenotyped founders and present a cubic-time whole-genome haplotyping algorithm that minimizes the number of zero-recombination haplotype blocks. With or without explicitly enumerating all the haplotyping solutions, the algorithm determines all distinct haplotype allele identity-by-descent (IBD) sharings among the pedigree members, in time linear in the total number of haplotyping solutions. Our algorithm is implemented as the computer program iBDD. Extensive simulation experiments using two sets of 16 pedigree structures from previous studies showed that, in general, there are trillions of haplotyping solutions but only up to a few thousand distinct haplotype allele IBD sharings. iBDD is able to return all these sharings for downstream genome-wide linkage and association studies.
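The gap between the number of haplotyping solutions and the number of distinct IBD sharings can be illustrated with a toy enumeration. The sketch below is not the iBDD algorithm. It assumes a single zero-recombination block in a nuclear family with two ungenotyped founders, uses hypothetical founder haplotype labels (`F1`, `F2`, `M1`, `M2`), and treats two solutions as the same IBD sharing among the children when they differ only by renaming founder haplotypes. All function names are illustrative.

```python
from itertools import product


def canonical_sharing(origins):
    """Relabel founder haplotype ids in order of first appearance.

    Two solutions that differ only in the names given to founder
    haplotypes map to the same tuple, i.e. the same IBD sharing.
    """
    relabel = {}
    canonical = []
    for origin in origins:
        if origin not in relabel:
            relabel[origin] = len(relabel)
        canonical.append(relabel[origin])
    return tuple(canonical)


def distinct_ibd_sharings(n_children):
    """Enumerate toy haplotyping solutions for one zero-recombination block.

    Assumption: each child inherits one whole paternal haplotype (F1 or F2)
    and one whole maternal haplotype (M1 or M2); founder labels are arbitrary
    because the founders are ungenotyped.
    Returns (number of solutions, set of distinct sharings among children).
    """
    per_child = list(product(("F1", "F2"), ("M1", "M2")))  # 4 choices per child
    n_solutions = 0
    sharings = set()
    for assignment in product(per_child, repeat=n_children):
        n_solutions += 1
        # Flatten to [pat_1, mat_1, pat_2, mat_2, ...] before canonicalizing.
        flat = [hap for child in assignment for hap in child]
        sharings.add(canonical_sharing(flat))
    return n_solutions, sharings


if __name__ == "__main__":
    for n in (1, 2, 3, 4):
        total, sharings = distinct_ibd_sharings(n)
        print(f"{n} children: {total} solutions, {len(sharings)} distinct sharings")
```

In this toy model the solution count grows as 4^n while the distinct sharings grow as 4^(n-1); the far larger gap reported in the abstract comes from whole-genome data over many blocks and larger pedigrees, which this sketch does not attempt to model.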