Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 15;26(12):i175-82.
doi: 10.1093/bioinformatics/btq204.

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping

Affiliations

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping

Sivan Bercovici et al. Bioinformatics. .

Abstract

Motivation: Association analysis is the method of choice for studying complex multifactorial diseases. The premise of this method is that affected persons contain some common genomic regions with similar SNP alleles and such areas will be found in this analysis. An important disadvantage of GWA studies is that it does not distinguish between genomic areas that are inherited from a common ancestor [identical by descent (IBD)] and areas that are identical merely by state [identical by state (IBS)]. Clearly, areas that can be marked with higher probability as IBD and have the same correlation with the disease status of identical areas that are more probably only IBS, are better candidates to be causative, and yet this distinction is not encoded in standard association analysis.

Results: We develop a factorial hidden Markov model-based algorithm for computing genome-wide IBD sharing. The algorithm accepts as input SNP data of measured individuals and estimates the probability of IBD at each locus for every pair of individuals. For two g-degree relatives, when g > or = 8, the computation yields a precision of IBD tagging of over 50% higher than previous methods for 95% recall. Our algorithm uses a first-order Markovian model for the linkage disequilibrium process and employs a reduction of the state space of the inheritance vector from being exponential in g to quadratic. The higher accuracy along with the reduced time complexity marks our method as a feasible means for IBD mapping in practical scenarios.

Availability: A software implementation, called IBDMAP, is freely available at http://bioinfo.cs.technion.ac.il/IBDmap.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Two slots of the g-degree relatives pedigree, where each node represents an individual at a specific location. The last generation is genotyped (measured). The full model has in addition two LD terms between the founder Fi in one slot and Fi in the next slot, for every founder in the pedigree. For clarity, these links are not shown. The model drawn has one LD term for each of the common founder B's alleles, and one additional LD term for each of the founders Fg and Fg.
Fig. 2.
Fig. 2.
The 4-track model achieves a significantly higher precision in comparison to the other methods under most tested scenarios, increasing the performance gap for g≥8 with over 50% improvement. Results correspond to over 95% recall for the 4-track model, and a lower recall for Plink and Merlin.
Fig. 3.
Fig. 3.
Finding a disease seeded at 22 cm (location indicated by the horizontal line) using IBD predictions. Pairs of individuals were simulated using g = 6,8,10, assuming a dominant disease with complete penetrance that originated from a common founder.

Similar articles

Cited by

References

    1. Abecasis G, Wigginton J. Handling marker-marker linkage disequilibrium: pedigree analysis with clustered markers. Am. J. Hum. Genet. 2005;77:754–767. - PMC - PubMed
    1. Abecasis GR, et al. Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 2002;30:97–101. - PubMed
    1. Benjamini Y, Yekutieli D. Quantitative traits loci analysis using the false discovery rate. Genetics. 2005;171:783–789. - PMC - PubMed
    1. Bercovici S, Geiger D. Inferring ancestries efficiently in admixed populations with linkage disequilibrium. J. Comput. Biol. 2009;16:1141–1150. - PubMed
    1. Browning S, Browning B. On reducing the statespace of hidden markov models for the identity by descent process. Theor. Popul. Biol. 2002;62:1–8. - PubMed

Publication types