Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space
- PMID: 19477987
- PMCID: PMC2687978
- DOI: 10.1093/bioinformatics/btp224
Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space
Abstract
We develop an hidden Markov model (HMM)-based algorithm for computing exact parametric and non-parametric linkage scores in larger pedigrees than was possible before. The algorithm is applicable whenever there are chains of persons in the pedigree with no genetic measurements and with unknown affection status. The algorithm is based on shrinking the state space of the HMM considerably using such chains. In a two g-degree cousins pedigree the reduction drops the state space from being exponential in g to being linear in g. For a Finnish family in which two affected children suffer from a rare cold-inducing sweating syndrome, we were able to reduce the state space by more than five orders of magnitude from 2(50) to 2(32). In another pedigree of state-space size of 2(27), used for a study of pituitary adenoma, the state space reduced by a factor of 8.5 and consequently exact linkage scores can now be computed, rather than approximated.
Supplementary information: Supplementary data are available at Bioinformatics online.
Figures





Similar articles
-
Efficient identification of identical-by-descent status in pedigrees with many untyped individuals.Bioinformatics. 2010 Jun 15;26(12):i191-8. doi: 10.1093/bioinformatics/btq222. Bioinformatics. 2010. PMID: 20529905 Free PMC article.
-
Faster multipoint linkage analysis using Fourier transforms.J Comput Biol. 1998 Spring;5(1):1-7. doi: 10.1089/cmb.1998.5.1. J Comput Biol. 1998. PMID: 9541867
-
A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees.Bioinformatics. 2013 Jan 15;29(2):197-205. doi: 10.1093/bioinformatics/bts658. Epub 2012 Nov 18. Bioinformatics. 2013. PMID: 23162081 Free PMC article.
-
Finding starting points for Markov chain Monte Carlo analysis of genetic data from large and complex pedigrees.Genet Epidemiol. 2003 Jul;25(1):14-24. doi: 10.1002/gepi.10243. Genet Epidemiol. 2003. PMID: 12813723 Review.
-
Metropolis sampling in pedigree analysis.Stat Methods Med Res. 1993;2(3):263-82. doi: 10.1177/096228029300200305. Stat Methods Med Res. 1993. PMID: 8261261 Review.
Cited by
-
A genetic algorithm-based Boolean delay model of intracellular signal transduction in inflammation.BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S17. doi: 10.1186/1471-2105-12-S1-S17. BMC Bioinformatics. 2011. PMID: 21342546 Free PMC article.
-
Haplotypes versus genotypes on pedigrees.Algorithms Mol Biol. 2011 Apr 19;6(1):10. doi: 10.1186/1748-7188-6-10. Algorithms Mol Biol. 2011. PMID: 21504603 Free PMC article.
-
Isomorphism and similarity for 2-generation pedigrees.BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2105-16-S5-S7. Epub 2015 Mar 18. BMC Bioinformatics. 2015. PMID: 25860335 Free PMC article.
-
Efficient identification of identical-by-descent status in pedigrees with many untyped individuals.Bioinformatics. 2010 Jun 15;26(12):i191-8. doi: 10.1093/bioinformatics/btq222. Bioinformatics. 2010. PMID: 20529905 Free PMC article.
-
Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping.Bioinformatics. 2010 Jun 15;26(12):i175-82. doi: 10.1093/bioinformatics/btq204. Bioinformatics. 2010. PMID: 20529903 Free PMC article.
References
-
- Abecasis GR, et al. Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 2002;30:97–101. - PubMed
-
- Dechter R. Bucket elimination: a unifying framework for probabilistic inference. In: Jordan MI, editor. Learning in Graphical Models. Kluwer Academic Press; 1998. pp. 75–104.
-
- Elston R, Stewart J. A general model for the analysis of pedigree data. Hum. Hered. 1971;21:523–542. - PubMed