Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays
- PMID: 15657097
- DOI: 10.1093/bioinformatics/bti275
Abstract
Motivation: A high density of single nucleotide polymorphism (SNP) coverage on the genome is desirable and often an essential requirement for population genetics studies. Region-specific or chromosome-specific linkage studies also benefit from the availability of as many high-quality SNPs as possible. The availability of millions of SNPs from both Perlegen and the public domain, together with the development of an efficient microarray-based assay for genotyping SNPs, has raised interesting analytical challenges. Effective methods for the selection of optimal subsets of SNPs spanning the genome and methods for accurately calling genotypes from probe hybridization patterns have enabled the development of a new microarray-based system for robustly genotyping over 100,000 SNPs per sample.
Results: We introduce a new dynamic model-based algorithm (DM) for screening over 3 million SNPs and genotyping over 100,000 SNPs. The model is based on four possible underlying states for each probe quartet: Null, A, AB and B. We calculate a probe-level log likelihood for each model and then select between the four competing models with an SNP-level statistical aggregation across multiple probe quartets to provide a high-quality genotype call along with a quality measure of the call. We assess performance with HapMap reference genotypes, informative Mendelian inheritance relationships in families, and consistency between DM and another genotype classification method, MPAM. At a call rate of 95.91%, the concordance with reference genotypes from the HapMap Project is 99.81% based on over 1.5 million genotypes, the Mendelian error rate is 0.018% based on 10 trios, and the consistency between DM and MPAM is 99.90% at a comparable call rate of 97.18%. We also develop methods for SNP selection and optimal probe selection.
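The four-state selection described in the Results can be illustrated with a minimal sketch. It is not the published DM implementation: the abstract does not give the likelihood or the aggregation statistic, so everything below is an assumption made for illustration. A quartet is taken to be `(pm_a, mm_a, pm_b, mm_b)`, each state splits the quartet into a "signal" group and a "background" group with its own mean, the likelihood is Gaussian with a fixed noise level `sigma`, one unit is subtracted per fitted mean so that the Null state can win on pure noise, and the SNP-level aggregation is a plain sum of per-quartet scores. The names `quartet_scores`, `call_snp`, `sigma` and `min_margin` are hypothetical.

```python
import math

STATES = ("Null", "A", "AB", "B")

# Assumed layout of a quartet: (pm_a, mm_a, pm_b, mm_b).
# For each state, the indices of the probes expected to carry signal.
FOREGROUND = {"Null": (), "A": (0,), "AB": (0, 2), "B": (2,)}


def quartet_scores(quartet, sigma):
    """Penalized Gaussian log likelihood of one quartet under each state.

    Simplification: the signal group is not required to be brighter than
    the background group, which a real caller would likely enforce.
    """
    n = len(quartet)
    const = -0.5 * n * math.log(2.0 * math.pi * sigma ** 2)
    scores = {}
    for state in STATES:
        fg_idx = FOREGROUND[state]
        groups = (
            [quartet[i] for i in fg_idx],
            [quartet[i] for i in range(n) if i not in fg_idx],
        )
        rss = 0.0      # residual sum of squares around the group means
        n_means = 0    # number of fitted means (model complexity)
        for group in groups:
            if group:
                mean = sum(group) / len(group)
                rss += sum((x - mean) ** 2 for x in group)
                n_means += 1
        # Assumed penalty of 1 per fitted mean (AIC-like), not from the paper.
        scores[state] = const - 0.5 * rss / sigma ** 2 - n_means
    return scores


def call_snp(quartets, sigma=50.0, min_margin=1.0):
    """Sum scores over quartets; return (call, margin over the runner-up)."""
    totals = {state: 0.0 for state in STATES}
    for quartet in quartets:
        for state, value in quartet_scores(quartet, sigma).items():
            totals[state] += value
    ranked = sorted(STATES, key=lambda s: totals[s], reverse=True)
    best, runner_up = ranked[0], ranked[1]
    margin = totals[best] - totals[runner_up]
    if best == "Null" or margin < min_margin:
        return "NoCall", margin
    return best, margin


# Made-up intensities for illustration only.
hom_a = [(1000, 100, 110, 95), (1200, 90, 105, 100)]
het = [(900, 100, 950, 90), (1100, 95, 1000, 105)]
print(call_snp(hom_a)[0], call_snp(het)[0])
```

The margin between the best and second-best state plays the role of the quality measure here; raising `min_margin` trades call rate for accuracy, which is the same trade-off the reported call rate and concordance figures reflect.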
Availability: The DM algorithm is available in Affymetrix's Genotyping Tools software package and in Affymetrix's GDAS software package. See http://www.affymetrix.com for further information. 10 K and 100 K mapping array data are available on the Affymetrix website.
Similar articles
- Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. Nat Methods. 2004 Nov;1(2):109-11. doi: 10.1038/nmeth718. PMID: 15782172
- Dynamic variable selection in SNP genotype autocalling from APEX microarray data. BMC Bioinformatics. 2006 Nov 30;7:521. doi: 10.1186/1471-2105-7-521. PMID: 17137502 Free PMC article.
- A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays. Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25. PMID: 17459966
- Analysis of SNPs and other genomic variations using gel-based chips. Hum Mutat. 2002 Apr;19(4):343-60. doi: 10.1002/humu.10077. PMID: 11933189 Review.
- Automation in genotyping of single nucleotide polymorphisms. Hum Mutat. 2001 Jun;17(6):475-92. doi: 10.1002/humu.1131. PMID: 11385706 Review.
Cited by
- Mismatch and G-stack modulated probe signals on SNP microarrays. PLoS One. 2009 Nov 17;4(11):e7862. doi: 10.1371/journal.pone.0007862. PMID: 19924253 Free PMC article.
- SNPExpress: integrated visualization of genome-wide genotypes, copy numbers and gene expression levels. BMC Genomics. 2008 Jan 25;9:41. doi: 10.1186/1471-2164-9-41. PMID: 18221515 Free PMC article.
- Genotyping and inflated type I error rate in genome-wide association case/control studies. BMC Bioinformatics. 2009 Feb 23;10:68. doi: 10.1186/1471-2105-10-68. PMID: 19236714 Free PMC article.
- Evaluation of Bayesian alphabet and GBLUP based on different marker density for genomic prediction in Alpine Merino sheep. G3 (Bethesda). 2021 Oct 19;11(11):jkab206. doi: 10.1093/g3journal/jkab206. PMID: 34849779 Free PMC article.
- High-resolution global genomic survey of 178 gliomas reveals novel regions of copy number alteration and allelic imbalances. Cancer Res. 2006 Oct 1;66(19):9428-36. doi: 10.1158/0008-5472.CAN-06-1691. PMID: 17018597 Free PMC article.