A genotype calling algorithm for the Illumina BeadArray platform
- PMID: 17846035
- PMCID: PMC2666488
- DOI: 10.1093/bioinformatics/btm443
A genotype calling algorithm for the Illumina BeadArray platform
Abstract
Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate genotype calling algorithm for the Illumina BeadArray genotyping platforms. As the technology moves towards assaying millions of genetic polymorphisms simultaneously, there is a need for an integrated and easy-to-use software for calling genotypes.
Results: We have introduced a model-based genotype calling algorithm which does not rely on having prior training data or require computationally intensive procedures. The algorithm can assign genotypes to hybridization data from thousands of individuals simultaneously and pools information across multiple individuals to improve the calling. The method can accommodate variations in hybridization intensities which result in dramatic shifts of the position of the genotype clouds by identifying the optimal coordinates to initialize the algorithm. By incorporating the process of perturbation analysis, we can obtain a quality metric measuring the stability of the assigned genotype calls. We show that this quality metric can be used to identify SNPs with low call rates and accuracy.
Availability: The C++ executable for the algorithm described here is available by request from the authors.
Figures




References
-
- Affymetrix Inc BRLMM: an improved genotype calling method for the GenChip Human Mapping 500K Array Set. 2006. http://www.affymetrix.com/support/technical/whitepapers/brlmm_whitepaper....
-
- Bolstad BM, et al. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003;19:185–193. - PubMed
-
- Carvalho B, et al. Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. Biostatistics. 2007;8:485–499. - PubMed
-
- Di X, et al. Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays. Bioinformatics. 2005;21:1958–1963. - PubMed
-
- Gudmundsson J, et al. Genome-wide association study identifies a second prostate cancer susceptibility variantat 8q24. Nat. Genet. 2007;39:631–637. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous