Candidate genes versus genome-wide associations: which are better for detecting genetic susceptibility to infectious disease?

W Amos¹, E Driscoll, J I Hoffman

PMID: 20926441
PMCID: PMC3049081
DOI: 10.1098/rspb.2010.1920

W Amos et al. Proc Biol Sci. 2011 Apr 22;278(1709):1183-8. doi: 10.1098/rspb.2010.1920. Epub 2010 Oct 6.

Affiliation

¹ Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK. w.amos@zoo.cam.ac.uk

Abstract

Technological developments allow increasing numbers of markers to be deployed in case-control studies searching for genetic factors that influence disease susceptibility. However, with vast numbers of markers, true 'hits' may become lost in a sea of false positives. This problem may be particularly acute for infectious diseases, where the control group may contain unexposed individuals with susceptible genotypes. To explore this effect, we used a series of stochastic simulations to model a scenario based loosely on bovine tuberculosis. We find that a candidate gene approach tends to have greater statistical power than genome-wide association tests that use large numbers of single nucleotide polymorphisms (SNPs), almost regardless of the number of SNPs deployed. With modest sample sizes (250 each of cases and controls), both approaches struggle to detect genetic effects that are weak, or when an appreciable proportion of individuals are unexposed to the disease; these issues are largely mitigated if sample sizes can be increased to 2000 or more of each class. We conclude that the power of any genotype-phenotype association test will be improved if the sampling strategy takes account of exposure heterogeneity, though this is not necessarily easy to do.
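The interaction between exposure, effect size, sample size and the number of markers tested can be illustrated with a toy simulation. The sketch below is not the authors' simulation code: it assumes a single functional variant typed directly (no recombination), a simple carrier/non-carrier resistance model, and a Bonferroni correction as the experiment-wide threshold. The names `estimate_power`, `chi2_p_value_2x2` and `carrier_freq` (set to 0.3) are hypothetical choices made for illustration only. As in the figure legends, an exposed susceptible individual is assumed to catch the disease with probability 1, and an exposed resistant carrier with probability `genetic_effect`.

```python
import math
import random


def chi2_p_value_2x2(a, b, c, d):
    """P-value of a 1-d.f. chi-square test on the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        return 1.0  # a degenerate table carries no evidence of association
    chi2 = n * (a * d - b * c) ** 2 / margins
    # Survival function of chi-square with 1 d.f.: P(Z^2 > x) = erfc(sqrt(x / 2))
    return math.erfc(math.sqrt(chi2 / 2.0))


def estimate_power(exposure, genetic_effect, n_per_class, n_markers,
                   carrier_freq=0.3, alpha=0.05, replicates=100, seed=1):
    """Fraction of replicate case-control studies reaching significance.

    Assumptions (illustrative, not taken from the paper): the functional
    variant itself is typed, carriers are resistant, and the experiment-wide
    threshold is a Bonferroni correction, alpha / n_markers.
    """
    if not 0.0 < exposure < 1.0:
        raise ValueError("exposure must lie strictly between 0 and 1")
    rng = random.Random(seed)

    # Probability of becoming a case, by genotype class.
    p_case_carrier = exposure * genetic_effect  # exposed and resistant
    p_case_suscept = exposure                   # exposed susceptibles: p = 1

    # Carrier frequency among cases and among controls (Bayes' rule).
    # Controls mix resistant individuals with unexposed susceptible ones.
    case_c = carrier_freq * p_case_carrier
    case_s = (1.0 - carrier_freq) * p_case_suscept
    ctrl_c = carrier_freq * (1.0 - p_case_carrier)
    ctrl_s = (1.0 - carrier_freq) * (1.0 - p_case_suscept)
    carrier_in_cases = case_c / (case_c + case_s)
    carrier_in_controls = ctrl_c / (ctrl_c + ctrl_s)

    threshold = alpha / n_markers  # Bonferroni correction (an assumption)
    hits = 0
    for _ in range(replicates):
        a = sum(rng.random() < carrier_in_cases for _ in range(n_per_class))
        c = sum(rng.random() < carrier_in_controls for _ in range(n_per_class))
        p = chi2_p_value_2x2(a, n_per_class - a, c, n_per_class - c)
        hits += p < threshold
    return hits / replicates


if __name__ == "__main__":
    # Same data, one candidate marker versus a 50 000-marker panel.
    for markers in (1, 50_000):
        power = estimate_power(exposure=0.1, genetic_effect=0.5,
                               n_per_class=250, n_markers=markers)
        print(f"markers={markers}: estimated power={power:.2f}")
```

Because both calls use the same seed, they evaluate identical simulated datasets, so any difference in estimated power comes only from the stricter per-marker threshold. Lowering `exposure` pushes the carrier frequency in controls towards the population value, which is the dilution effect the abstract describes.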

Figures

**Figure 1.**
How the probability of finding a significant genotype–phenotype association for an infectious disease varies with marker type, exposure rate, recombination rate and size of genetic effect. Each cell contains the proportion of approximately 100 replicate simulations that yielded an experiment-wide significant association at α = 5%. Greyscale represents: white = 0–5% significant; light grey = 5–50% significant; dark grey = 50–90% significant; black = 90–100% significant. Exposure is the proportion of individuals exposed to the disease; genetic effect is the probability of an exposed, genetically resistant individual catching the disease relative to an exposed, susceptible individual (p = 1). (a) ‘SNPs’ (= marker carries three genotypes), recombination rate = 10⁻³; (b) ‘SNPs’, recombination rate = 10⁻⁵; (c) ‘microsatellites’ (= marker carries 10+ genotypes), recombination rate = 10⁻³; (d) ‘microsatellites’, recombination rate = 10⁻⁵.

**Figure 2.**
How the probability of finding a significant genotype–phenotype association for an infectious disease varies with exposure rate and size of genetic effect. (a) Results for a panel of 50 000 SNPs that includes the functional mutation. (b) Results for a candidate gene (CG) approach using a sample size of 4000 (equal numbers of cases and controls). For details of simulations and greyscale, see legend of figure 1.

**Figure 3.**
Effectiveness of a genome-wide association (GWA) study to reveal a genotype–phenotype association with sample sizes of 500 and 4000. In each set of simulations, we assume a large, SNP-based study deploying either 50 000 or two million markers. The probability of finding an association is calculated by combining the probability of one marker lying close enough to the functional gene for the recombination rate to be 10⁻⁵ with the probability that the functional mutation itself is included in the panel. For other details, see legend of figure 1. (a) 50 000 SNPs, sample size = 500 (250 cases plus 250 controls); (b) 50 000 SNPs, sample size = 4000; (c) two million SNPs, sample size = 500; (d) two million SNPs, sample size = 4000.
