Data quality control in genetic case-control association studies
- PMID: 21085122
- PMCID: PMC3025522
- DOI: 10.1038/nprot.2010.116
Data quality control in genetic case-control association studies
Abstract
This protocol details the steps for data quality assessment and control that are typically carried out during case-control association studies. The steps described involve the identification and removal of DNA samples and markers that introduce bias. These critical steps are paramount to the success of a case-control study and are necessary before statistically testing for association. We describe how to use PLINK, a tool for handling SNP data, to perform assessments of failure rate per individual and per SNP and to assess the degree of relatedness between individuals. We also detail other quality-control procedures, including the use of SMARTPCA software for the identification of ancestral outliers. These platforms were selected because they are user-friendly, widely used and computationally efficient. Steps needed to detect and establish a disease association using case-control data are not discussed here. Issues concerning study design and marker selection in case-control studies have been discussed in our earlier protocols. This protocol, which is routinely used in our labs, should take approximately 8 h to complete.
Figures



Similar articles
-
Basic statistical analysis in genetic case-control studies.Nat Protoc. 2011 Feb;6(2):121-33. doi: 10.1038/nprot.2010.182. Epub 2011 Feb 3. Nat Protoc. 2011. PMID: 21293453 Free PMC article.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.J Vis Exp. 2023 Oct 18;(200). doi: 10.3791/6576. J Vis Exp. 2023. PMID: 37851522
-
Odyssey: a semi-automated pipeline for phasing, imputation, and analysis of genome-wide genetic data.BMC Bioinformatics. 2019 Jun 28;20(1):364. doi: 10.1186/s12859-019-2964-5. BMC Bioinformatics. 2019. PMID: 31253090 Free PMC article.
-
Genotype imputation in genome-wide association studies.Curr Protoc Hum Genet. 2013 Jul;Chapter 1:Unit 1.25. doi: 10.1002/0471142905.hg0125s78. Curr Protoc Hum Genet. 2013. PMID: 23853078 Review.
Cited by
-
Lung expression of genes putatively involved in SARS-CoV-2 infection is modulated in cis by germline variants.Eur J Hum Genet. 2021 Jun;29(6):1019-1026. doi: 10.1038/s41431-021-00831-y. Epub 2021 Mar 1. Eur J Hum Genet. 2021. PMID: 33649539 Free PMC article.
-
Exploring a Region on Chromosome 8p23.1 Displaying Positive Selection Signals in Brazilian Admixed Populations: Additional Insights Into Predisposition to Obesity and Related Disorders.Front Genet. 2021 Mar 25;12:636542. doi: 10.3389/fgene.2021.636542. eCollection 2021. Front Genet. 2021. PMID: 33841501 Free PMC article.
-
Novel susceptibility loci for A(H7N9) infection identified by next generation sequencing and functional analysis.Sci Rep. 2020 Jul 16;10(1):11768. doi: 10.1038/s41598-020-68675-y. Sci Rep. 2020. PMID: 32678187 Free PMC article.
-
Impact of demography and population dynamics on the genetic architecture of human longevity.Aging (Albany NY). 2018 Aug 8;10(8):1947-1963. doi: 10.18632/aging.101515. Aging (Albany NY). 2018. PMID: 30089705 Free PMC article.
-
Polymorphisms in RAS/RAF/MEK/ERK Pathway Are Associated with Gastric Cancer.Genes (Basel). 2018 Dec 28;10(1):20. doi: 10.3390/genes10010020. Genes (Basel). 2018. PMID: 30597917 Free PMC article.
References
-
- Clayton DG, et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet. 2005;37:1243. - PubMed
-
- Marchini J, Howie B, Myers SR, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39:906. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases