Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Jun 5:11:447.
doi: 10.3389/fgene.2020.00447. eCollection 2020.

Recommendations for Choosing the Genotyping Method and Best Practices for Quality Control in Crop Genome-Wide Association Studies

Affiliations
Review

Recommendations for Choosing the Genotyping Method and Best Practices for Quality Control in Crop Genome-Wide Association Studies

Stefano Pavan et al. Front Genet. .

Abstract

High-throughput genotyping boosts genome-wide association studies (GWAS) in crop species, leading to the identification of single-nucleotide polymorphisms (SNPs) associated with economically important traits. Choosing a cost-effective genotyping method for crop GWAS requires careful examination of several aspects, namely, the purpose and the scale of the study, crop-specific genomic features, and technical and economic matters associated with each genotyping option. Once genotypic data have been obtained, quality control (QC) procedures must be applied to avoid bias and false signals in genotype-phenotype association tests. QC for human GWAS has been extensively reviewed; however, QC for crop GWAS may require different actions, depending on the GWAS population type. Here, we review most popular genotyping methods based on next-generation sequencing (NGS) and array hybridization, and report observations that should guide the investigator in the choice of the genotyping method for crop GWAS. We provide recommendations to perform QC in crop species, and deliver an overview of bioinformatics tools that can be used to accomplish all needed tasks. Overall, this work aims to provide guidelines to harmonize those procedures leading to SNP datasets ready for crop GWAS.

Keywords: GWAS; bioinformatics tools; crops; genotyping; quality control.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Overview of quality control procedures for crop GWAS. These include: filtering steps that are common to any GWAS experiment; filtering steps depending on the GWAS population structure (homozygous or heterozygous); the removal of duplicated samples; the characterization of ancestral relationships, starting from a SNP dataset pruned for markers in linkage disequilibrium.
FIGURE 2
FIGURE 2
Frequency distribution analysis to define filtering solutions for (A) SNP call rate; (B) genotype call rate; (C) SNP inbreeding coefficient (FIT); (D) SNP proportion of heterozygosity. Dashed lines indicate possible filtering thresholds, based on classes occurring at suspiciously low (A,B) or high (D) frequency, and distribution gaps (C). Genotypic data used to build histograms are all relative to published genotyping-by-sequencing experiments, carried out in the self-pollinated crops Cicer arietinum L. (Pavan et al., 2017, A,C) and Lens culinaris Medik (Pavan et al., 2019, B), and the open-pollinated crop Cynara cardunculus L. (Pavan et al., 2018, D).

Similar articles

Cited by

References

    1. Alexander D. H., Novembre J., Lange K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19 1655–1664. 10.1101/gr.094052.109 - DOI - PMC - PubMed
    1. Allen A. M., Winfield M. O., Burridge A. J., Downie R. C., Benbow H. R., Barker G. L. A., et al. (2017). Characterization of a wheat breeders’ array suitable for high-throughput SNP genotyping of global accessions of hexaploid bread wheat (Triticum aestivum). Plant Biotechnol. J. 15 390–401. 10.1111/pbi.12635 - DOI - PMC - PubMed
    1. Anderson C. A., Pettersson F. H., Clarke G. M., Cardon L. R., Morris A. P., Zondervan K. T. (2010). Data quality control in genetic case-control association studies. Nat. Protoc. 5 1564–1573. 10.1038/nprot.2010.116 - DOI - PMC - PubMed
    1. Ashrafi H., Hill T., Stoffel K., Kozik A., Yao J., Chin-Wo S. R., et al. (2012). De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes. BMC Genomics 13:571. 10.1186/1471-2164-13-571 - DOI - PMC - PubMed
    1. Astle W., Balding D. J. (2009). Population structure and cryptic relatedness in genetic association studies. Stat. Sci. 24 451–471. 10.1214/09-STS307 - DOI