Candidate genes and genetic architecture of symbiotic and agronomic traits revealed by whole-genome, sequence-based association genetics in Medicago truncatula
- PMID: 23741505
- PMCID: PMC3669257
- DOI: 10.1371/journal.pone.0065688
Candidate genes and genetic architecture of symbiotic and agronomic traits revealed by whole-genome, sequence-based association genetics in Medicago truncatula
Abstract
Genome-wide association study (GWAS) has revolutionized the search for the genetic basis of complex traits. To date, GWAS have generally relied on relatively sparse sampling of nucleotide diversity, which is likely to bias results by preferentially sampling high-frequency SNPs not in complete linkage disequilibrium (LD) with causative SNPs. To avoid these limitations we conducted GWAS with >6 million SNPs identified by sequencing the genomes of 226 accessions of the model legume Medicago truncatula. We used these data to identify candidate genes and the genetic architecture underlying phenotypic variation in plant height, trichome density, flowering time, and nodulation. The characteristics of candidate SNPs differed among traits, with candidates for flowering time and trichome density in distinct clusters of high linkage disequilibrium (LD) and the minor allele frequencies (MAF) of candidates underlying variation in flowering time and height significantly greater than MAF of candidates underlying variation in other traits. Candidate SNPs tagged several characterized genes including nodulation related genes SERK2, MtnodGRP3, MtMMPL1, NFP, CaML3, MtnodGRP3A and flowering time gene MtFD as well as uncharacterized genes that become candidates for further molecular characterization. By comparing sequence-based candidates to candidates identified by in silico 250K SNP arrays, we provide an empirical example of how reliance on even high-density reduced representation genomic makers can bias GWAS results. Depending on the trait, only 30-70% of the top 20 in silico array candidates were within 1 kb of sequence-based candidates. Moreover, the sequence-based candidates tagged by array candidates were heavily biased towards common variants; these comparisons underscore the need for caution when interpreting results from GWAS conducted with sparsely covered genomes.
Conflict of interest statement
Figures
References
-
- Smil V (1999) Nitrogen in crop production. Global Biogeochem Cy 13: 647–662.
-
- Cleveland CC, Townsend AR, Schimel DS, Fisher H, Howarth RW, et al. (1999) Global patterns of terrestrial biological nitrogn (N2) fixation in natural ecosystems. Global Biogeochem Cy 13: 623–645.
-
- Oldroyd GE, Downie JA (2004) Calcium, kinases and nodulation signalling in legumes. Nat Rev Mol Cell Biol 5: 566–576. - PubMed
-
- Young ND, Udvardi M (2009) Translating Medicago truncatula genomics to crop legumes. Current Opinion Plant Biology 12: 193–201. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
