Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2021 Jul 14;218(3):iyab087.
doi: 10.1093/genetics/iyab087.

Meta-analysis identifies pleiotropic loci controlling phenotypic trade-offs in sorghum

Affiliations
Meta-Analysis

Meta-analysis identifies pleiotropic loci controlling phenotypic trade-offs in sorghum

Ravi V Mural et al. Genetics. .

Abstract

Community association populations are composed of phenotypically and genetically diverse accessions. Once these populations are genotyped, the resulting marker data can be reused by different groups investigating the genetic basis of different traits. Because the same genotypes are observed and scored for a wide range of traits in different environments, these populations represent a unique resource to investigate pleiotropy. Here, we assembled a set of 234 separate trait datasets for the Sorghum Association Panel, a group of 406 sorghum genotypes widely employed by the sorghum genetics community. Comparison of genome-wide association studies (GWAS) conducted with two independently generated marker sets for this population demonstrate that existing genetic marker sets do not saturate the genome and likely capture only 35-43% of potentially detectable loci controlling variation for traits scored in this population. While limited evidence for pleiotropy was apparent in cross-GWAS comparisons, a multivariate adaptive shrinkage approach recovered both known pleiotropic effects of existing loci and new pleiotropic effects, particularly significant impacts of known dwarfing genes on root architecture. In addition, we identified new loci with pleiotropic effects consistent with known trade-offs in sorghum development. These results demonstrate the potential for mining existing trait datasets from widely used community association populations to enable new discoveries from existing trait datasets as new, denser genetic marker datasets are generated for existing community association populations.

Keywords: GWAS; community genetic resources; pleiotropy; quantitative genetics; sorghum.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Characteristics of Sorghum Association Panel trait datasets. (A) Geographic distribution of trials where trait datasets were collected. Size of circles indicates number of traits collected at a specific geographic location. Colors of circles indicate types of trait datasets collected at that location. Labels for which colors correspond to which types of traits are given in Panel (B). A set of 30 traits scored in Nova Porteirinha, Minas Gerais, Brazil (Queiroz et al. 2015; Paiva et al. 2017) are not visible in this panel. (B) Representation of seven broad phenotypic categories among the 234 traits collected here. Category assignments for individual traits are provided in Supplementary Table S1. (C) Distributions of narrow sense heritability values, calculated using the 2020 genetic marker dataset (Miao et al. 2020a), across the same seven broad phenotypic categories are shown in panel (B).
Figure 2.
Figure 2.
Combined Manhattan plots comparing MTAs identified using different marker datasets for the same individuals. (A) Combined Manhattan plot for 36 traits with at least one significant GWAS hit when analyzed using the 2013 genetic marker dataset and considering data from only those 304 sorghum lines genotyped in both the 2013 and 2020 datasets. Green lines topped with circles indicate the physical position and -log10P-value for the single most significant SNP within a GWAS peak identified for a particular trait. Text labels for individual traits employ trait names provided in Supplementary Table S1. Dashed red line indicates the cutoff for statistical significance calculated from the effective SNP number in the 2013 genetic marker dataset. (B) Combined Manhattan plot for 40 traits with at least one significant GWAS hit when analyzed using the 2020 genetic marker dataset and considering data from only those 304 sorghum lines genotyped in both the 2013 and 2020 datasets. Locations and P-values of the most significant SNP within each peak and statistical significance cutoff labeled as above. Blue labels indicate peaks shared between datasets. Red labels indicate traits where at least one significant GWAS peak is identified in both datasets but none of the peaks are shared between datasets. Black labels indicate traits where one or more significant GWAS peaks were identified with one of the marker datasets but no significant GWAS peaks were identified when the other marker dataset was employed. (C) Relationship between the identification of one or more significant GWAS peaks for a given trait dataset in each of the two genetic marker datasets. (D) Number of GWAS peaks which were either identified using both or only one of the two genetic marker datasets tested.
Figure 3.
Figure 3.
Combined Manhattan plot for GWAS using all 343 individuals genotyped in the 2020 SNP set. (A) Combined Manhattan plot for 43 traits with at least one significant GWAS hit when analyzed using the 2020 genetic marker dataset and all 343 sorghum lines genotyped in the 2020 genetic marker dataset. Green lines topped with circles indicate the physical position and -log10P-value for the single most significant SNP within a GWAS peak identified for a given trait. Text labels employ trait names provided in Supplementary Table S1. Dashed red line indicates the cutoff for statistical significance calculated from the effective SNP number in the 2020 genetic marker dataset. Lower panel indicate positions of a set of cloned sorghum mutants, taken from (Boyles et al. 2019). Estimates of LD among summit SNPs of each peak are shown in Supplementary Figure S5A. (B) Summary of results from GWAS analysis using all 343 SAP lines included in the 2020 marker dataset. (C) Number of traits where one or more significant GWAS peaks were identified in the 2013 dataset considering only accessions shared with the 2020 dataset, the 2020 dataset considering only accessions shared with the 2013 dataset, and/or all accessions in the 2020 dataset.
Figure 4.
Figure 4.
Pleiotropic analysis of SAP phenotypes. (A) Markers assigned significant Bayes factor values in MashR analysis. Green lines topped with circles indicate the physical position and log10 Bayes factor for the most significant SNP within a peak identified for a pleiotropic loci. Text labels indicate the position and name of the most significant marker within each peak. The number of trait datasets significantly associated with a marker at lfsr <0.001 is indicated in brackets. It should be noted that trait datasets include both measurements of different traits and the same trait scored across different environments in different studies. Dashed red line indicates the cutoff for statistical significance at log10 Bayes factor of 4. Estimates of linkage disequilibrium among the summit SNPs of each distinct peak are shown in Supplementary Figure S5B. (B) Distribution of the number of trait datasets which were significantly associated with each unique peak. (C) Distribution of effect sizes and directions of a subset of the 60 trait datasets for which the genetic marker S04_6082617 has a significant effect (lfsr < 0.001). To aid readability, only the subset of trait datasets where the effect size is >0.05 or <0.05 are shown. Bar thickness is proportional to the relative estimated statistical significance of each association with the thickest bars marking the most significantly associated trait for a given marker and the thinnest the least significantly associated trait for a given marker which still passed all filtering criteria. (D) Distribution of effect sizes and directions for a subset of the 87 trait datasets for which the genetic marker S06_60660773 has significant effects. Cutoffs for visualization are the same as applied for panel C. (E) Distribution of effect sizes and directions for a subset of the 90 trait datasets for which the genetic marker (S08_55231823) has significant effects. Cutoffs for visualization are the same as applied for panels C and D.

References

    1. Adeyanju A, Little C, Yu J, Tesso T.. 2015. Genome-wide association study on resistance to stalk rot diseases in grain sorghum. G3 (Bethesda). 5:1165–1175. - PMC - PubMed
    1. Atwell S, Huang YS, Vilhjálmsson BJ, Willems G, Horton M, et al. 2010. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 465:627–631. - PMC - PubMed
    1. Bailey D. 2004. Recombinant inbred strains and bilineal congenic strains. In: Foster HL, Small JD, Fox JG, editors. The Mouse in Biomedical Research. New York: Academic Press. p. 223–239.
    1. Bates D, Mächler M, Bolker B, Walker S.. 2015. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67: 1–48. doi:http://dx.doi.org/10.18637/jss.v067.i01.
    1. Bellis ES, Kelly EA, Lorts CM, Gao H, DeLeo VL, et al. 2020. Genomics of sorghum local adaptation to a parasitic plant. Proc Natl Acad Sci USA. 117:4243–4251. - PMC - PubMed

Publication types