Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 8:15:1340189.
doi: 10.3389/fpls.2024.1340189. eCollection 2024.

The effect of marker types and density on genomic prediction and GWAS of key performance traits in tetraploid potato

Affiliations

The effect of marker types and density on genomic prediction and GWAS of key performance traits in tetraploid potato

Trine Aalborg et al. Front Plant Sci. .

Abstract

Genomic prediction and genome-wide association studies are becoming widely employed in potato key performance trait QTL identifications and to support potato breeding using genomic selection. Elite cultivars are tetraploid and highly heterozygous but also share many common ancestors and generation-spanning inbreeding events, resulting from the clonal propagation of potatoes through seed potatoes. Consequentially, many SNP markers are not in a 1:1 relationship with a single allele variant but shared over several alleles that might exert varying effects on a given trait. The impact of such redundant "diluted" predictors on the statistical models underpinning genome-wide association studies (GWAS) and genomic prediction has scarcely been evaluated despite the potential impact on model accuracy and performance. We evaluated the impact of marker location, marker type, and marker density on the genomic prediction and GWAS of five key performance traits in tetraploid potato (chipping quality, dry matter content, length/width ratio, senescence, and yield). A 762-offspring panel of a diallel cross of 18 elite cultivars was genotyped by sequencing, and markers were annotated according to a reference genome. Genomic prediction models (GBLUP) were trained on four marker subsets [non-synonymous (29,553 SNPs), synonymous (31,229), non-coding (32,388), and a combination], and robustness to marker reduction was investigated. Single-marker regression GWAS was performed for each trait and marker subset. The best cross-validated prediction correlation coefficients of 0.54, 0.75, 0.49, 0.35, and 0.28 were obtained for chipping quality, dry matter content, length/width ratio, senescence, and yield, respectively. The trait prediction abilities were similar across all marker types, with only non-synonymous variants improving yield predictive ability by 16%. Marker reduction response did not depend on marker type but rather on trait. Traits with high predictive abilities, e.g., dry matter content, reached a plateau using fewer markers than traits with intermediate-low correlations, such as yield. The predictions were unbiased across all traits, marker types, and all marker densities >100 SNPs. Our results suggest that using non-synonymous variants does not enhance the performance of genomic prediction of most traits. The major known QTLs were identified by GWAS and were reproducible across exonic and whole-genome variant sets for dry matter content, length/width ratio, and senescence. In contrast, minor QTL detection was marker type dependent.

Keywords: GBLUP; GWAS; Solanum tuberosum; genomic prediction; marker density; marker type; tetraploid potato breeding.

PubMed Disclaimer

Conflict of interest statement

Author HK is currently employed by Arla Foods Denmark. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Heat map of marker density in 1-Mb windows for each chromosome. (A) Non-synonymous variants, (B) non-coding variants, (C) synonymous variants, and (D) combination set of annotated variants. The color gradient denotes marker count.
Figure 2
Figure 2
Distribution of phenotypes in the MASPOT panel. (A) Chipping quality, (B) length/width ratio, (C) senescence, (D) dry matter content, and (E) yield.
Figure 3
Figure 3
Principal component analysis of the genomic relationship matrix of the annotated combination set of 93,170 SNP markers for 755 MASPOT panel clones colored by (A) mother and (B) father. Principal component 1 is plotted against principal component 2 in the top and against principal component 3 in the bottom. The principal components explain 19.62%, 10.8%, and 9.5% of the total explained variance, respectively.
Figure 4
Figure 4
GBLUP prediction correlation coefficient and bias within the MASPOT panel over a number of annotated markers. The markers were reduced from full annotated SNP data sets by iterative reduction to every other marker (every 10th for the final reduction) to avoid the introduction of positional bias. Each reduction was repeated up to four times for each iteration. Color by marker annotation (SNPtype). (A) Prediction correlation coefficient between observed and predicted phenotypic values. Boxplots of correlation coefficients determined for each marker count with connecting lines through the mean. (B) Bias of GEBVs estimated as the slope of the linear regression line between observed and predicted phenotypic values. Boxplots of biases determined for each marker count with connecting lines through the mean bias.
Figure 5
Figure 5
Q–Q plots of expected -log10(p-values), assuming no marker–trait association, against observed (chipping quality and the full combination set in this example). (A) Before correction with the genomic inflation factor and (B) after correction.
Figure 6
Figure 6
Manhattan plots of -log10(p-value) GWAS results of the full 93,170 marker annotated combination set. (A) Chipping quality, (B) length/width ratio, (C) senescence, (D) dry matter content, and (E) yield. Chromosome 0 is pseudomolecules. The significance threshold with Bonferroni correction is indicated with a horizontal line.

Similar articles

Cited by

References

    1. Abera Desta Z., Ortiz R. (2014). Genomic selection: genome-wide prediction in plant improvement. Trends Plant Sci. 19, 592–601. doi: 10.1016/j.tplants.2014.05.006 - DOI - PubMed
    1. Ahmad D., Zhang Z., Rasheed H., Xu X., Bao J. (2022). Recent advances in molecular improvement for potato tuber traits. Int. J. Mol. Sci. 23. doi: 10.3390/ijms23179982 - DOI - PMC - PubMed
    1. Ashraf B. H., Byrne S., Fé D., Czaban A., Asp T., Pedersen M. G., et al. . (2016). Estimating genomic heritabilities at the level of family-pool samples of perennial ryegrass using genotyping-by-sequencing. Theor. Appl. Genet. 129, 45. doi: 10.1007/s00122-015-2607-9 - DOI - PMC - PubMed
    1. Ashraf B. H., Jensen J., Asp T., Janss L. L. (2014). Association studies using family pools of outcrossing crops based on allele-frequency estimates from DNA sequencing. Theor. Appl. Genet. 127, 1331. doi: 10.1007/s00122-014-2300-4 - DOI - PMC - PubMed
    1. Bates D., Mächler M., Bolker B. M., Walker S. C. (2015). Fitting linear mixed-effects models using lme4. J. Stat. Software 67. doi: 10.18637/jss.v067.i01 - DOI

LinkOut - more resources