Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar;18(1):e20560.
doi: 10.1002/tpg2.20560.

Combining genome-wide association and genomic prediction to unravel the genetic architecture of carotenoid accumulation in carrot

Affiliations

Combining genome-wide association and genomic prediction to unravel the genetic architecture of carotenoid accumulation in carrot

William R Rolling et al. Plant Genome. 2025 Mar.

Abstract

Carrots (Daucus carota L.) are a rich source of provitamin A, namely, α- and β-carotene. Breeding programs prioritize increasing β-carotene content for improved color and nutrition. Understanding the genetic basis of carotenoid accumulation is crucial for implementing genomic-assisted selection to develop high-carotenoid lines. While previous studies identified loci (Y2, Y, Or, and REC) associated with carrot color and carotenoid content, this study employed genome-wide association (GWA) in a diverse panel of 738 carrot accessions. We discovered a novel locus with a candidate gene encoding phytoene synthase, a key enzyme in carotenoid biosynthesis. The Y2, Y, Or, and REC loci are mostly fixed in orange varieties, yet considerable variation in carotenoid concentration persists. This suggests a multigenic trait influenced by the environment. GWA of carotenoid concentration identified a quantitative trait locus for total carotenoids and α-carotene. We explored the accuracy of genomic prediction (GP) models to predict carotenoid concentration. We determined the optimal number of plants and plots required for accurate carotenoid phenotyping, finding ≥5 plants per plot and three plots per site as the minimum effective sample per accession. GP models achieved accuracies ranging from 0.06 to 0.40 depending on the carotenoid measured and environment the carrots were assayed. Additional studies in breeding programs will clarify the potential of genomic-assisted selection for high-carotenoid carrots.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Genome‐wide association (GWA) mapping loci associated with orange carrot color. The Manhattan plot depicts −log10(p‐value) for single nucleotide polymorphism (SNP) markers across the carrot genome. The three loci identified in this study include the REC, PSY, and Y2 loci.
FIGURE 2
FIGURE 2
Genome‐wide association mapping of α‐carotene and total carotenoid content in carrot. (A) Manhattan plots illustrating −log10 (p‐value) for single nucleotide polymorphisms (SNPs) associated with total carotenoid (triangle) and α‐carotene (circle) content with combined data from WI_2018_605PI and CA_2019_730PI trials (n = 603). (B, C) Boxplots comparing total carotenoid (B) and α‐carotene (C) concentrations (µg/g dry tissue) between genotypes at a significant locus (p < 0.05).
FIGURE 3
FIGURE 3
Influence of marker density and model choice on genomic prediction accuracy. (A) Comparison of prediction accuracy for different marker densities (82,625 to 81 markers) when averaged across five genomic prediction models. Both the WI_2018_605PI (gray) and CA_2019_730PI (orange) were tested for all combinations of genomic prediction model and marker number. (B) Average prediction accuracy and standard error across various traits using a fixed marker density of 2582 markers. PI, plant introduction. BGLR, Bayesian generalized linear regression; BRR, Bayesian ridge regression.
FIGURE 4
FIGURE 4
Average prediction accuracy (correlation of genomic estimated breeding values [GEBVs] and observed phenotypes) when the Bayesian ridge regression model was applied using 2582 markers in the CA_2019_730PI (A) and WI_2018_605PI (B) trials. Additions to models included adding population structure and the color of the core of a carrot. Each carotenoid and the total concentration of carotenoids were tested for all combinations of input data. PI, plant introduction.
FIGURE 5
FIGURE 5
Variation in carotenoid measurements depending on location, sample size, and plots per inbred. (A) Boxplots describing β‐carotene concentrations in carrots grown in Wisconsin (WI) in 2022 and California (CA) in 2022 or 2023. Significant differences within a line between location or year (HSD.test, p‐value < 0.05) are indicated by different colored plots. (B) Boxplots describing α‐carotene concentrations in carrots grown in Wisconsin in 2022 and California in 2022 or 2023. Significant differences between location or year (HSD.test, p‐value < 0.05) are indicated by different colored plots. (C) Percentage of the largest confidence interval (n = 2) as more samples were added. Sampling without (square shape) and with replacement (circle) was tested. (D) Confidence interval width for estimated average carotenoid concentration based on different numbers of plots per phenotype (n = 10 samples/plot). (E) β‐Carotene concentration variability among plots of inbred Ns5154 grown in California in 2022. The average concentration is represented by a gray square, and individual samples by black dots.

Similar articles

Cited by

References

    1. Alexander, D. H. , & Lange, K. (2011). Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics, 12, Article 246. 10.1186/1471-2105-12-246 - DOI - PMC - PubMed
    1. Banga, O. (1957). Effect of some environmental factors on the carotene content of carrots. IVT, 92, 796–805. - PubMed
    1. Banga, O. , & De Bruyn, J. W. (1956). Selection of carrots for carotene content: III Planting distances and ripening equilibrium of the roots. Euphytica, 5, 87–93. 10.1007/BF00021287 - DOI
    1. Barrett, J. C. , Fry, B. , Maller, J. , & Daly, M. J. (2005). Haploview: Analysis and visualization of LD and haplotype maps. Bioinformatics, 21, 263–265. 10.1093/bioinformatics/bth457 - DOI - PubMed
    1. Bernardo, R. (2008). Molecular markers and selection for complex traits in plants: Learning from the last 20 years. Crop Science, 48, 1649–1664. 10.2135/cropsci2008.03.0131 - DOI

MeSH terms

Substances

LinkOut - more resources