Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 15;119(7):e2115865119.
doi: 10.1073/pnas.2115865119.

Metabolomic selection for enhanced fruit flavor

Affiliations

Metabolomic selection for enhanced fruit flavor

Vincent Colantonio et al. Proc Natl Acad Sci U S A. .

Abstract

Although they are staple foods in cuisines globally, many commercial fruit varieties have become progressively less flavorful over time. Due to the cost and difficulty associated with flavor phenotyping, breeding programs have long been challenged in selecting for this complex trait. To address this issue, we leveraged targeted metabolomics of diverse tomato and blueberry accessions and their corresponding consumer panel ratings to create statistical and machine learning models that can predict sensory perceptions of fruit flavor. Using these models, a breeding program can assess flavor ratings for a large number of genotypes, previously limited by the low throughput of consumer sensory panels. The ability to predict consumer ratings of liking, sweet, sour, umami, and flavor intensity was evaluated by a 10-fold cross-validation, and the accuracies of 18 different models were assessed. The prediction accuracies were high for most attributes and ranged from 0.87 for sourness intensity in blueberry using XGBoost to 0.46 for overall liking in tomato using linear regression. Further, the best-performing models were used to infer the flavor compounds (sugars, acids, and volatiles) that contribute most to each flavor attribute. We found that the variance decomposition of overall liking score estimates that 42% and 56% of the variance was explained by volatile organic compounds in tomato and blueberry, respectively. We expect that these models will enable an earlier incorporation of flavor as breeding targets and encourage selection and release of more flavorful fruit varieties.

Keywords: artificial intelligence; flavor; fruit quality.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
(A) Weighted correlation network analysis of tomato metabolites and their assigned clusters based on their known biochemical classification. The size of each metabolite node indicates betweenness centrality, a measure of how often a node exists on the shortest path between other nodes. The thickness of the lines connecting metabolites is scaled relative to the correlation between the metabolites. The identity of each metabolite is denoted by number in the legend. (B) Distribution of metabolite concentrations for each volatile group across the tomato population. Volatile concentrations are reported in nanograms per gram fresh weight per hour (ng/gfw/h) on a log 10 scale.
Fig. 2.
Fig. 2.
(A) Weighted correlation network analysis of blueberry metabolites and their assigned clusters based on their known biochemical classification. The size of each metabolite node indicates betweenness centrality. The thickness of the lines connecting metabolites is scaled relative to the correlation between the metabolites. The identity of each metabolite is denoted by number in the legend. (B) Distribution of metabolite concentrations for each volatile group across the blueberry population. Volatile concentrations are reported in nanograms per gram fresh weight per hour (ng/gfw/h) on a log 10 scale.
Fig. 3.
Fig. 3.
Variation in sensory panel ratings explained by sugars/acids and volatiles overall, and by groups of metabolites of known biochemical classification in tomato and blueberry.
Fig. 4.
Fig. 4.
(A) Accuracy of predicting flavor ratings from metabolome data across a range of statistical and machine learning models for tomato and blueberry. Averages and model rankings are inclusive of umami accuracies depicted in SI Appendix, Fig. S1 and Datasets S4 and S5. (B) Accuracy of predicting perception traits for tomato using 70 individuals with genomic and metabolomic data. (C) Accuracies for tomato flavor prediction with a consistent test set of 39 samples and increasing training set sizes ranging from 50 to 170 samples.
Fig. 5.
Fig. 5.
Relative importance estimated with gradient-boost machine (x axis) and β coefficients estimated with Bayes A (y axis) of each metabolite in predicting ratings of flavor attributes in blueberry and tomato. Colors indicate the type of metabolite. TA, titratable acidity.
Fig. 6.
Fig. 6.
Schematic representation of how the use of metabolomic selection could be applied in earlier stages of a breeding program, when compared to sensory panels.

Comment in

References

    1. Fernqvist F., Hunter E., Who’s to blame for tasteless tomatoes? The effect of tomato chilling on consumers’ taste perceptions. Eur. J. Hortic. Sci. 77, 193–198 (2012).
    1. Klee H. J., Tieman D. M., The genetics of fruit flavour preferences. Nat. Rev. Genet. 19, 347–356 (2018). - PubMed
    1. Bartoshuk L. M., Klee H. J., Better fruits and vegetables through sensory analysis. Curr. Biol. 23, R374–R378 (2013). - PubMed
    1. Tieman D., et al. , The chemical interactions underlying tomato flavor preferences. Curr. Biol. 22, 1035–1039 (2012). - PubMed
    1. Tieman D., et al. , A chemical genetic roadmap to improved tomato flavor. Science 355, 391–394 (2017). - PubMed

LinkOut - more resources