Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 14:14:1272966.
doi: 10.3389/fpls.2023.1272966. eCollection 2023.

Reference genome of the nutrition-rich orphan crop chia (Salvia hispanica) and its implications for future breeding

Affiliations

Reference genome of the nutrition-rich orphan crop chia (Salvia hispanica) and its implications for future breeding

Parul Gupta et al. Front Plant Sci. .

Abstract

Chia (Salvia hispanica L.) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucleotide polymorphisms (SNPs) and 15,380 simple sequence repeat (SSR) marker sites. By investigating the chia transcriptome, we discovered that ~44% of the genes undergo alternative splicing with a higher frequency of intron retention events. Additionally, we identified chia genes associated with important nutrient content and quality traits, such as the biosynthesis of PUFAs and seed mucilage fiber (dietary fiber) polysaccharides. Notably, this is the first report of in-silico annotation of a plant genome for protein-derived small bioactive peptides (biopeptides) associated with improving human health. To facilitate further research and translational applications of this valuable orphan crop, we have developed the Salvia genomics database (SalviaGDB), accessible at https://salviagdb.org.

Keywords: Salvia hispanica; biopeptide; chia; lectins; polyunsaturated fatty acid (PUFA); reference genome; seed mucilage; terpene synthase.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Figures

Figure 1
Figure 1
Overview of the chia (Salvia hispanica) genome assembly and the annotated features.
Figure 2
Figure 2
Synteny analysis of the chia genome. (A) Chromosome wise colinear genes shared between chia, Arabidopsis and tomato; (B) Chromosome wise collinear gene location; (C) The ratio of non-synonymous (Kn) to synonymous (KS) substitutions per site between syntenic orthologues.
Figure 3
Figure 3
Chia gene family and phylogenetic analysis. (A) Inparanoid-based gene family analysis; (B) Gene family member gain/loss analysis. Gains (blue), conserved (orange), and loss (Grey); (C) Species wise gene distribution in the most significant families.
Figure 4
Figure 4
Chia gene family and expression analysis. (A) Lectin gene family tree with gene expression profiles; (B) terpene synthase (TPS) gene family tree with gene expression profile data.
Figure 5
Figure 5
Chia genes involved in PUFA biosynthesis pathway and their expression across different developmental stages. (A) Biosynthesis of omega-3 and omega-6 fatty acids. LA, Linoleic acid; ALA, α-linolenic acid; GLA, γ‐linolenic acid; SDA, Stearidonic acid; ETA, Eicosatetraenoic acid; DGLA, dihomo-γ-linolenic acid; DHA, Docosahexaenoic acid; DPA, Docosapentaenoic acid. (B) expression of genes involved in omega-3 and omega-6 fatty acids biosynthesis; (C) expression of oleosin, caleosin, and steroleosin genes. The reactions with dashed arrows represent multiple steps. The gene IDs marked with an asterisk (*) represent tandemly duplicated genes. Gene IDs and names in green text are potentially targeted to chloroplast.
Figure 6
Figure 6
Biosynthesis pathways and expression analysis of Chia genes involved in biosynthesis of seed mucilage polysaccharide monomer units. (A) Glucuronate biosynthesis; (B) D-Galacturonate biosynthesis; (C) D-Mannose biosynthesis; (D) D-Xylose biosynthesis; (E) L-Arabinose biosynthesis. The gene IDs marked with an asterisk (*) represent tandemly duplicated genes.

Similar articles

Cited by

References

    1. Aguilar-Toalá J. E., Liceaga A. M. (2020). Identification of chia seed (Salvia hispanica L.) peptides with enzyme inhibition activity towards skin-aging enzymes. Amino Acids 52, 1149–1159. doi: 10.1007/s00726-020-02879-4 - DOI - PubMed
    1. Aguilar-Toalá J. E., Vidal-Limon A., Liceaga A. M. (2022). Multifunctional analysis of chia seed (Salvia hispanica L.) bioactive peptides using peptidomics and molecular dynamics simulations approaches. Int. J. Mol. Sci. 23, 7288. - PMC - PubMed
    1. ALBERTO C. M., SANSO A. M., XIFREDA C. C. (2003). Chromosomal studies in species of Salvia (Lamiaceae) from Argentina. Bot. J. Linn. Soc. 141, 483–490. doi: 10.1046/j.1095-8339.2003.t01-1-00178.x - DOI
    1. Alejo-Jacuinde G., Nájera-González H. R., Chávez Montes R. A., Gutierrez Reyes C. D., Barragán-Rosillo A. C., Perez Sanchez B., et al. . (2023). Multi-omic analyses reveal the unique properties of chia (Salvia hispanica) seed metabolism. Commun. Biol. 6, 1–13. doi: 10.1038/s42003-023-05192-4 - DOI - PMC - PubMed
    1. Alfredo V.-O., Gabriel R.-R., Luis C.-G., David B.-A. (2009). Physicochemical properties of a fibrous fraction from chia (Salvia hispanica L.). LWT Food Sci. Technol. 42, 168–173. doi: 10.1016/j.lwt.2008.05.012 - DOI