An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora
- PMID: 21303543
- PMCID: PMC3045888
- DOI: 10.1186/1471-2229-11-30
An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora
Abstract
Background: Coffee is one of the world's most important crops; it is consumed worldwide and plays a significant role in the economy of producing countries. Coffea arabica and C. canephora are responsible for 70 and 30% of commercial production, respectively. C. arabica is an allotetraploid from a recent hybridization of the diploid species, C. canephora and C. eugenioides. C. arabica has lower genetic diversity and results in a higher quality beverage than C. canephora. Research initiatives have been launched to produce genomic and transcriptomic data about Coffea spp. as a strategy to improve breeding efficiency.
Results: Assembling the expressed sequence tags (ESTs) of C. arabica and C. canephora produced by the Brazilian Coffee Genome Project and the Nestlé-Cornell Consortium revealed 32,007 clusters of C. arabica and 16,665 clusters of C. canephora. We detected different GC3 profiles between these species that are related to their genome structure and mating system. BLAST analysis revealed similarities between coffee and grape (Vitis vinifera) genes. Using KA/KS analysis, we identified coffee genes under purifying and positive selection. Protein domain and gene ontology analyses suggested differences between Coffea spp. data, mainly in relation to complex sugar synthases and nucleotide binding proteins. OrthoMCL was used to identify specific and prevalent coffee protein families when compared to five other plant species. Among the interesting families annotated are new cystatins, glycine-rich proteins and RALF-like peptides. Hierarchical clustering was used to independently group C. arabica and C. canephora expression clusters according to expression data extracted from EST libraries, resulting in the identification of differentially expressed genes. Based on these results, we emphasize gene annotation and discuss plant defenses, abiotic stress and cup quality-related functional categories.
Conclusion: We present the first comprehensive genome-wide transcript profile study of C. arabica and C. canephora, which can be freely assessed by the scientific community at http://www.lge.ibi.unicamp.br/coffea. Our data reveal the presence of species-specific/prevalent genes in coffee that may help to explain particular characteristics of these two crops. The identification of differentially expressed transcripts offers a starting point for the correlation between gene expression profiles and Coffea spp. developmental traits, providing valuable insights for coffee breeding and biotechnology, especially concerning sugar metabolism and stress tolerance.
Figures






Similar articles
-
Transcriptome analysis in Coffea eugenioides, an Arabica coffee ancestor, reveals differentially expressed genes in leaves and fruits.Mol Genet Genomics. 2016 Feb;291(1):323-36. doi: 10.1007/s00438-015-1111-x. Epub 2015 Sep 3. Mol Genet Genomics. 2016. PMID: 26334613
-
RBCS1 expression in coffee: Coffea orthologs, Coffea arabica homeologs, and expression variability between genotypes and under drought stress.BMC Plant Biol. 2011 May 16;11:85. doi: 10.1186/1471-2229-11-85. BMC Plant Biol. 2011. PMID: 21575242 Free PMC article.
-
Micro-collinearity and genome evolution in the vicinity of an ethylene receptor gene of cultivated diploid and allotetraploid coffee species (Coffea).Plant J. 2011 Jul;67(2):305-17. doi: 10.1111/j.1365-313X.2011.04590.x. Epub 2011 May 12. Plant J. 2011. PMID: 21457367
-
Advances in genomics for the improvement of quality in coffee.J Sci Food Agric. 2016 Aug;96(10):3300-12. doi: 10.1002/jsfa.7692. Epub 2016 Apr 5. J Sci Food Agric. 2016. PMID: 26919810 Review.
-
An overview on the Brazilian Coffea canephora scenario and the current chemometrics-based spectroscopic research.Food Res Int. 2024 Oct;194:114866. doi: 10.1016/j.foodres.2024.114866. Epub 2024 Aug 3. Food Res Int. 2024. PMID: 39232507 Review.
Cited by
-
Nitrogen starvation, salt and heat stress in coffee (Coffea arabica L.): identification and validation of new genes for qPCR normalization.Mol Biotechnol. 2013 Mar;53(3):315-25. doi: 10.1007/s12033-012-9529-4. Mol Biotechnol. 2013. PMID: 22421886
-
Transcriptome analysis in Coffea eugenioides, an Arabica coffee ancestor, reveals differentially expressed genes in leaves and fruits.Mol Genet Genomics. 2016 Feb;291(1):323-36. doi: 10.1007/s00438-015-1111-x. Epub 2015 Sep 3. Mol Genet Genomics. 2016. PMID: 26334613
-
Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts.Gigascience. 2017 Nov 1;6(11):1-13. doi: 10.1093/gigascience/gix086. Gigascience. 2017. PMID: 29048540 Free PMC article.
-
Small RNA and transcriptome deep sequencing proffers insight into floral gene regulation in Rosa cultivars.BMC Genomics. 2012 Nov 21;13:657. doi: 10.1186/1471-2164-13-657. BMC Genomics. 2012. PMID: 23171001 Free PMC article.
-
The Rhizosphere Microbiomes of Five Species of Coffee Trees.Microbiol Spectr. 2022 Apr 27;10(2):e0044422. doi: 10.1128/spectrum.00444-22. Epub 2022 Mar 15. Microbiol Spectr. 2022. PMID: 35289671 Free PMC article.
References
-
- Pay E. The market for organic and fair-trade coffee. FAO Rome. 2009.
-
- Charrier A, Berthaud J. In: Coffee: botany, biochemistry, and production of beans and beverage. New York. Clifforf MN, Wilsson KC, editor. 1985. Botanical classification of coffee; pp. 13–47.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous