Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 6:13:1069443.
doi: 10.3389/fmicb.2022.1069443. eCollection 2022.

Mining novel cis-regulatory elements from the emergent host Rhodosporidium toruloides using transcriptomic data

Affiliations

Mining novel cis-regulatory elements from the emergent host Rhodosporidium toruloides using transcriptomic data

Luísa Czamanski Nora et al. Front Microbiol. .

Abstract

The demand for robust microbial cell factories that produce valuable biomaterials while resisting stresses imposed by current bioprocesses is rapidly growing. Rhodosporidium toruloides is an emerging host that presents desirable features for bioproduction, since it can grow in a wide range of substrates and tolerate a variety of toxic compounds. To explore R. toruloides suitability for application as a cell factory in biorefineries, we sought to understand the transcriptional responses of this yeast when growing under experimental settings that simulated those used in biofuels-related industries. Thus, we performed RNA sequencing of the oleaginous, carotenogenic yeast in different contexts. The first ones were stress-related: two conditions of high temperature (37 and 42°C) and two ethanol concentrations (2 and 4%), while the other used the inexpensive and abundant sugarcane juice as substrate. Differential expression and functional analysis were implemented using transcriptomic data to select differentially expressed genes and enriched pathways from each set-up. A reproducible bioinformatics workflow was developed for mining new regulatory elements. We then predicted, for the first time in this yeast, binding motifs for several transcription factors, including HAC1, ARG80, RPN4, ADR1, and DAL81. Most putative transcription factors uncovered here were involved in stress responses and found in the yeast genome. Our method for motif discovery provides a new realm of possibilities in studying gene regulatory networks, not only for the emerging host R. toruloides, but for other organisms of biotechnological importance.

Keywords: RNA sequencing; biorefineries; cell factory; industrial stress; motifs; sugarcane juice; transcription factor.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Workflow for the discovery of cis-regulatory elements using transcriptomic data.
Figure 2
Figure 2
Venn diagram of differentially expressed genes (DEGs) for all conditions. The Venn diagram shows how many DEGs are shared between all conditions and how many are exclusive for each condition. Numbers outside of the Venn diagram represent the total of DEGs when compared to each respective control.
Figure 3
Figure 3
Enriched kyoto encyclopedia of genes and genomes (KEGG) pathways for Rhodosporidium toruloides grown on sugarcane juice (SCJ). Bubble map showing the biochemical pathways of R. toruloides noted by KEGG enriched in the SCJ condition, as obtained by the GAGE package. Pathways that have an enrichment value greater than 0 are up-regulated, while those that have a value less than 0 are down-regulated. Blue scale inside the bubbles represents the decreasing value of ps. The different sizes of the bubbles define the approximate number of DEGs in each biochemical pathway.
Figure 4
Figure 4
Enriched kyoto encyclopedia of genes and genomes (KEGG) pathways for R. toruloides grown at 42°C. Bubble map showing the biochemical pathways of R. toruloides noted by KEGG that are enriched in the 42°C condition, as obtained by the GAGE package. Pathways that have an enrichment value greater than 0 are up-regulated, while those that have a value less than 0 are down-regulated. Blue scale inside the bubbles represents the decreasing value of ps. The different sizes of the bubbles define the approximate number of DEGs in each biochemical pathway.
Figure 5
Figure 5
Eukaryotic orthologous groups (KOG) selected for TFBS finding. Percentage of DEGs compared to the total number of genes in the R. toruloides genome for each condition are shown as annotated using KOG. These are the most relevant conditions with the greatest amount of DEGs and were selected for further analysis. The remaining KOG annotations are shown in Supplementary Figure S5.
Figure 6
Figure 6
Examples of putative motifs aligned with transcription factors motifs from the JASPAR database using TOMTOM. (A) Top motif is the binding motif for ARG81 from Saccharomyces cerevisiae. Bottom motif is from SCJ condition. Eukaryotic orthologous groups (KOG) class: Amino acid transport and metabolism. Value of p: 1.08e–03. E-value: 1.51e + 00 (B) Top motif for AGR80 from S. cerevisiae. Bottom motif is from 42°C condition. KOG class: Amino acid transport and metabolism. Value of p: 4.98e–03. E-value: 8.76e–01. (C) Top motif for DAL80 from S. cerevisiae. Bottom motif is from ethanol 2% condition. KOG class: Amino acid transport and metabolism. value of p: 3.38e–02. E-value: 5.95e + 00. (D) Top for RPN4 from S. cerevisiae. Bottom motif is from 42°C condition. KOG class: Chromatin structure and dynamics. value of p: 4.15e–03. E-value: 7.31e–01. (E) Top motif for MAC1 from S. cerevisiae. Bottom motif is from ethanol 2% condition. KOG class: Secondary metabolites biosynthesis transport and catabolism. Value of p: 4.15e–03. E-value: 7.30e–01. (F) Top motif for HCM1 from S. cerevisiae. Bottom motif is from SCJ condition. KOG class: Chromatin structure and dynamics. Value of p: 3.44e–02. E-value: 6.05e + 00.
Figure 7
Figure 7
Graph of putative transcription factors and their putative binding sites. Graph showing interaction between putative transcription factors for R. toruloides with the transcription factors (TF) binding motifs predicted using HOMER. Green nodes represent the sugarcane juice condition, orange nodes represent the 42°C condition, dark green nodes are the 2% ethanol, and blue nodes are the 4% ethanol condition. Written inside nodes are the number of the TFBS and the name of the gene class annotated by eukaryotic orthologous groups (KOG) in which that motif was found. Pink nodes are the putative transcription FTs found using TOMTOM.

Similar articles

Cited by

References

    1. Abdel-Banat B. M. A., Hoshida H., Ano A., Nonklang S., Akada R. (2010). High-temperature fermentation: how can processes for ethanol production at high temperatures become superior to the traditional process using mesophilic yeast? Appl. Microbiol. Biotechnol. 85, 861–867. doi: 10.1007/s00253-009-2248-5, PMID: - DOI - PubMed
    1. Ambrosini G., Vorontsov I., Penzar D., Groux R., Fornes O., Nikolaeva D. D., et al. . (2020). Insights gained from a comprehensive all-against-all transcription factor binding motif benchmarking study. Genome Biol. 21:114. doi: 10.1186/s13059-020-01996-3, PMID: - DOI - PMC - PubMed
    1. Andrews S. (2018). FastQC: a quality control tool for high throughput sequence data, FastQC: a quality control tool for high throughput sequence data. Available at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (Accessed October 10, 2022).
    1. Antonieto A. C. C., Nogueira K. M. V., de Paula R. G., Nora L. C., Cassiano M. H. A., Guazzaroni M. E., et al. . (2019). A novel Cys2His2 zinc finger homolog of AZF1 modulates Holocellulase expression in Trichoderma reesei. mSystems 4, 1–14. doi: 10.1128/mSystems.00161-19, PMID: - DOI - PMC - PubMed
    1. Bastian M., Heymann S., Jacomy M. (2009). ‘Gephi: an open source software for exploring and manipulating networks’. In Proceedings of the Third International ICWSM Conference, pp. 361–362.

LinkOut - more resources