Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 10;22(1):399.
doi: 10.1186/s12859-021-04312-3.

From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon

Affiliations

From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon

Jorge Oliveira et al. BMC Bioinformatics. .

Abstract

Numerous genomes are sequenced and made available to the community through the NCBI portal. However, and, unlike what happens for gene function annotation, annotation of promoter sequences and the underlying prediction of regulatory associations is mostly unavailable, severely limiting the ability to interpret genome sequences in a functional genomics perspective. Here we present an approach where one can download a genome of interest from NCBI in the GenBank Flat File (.gbff) format and, with a minimum set of commands, have all the information parsed, organized and made available through the platform web interface. Also, the new genomes are compared with a given genome of reference in search of homologous genes, shared regulatory elements and predicted transcription associations. We present this approach within the context of Community YEASTRACT of the YEASTRACT + portal, thus benefiting from immediate access to all the comparative genomics queries offered in the YEASTRACT + portal. Besides the yeast community, other communities can install the platform independently, without any constraints. In this work, we exemplify the usefulness of the presented tool, within Community YEASTRACT, in constructing a dedicated database and analysing the genome of the highly promising oleaginous red yeast species Rhodotorula toruloides currently poorly studied at the genome and transcriptome levels and with limited genome editing tools. Regulatory prediction is based on the conservation of promoter sequences and available regulatory networks. The case-study examined is focused on the Haa1 transcription factor-a key regulator of yeast resistance to acetic acid, an important inhibitor of industrial bioconversion of lignocellulosic hydrolysates. The new tool described here led to the prediction of a RtHaa1 regulon with expected impact in the optimization of R. toruloides robustness for lignocellulosic and pectin-rich residue biorefinery processes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Relational database schema
Fig. 2
Fig. 2
Promoter sequence nucleotide difference (using the Levenshtein distance), as a boxplot distribution, between S. cerevisiae S288C in comparison to all the species gathered in the YEASTRACT + platform, including the newly added R. toruloides NP11
Fig. 3
Fig. 3
Fraction of common TF binding sites, relative to sites conserved only in S. cerevisiae, to all the species gathered in the YEASTRACT + platform, including the newly added R. toruloides NP11. For each species, the boxplot distribution of the fraction is presented
Fig. 4
Fig. 4
Predicted putative networks for the RHTO_01077 gene—encoding the closest homolog of the Haa1 protein in R. toruloides—projected from the documented regulatory associations of S. cerevisiae, C. glabrata and Z. bailii
Fig. 5
Fig. 5
Inferred Regulon Venn Diagram, of the Haa1p transcription factor targets, considering the documented regulatory associations in S. cerevisiae, C. glabrata and Z. bailii
Fig. 6
Fig. 6
The RtHaa1 regulon in weak acid stress. A Predicted RtHaa1-regulon, based on the set of documented regulatory associations of S. cerevisiae, C. glabrata and Z. bailii under weak acid stress. B Inferred Regulon Venn Diagram, of the Haa1 transcription factor targets, considering the documented regulatory associations in S. cerevisiae, C. glabrata and Z. bailii under weak acid stress conditions

Similar articles

Cited by

References

    1. Cherry JM, Hong EL, Amundsen C, Balakrishnan R, Binkley G, Chan ET, Christie KR, Costanzo MC, Dwight SS, Engel SR, Fisk DG, Hirschman JE, Hitz BC, Karra K, Krieger CJ, Miyasato SR, Nash RS, Park J, Skrzypek MS, Simison M, Weng S, Wong ED. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 2012;40:D700–D705. doi: 10.1093/nar/gkr1029. - DOI - PMC - PubMed
    1. Skrzypek MS, Binkley J, Binkley G, Miyasato SR, Simison M, Sherlock G. The Candida Genome Database (CGD): incorporation of Assembly 22, systematic identifiers and visualization of high throughput sequencing data. Nucleic Acids Res. 2017;45:D592–D596. doi: 10.1093/nar/gkw924. - DOI - PMC - PubMed
    1. Monteiro PT, Oliveira J, Pais P, Antunes M, Palma M, Cavalheiro M, Galocha M, Godinho CP, Martins LC, Bourbon N, Mota MN, Ribeiro RA, Viana R, Sá-Correia I, Teixeira MC. YEASTRACT+: a portal for cross-species comparative genomics of transcription regulation in yeasts. Nucleic Acids Res. 2020;48:D642–D649. doi: 10.1093/nar/gkz859. - DOI - PMC - PubMed
    1. High-density cultivation of oleaginous yeast Rhodosporidium toruloides Y4 in fed-batch culture. Enzyme Microb Technol. 2007;41:312–7.
    1. Hu C, Zhao X, Zhao J, Wu S, Zhao ZK. Effects of biomass hydrolysis by-products on oleaginous yeast Rhodosporidium toruloides. Bioresour Technol. 2009;100:4843–4847. doi: 10.1016/j.biortech.2009.04.041. - DOI - PubMed

Substances

Supplementary concepts

LinkOut - more resources