Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 16:9:920492.
doi: 10.3389/fmolb.2022.920492. eCollection 2022.

DNA- and RNA-Binding Proteins Linked Transcriptional Control and Alternative Splicing Together in a Two-Layer Regulatory Network System of Chronic Myeloid Leukemia

Affiliations

DNA- and RNA-Binding Proteins Linked Transcriptional Control and Alternative Splicing Together in a Two-Layer Regulatory Network System of Chronic Myeloid Leukemia

Chuhui Wang et al. Front Mol Biosci. .

Abstract

DNA- and RNA-binding proteins (DRBPs) typically possess multiple functions to bind both DNA and RNA and regulate gene expression from more than one level. They are controllers for post-transcriptional processes, such as splicing, polyadenylation, transportation, translation, and degradation of RNA transcripts in eukaryotic organisms, as well as regulators on the transcriptional level. Although DRBPs are reported to play critical roles in various developmental processes and diseases, it is still unclear how they work with DNAs and RNAs simultaneously and regulate genes at the transcriptional and post-transcriptional levels. To investigate the functional mechanism of DRBPs, we collected data from a variety of databases and literature and identified 118 DRBPs, which function as both transcription factors (TFs) and splicing factors (SFs), thus called DRBP-SF. Extensive investigations were conducted on four DRBP-SFs that were highly expressed in chronic myeloid leukemia (CML), heterogeneous nuclear ribonucleoprotein K (HNRNPK), heterogeneous nuclear ribonucleoprotein L (HNRNPL), non-POU domain-containing octamer-binding protein (NONO), and TAR DNA-binding protein 43 (TARDBP). By integrating and analyzing ChIP-seq, CLIP-seq, RNA-seq, and shRNA-seq data in K562 using binding and expression target analysis and Statistical Utility for RBP Functions, we discovered a two-layer regulatory network system centered on these four DRBP-SFs and proposed three possible regulatory models where DRBP-SFs can connect transcriptional and alternative splicing regulatory networks cooperatively in CML. The exploration of the identified DRBP-SFs provides new ideas for studying DRBP and regulatory networks, holding promise for further mechanistic discoveries of the two-layer gene regulatory system that may play critical roles in the occurrence and development of CML.

Keywords: DNA- and RNA-binding protein; alternative splicing regulatory network; chronic myeloid leukemia; splicing factor; transcription factor; transcriptional regulatory network.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Overview of this study. We collected DRBPs from high-throughput, dedicated database, and annotation data. Then, the intersection of DRBP and splicing factor lists are defined as DRBP-SFs. To investigate the functions of DRBP-SFs, we carried out differential expression analysis, functional enrichment analysis and network construction for HNRNPK, HNRNPL, NONO and TARDBP in chronic myeloid leukemia. At last, we verified the model by text mining.
FIGURE 2
FIGURE 2
Gene Ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis. (A) The top 20 GO terms in biological process, cellular components, and molecular functions. (B) The top 29 KEGG pathways.
FIGURE 3
FIGURE 3
Two-layer regulatory network of Heterogeneous nuclear ribonucleoprotein K (HNRNPK). Pink indicates the target genes of the transcriptional regulatory network of HNRNPK, blue indicates the target genes of splicing regulatory network of HNRNPK, and purple indicates the co-regulated targets of HNRNPK. For more detailed information on genes in the network, please refer to Supplementary Table S7.
FIGURE 4
FIGURE 4
Hypothetical two-layer network regulatory models of genes. (A) DNA- and RNA-binding proteins splicing factors (DRBP-SFs) may regulate the same genes at the transcriptional and splicing level as transcription factors (TFs) and SFs, respectively. (B) One DRBP-SF may act as SF to regulate the splicing of one gene with another SF controlled by this DRBP-SF in the transcriptional regulation level. (C) One DRBP-SF may act as a TF to regulate the transcription of one gene with another TF controlled by this DRBP-SF in the splicing regulation.
FIGURE 5
FIGURE 5
Co-regulatory gene network diagram of the 4 DNA- and RNA-binding proteins splicing factors (DRBP-SFs) and analysis of their Venn analysis. (A) Two-layer co-regulated gene network diagram of the four proteins; red indicates a gene co-regulated by all four proteins, gray indicates the genes co-regulated by three proteins, purple indicates the genes co-regulated by two proteins, blue indicates the genes regulated by heterogeneous nuclear ribonucleoprotein K, pink indicates the genes regulated by heterogeneous nuclear ribonucleoprotein L, green indicates the genes regulated by non-POU domain–containing octamer–binding protein, and yellow indicates the genes regulated by TAR DNA-binding protein 43. (B) Venn analysis diagram of the co-regulation of the four proteins.
FIGURE 6
FIGURE 6
Target genes of the two-layer regulatory networks associated with heterogeneous nuclear ribonucleoprotein K (HNRNPK), heterogeneous nuclear ribonucleoprotein L (HNRNPL), non-POU domain–containing octamer–binding protein, and TAR DNA-binding protein 43 (TARDBP). Principal component analysis dimensionality reduction was performed on the expression datasets of Cells-Leukemia cell line (CML) and Whole Blood. (A) Target genes of the two-layer regulatory networks associated with HNRNPK, HNRNPL, NONO, and TARDBP in 3D. (B) Target genes of the two-layer regulatory networks associated with HNRNPK, HNRNPL, NONO, and TARDBP in 2D. (C) Target genes of the two-layer regulatory networks associated with HNRNPK. (D) Target genes of the two-layer regulatory networks associated with HNRNPL. (E) Target genes of the two-layer regulatory networks associated with NONO. (F) Target genes of the two-layer regulatory networks associated with TARDBP.
FIGURE 7
FIGURE 7
Network diagram of regulation model Ⅱ. (A) Heterogeneous nuclear ribonucleoprotein K (HNRNPK) regulates the transcription of nuclear cap–binding protein subunit 2 (NCBP2), and then HNRNPK and NCBP2 can jointly regulate the splicing of genes. Green indicates the genes that act as splicing factors (SFs) regulated by HNRNPK at the transcription level, pink indicates the genes regulated by HNRNPK at the splicing level, purple indicates the genes co-regulated by HNRNPK and NCBP2 at the splicing level, and blue indicates the genes regulated by NCBP2 at the splicing level. Rectangles indicate the target genes of HNRNPK at the transcription level, and ellipse indicates the target gene of HNRNPK and NCBP2 at the splicing level. (B) The binding regions of HNRNPK and NCBP2 in splicing target genes, ACTB, AKT1, ASXL1, TCF3, TFRC, and VEGFA. Input the ChIP-seq data of HNRNPK and NCBP2 into the UCSC Genome Browser to obtain the position image of peak in the genome. Purple and red indicate the protein binding sites, and blue indicates the location of genes in the genome. For more detailed information on genes in the network, please refer to Supplementary Table S8.
FIGURE 8
FIGURE 8
Network diagram of regulation model Ⅲ. (A) Heterogeneous nuclear ribonucleoprotein K (HNRNPK) regulates the splicing of scaffold attachment factor B1 (SAFB); HNRNPK and SAFB can jointly regulate the transcription of downstream genes. Green indicates genes that act as transcription factors regulated by HNRNPK at the splicing level, pink indicates genes regulated by HNRNPK at the transcriptional level, blue indicates genes regulated by SAFB at the transcriptional level, and purple indicates genes co-regulated by HNRNPK and nuclear cap–binding protein subunit 2 (NCBP2) at the transcriptional level. Ellipses indicate the target genes of SAFB and HNRNPK at the transcriptional level, and rectangles indicate target genes of HNRNPK at the splicing level. (B) The binding regions of HNRNPK and SAFB in transcriptional regulation target genes, BCL6 corepressor, CRK-like proto-oncogene adaptor protein, DNA methyltransferase 3 beta, enhancer of zeste 2 polycomb repressive complex 2 subunit, and fibroblast growth factor receptor 3. Input the ChIP-seq data of HNRNPK and SAFB into the UCSC Genome Browser to obtain the position image of peak in the genome. Purple and red indicate the protein binding sites, and blue indicates the location of genes in the genome. For more detailed information on genes in the network, please refer to Supplementary Table S9.

References

    1. Andersson M. K., Ståhlberg A., Arvidsson Y., Olofsson A., Semb H., Stenman G., et al. (2008). The Multifunctional FUS, EWS and TAF15 Proto-Oncoproteins Show Cell Type-specific Expression Patterns and Involvement in Cell Spreading and Stress Response. BMC Cell. Biol. 9, 37. 10.1186/1471-2121-9-37 - DOI - PMC - PubMed
    1. Bailey T. L., Elkan C. (1994). Fitting a Mixture Model by Expectation Maximization to Discover Motifs in Biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36. - PubMed
    1. Bailey T. L., Johnson J., Grant C. E., Noble W. S. (2015). The MEME Suite. Nucleic Acids Res. 43 (W1), W39–W49. 10.1093/nar/gkv416 - DOI - PMC - PubMed
    1. Bailey T. L. (2021). STREME: Accurate and Versatile Sequence Motif Discovery. Bioinformatics 37 (18), 2834–2840. 10.1093/bioinformatics/btab203 - DOI - PMC - PubMed
    1. Bardou P., Mariette J., Escudié F., Djemiel C., Klopp C. (2014). Jvenn: an Interactive Venn Diagram Viewer. BMC Bioinforma. 15, 293. 10.1186/1471-2105-15-293 - DOI - PMC - PubMed