Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 9;24(10):e57128.
doi: 10.15252/embr.202357128. Epub 2023 Sep 4.

The polyA tail facilitates splicing of last introns with weak 3' splice sites via PABPN1

Affiliations

The polyA tail facilitates splicing of last introns with weak 3' splice sites via PABPN1

Li Huang et al. EMBO Rep. .

Abstract

The polyA tail of mRNAs is important for many aspects of RNA metabolism. However, whether and how it regulates pre-mRNA splicing is still unknown. Here, we report that the polyA tail acts as a splicing enhancer for the last intron via the nuclear polyA binding protein PABPN1 in HeLa cells. PABPN1-depletion induces the retention of a group of introns with a weaker 3' splice site, and they show a strong 3'-end bias and mainly locate in nuclear speckles. The polyA tail is essential for PABPN1-enhanced last intron splicing and functions in a length-dependent manner. Tethering PABPN1 to nonpolyadenylated transcripts also promotes splicing, suggesting a direct role for PABPN1 in splicing regulation. Using TurboID-MS, we construct the PABPN1 interactome, including many spliceosomal and RNA-binding proteins. Specifically, PABPN1 can recruit RBM26&27 to promote splicing by interacting with the coiled-coil and RRM domain of RBM27. PABPN1-regulated terminal intron splicing is conserved in mice. Together, our study establishes a novel mode of post-transcriptional splicing regulation via the polyA tail and PABPN1.

Keywords: PABPN1; RBM26&27; last intron splicing; polyA tail; weak 3′ splice site.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1. PABPN1‐loss induced preferential retention of last introns
  1. A

    Differential intron retention (IR) analysis of RNA‐seq data from samples under PABPN1 knockdown (siPABPN1) and control (NC) conditions. Significantly repressed and induced intron retention events (retained introns, RIs) are indicated by blue and red dots, respectively. The threshold of IR difference (ΔIR, siPABPN1‐siNC) is 0.15, and the FDR cutoff is 0.01.

  2. B

    UCSC genome browser view showing the normalized RNA‐seq signals of COPS5 in HeLa cells under PABPN1 knockdown (si‐PABPN‐1 and si‐PABPN‐2) and control (NC‐1 and NC‐2) conditions. The quantified intron retention levels (IR score) of the last introns (dotted box) are labeled for all samples, respectively.

  3. C

    Cumulative frequency distributions of retained introns (RIs), up‐regulated (red), and down‐regulated (blue) upon PABPN1 knockdown, and of all retained introns in control (NC, green) samples by their relative intronic ranks in genes.

  4. D, E

    Validation of last intron retention of six representative genes in response to PABPN1 knockdown by semi‐quantitative PCR in HeLa cells treated with three different NC and PABPN1 siRNAs (D). The bottom box diagram shows a schematic view of the primers for spliced (EEJ, exon–exon junction) and unspliced (EIJ, exon–intron junction) isoforms. The quantified intron retention (IR) scores of the last introns are shown on the right for all samples, respectively (E).

  5. F

    Relative ratios of the unspliced (EIJ) to spliced (EEJ) isoforms of the last introns for LARP7, BRCA1, and DUSP12 genes in HeLa cells treated in the same manner as in (D).

  6. G

    Western blot validating the degradation effect of PABPN1 under the treatment of dTAG13 for 0, 3, and 6 h.

  7. H

    UCSC genome browser view showing the normalized RNA‐seq signals of COPS5 in PABPN1‐dTAG cells under PABPN1 rapid degradation for 0, 3, and 6 h. The quantified intron retention levels (IR score) of the last introns (dotted box) are labeled for all samples, respectively.

  8. I

    The last intron splicing analysis of LARP7 and COPS5 in response to PABPN1 rapid degradation by dTAG13 for 0, 3, and 6 h by RT–PCR in PABPN1 dTAG cell lines.

  9. J

    Heatmap showing the IR ratio change of PABPN1 knockdown‐induced retained last introns under rapid degradation of PABPN1 for 3 and 6 h compared to 0 h. The ΔIR values of 143 PABPN1‐regulated last introns from the knockdown assay are shown on the right for comparison.

Data information: (E, F) Data are from n = 3 biologically independent experiments. The error bar represents SD. The P‐values are calculated by unpaired t‐test (****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns, nonsignificant). See also Fig EV1. Source data are available online for this figure.
Figure EV1
Figure EV1. Data quality evaluation and characteristics of retained introns and APA events in response to PABPN1 depletion, related to Fig 1
  1. A

    qPCR validation of the RNAi knockdown efficiency of PABPN1 in HeLa cells.

  2. B

    Pearson correlation between two replicates of RNA‐seq data with or without PABPN1 knockdown based on the expression of annotated genes.

  3. C

    Venn diagram showing the overlapping of retained introns (RIs) among three groups: up‐regulated and down‐regulated RIs upon PABPN1 knockdown, and RIs in control (NC) samples.

  4. D

    The schematic diagram of Fkbp12F36V knocking in site of PABPN1 rapidly degrading cell line.

  5. E, F

    UCSC genome browser view showing the normalized RNA‐seq signals of LARP7 (E) and DUSP12 (F) in PABPN1–dTAG cells under PABPN1 rapid degradation for 0, 3, and 6 h. The quantified intron retention levels (IR score) of the last introns (dotted box) are labeled for all samples, respectively.

  6. G

    Global analysis of dPAS usage by PDUI metric between control (NC) and PABPN1 knockdown. The PDUI is calculated by the relative usage of distal PAS (dPAS) to the sum of dPAS and proximal PAS (pPAS) usages (top). The threshold for significant PDUI change is: ¦ΔPDUI¦ ≥ 0.15 and FDR is < 0.01.

  7. H

    Venn diagram showing the overlap between genes with regulated APA and intron retention (IR) events in response to PABPN1 knockdown.

Data information: (A) Data are from n = 3 biologically independent experiments. The error bars represent SD. The P‐values are calculated by unpaired t‐test (**P < 0.01).
Figure 2
Figure 2. Replacing endogenous PAS with histone or Malat1 3′ end without CPA‐mediated cleavage and polyadenylation mimics the phenomenon of PABPN1‐loss‐induced last intron retention
  1. A

    Schematic diagrams of the GFP reporter constructs for downstream splicing assay. The associated elements are annotated on the right. For one endogenous gene, four reporters are constructed: C1 & C2 are the spliced and unspliced CDS controls with SV40 3′ UTR and histone 3′ end, respectively; C3 & C4 contain the last intron and its franking exons of the endogenous gene, with the endogenous 3′ end (C3), histone or Malat1 3′ end (C4).

  2. B

    Western blots of the C1–C4 reporters for two endogenous regulated genes (LARP7 and COPS5) and three endogenous nonregulated control genes (TDRD9, CENPV, and SMTN).

  3. C

    RT–PCR analysis of the spliced and unspliced isoforms from the C3 and C4 reporters for two endogenous regulated genes LARP7 and COPS5.

  4. D

    Schematic diagram of MS2 tethering reporter for PABPN1. 5× MS2 stem loops are inserted to the C4 reporters constructed in Fig 2A which are behind the 3′ UTR and in front of histone/Malat1 3′ end.

  5. E

    RT–PCR quantification of MS2 reporters of histone 3′ end (left) and Malat1 3′ end (right) for two PABPN1‐regulated genes LARP7 and COPS5.

Data information: See also Fig EV2. Source data are available online for this figure.
Figure EV2
Figure EV2. UCSC genome browser views of the genes used to construct the reporters for splicing assay, related to Fig 2
  1. A–D

    Representative gene regulated by PABPN1 are LARP7 (A) and COPS5 (Fig 1B), whereas three nonregulated genes are TDRD9 (B), CENPV (C), and SMTN (D). The dotted box marks the last intron with quantified intron retention (IR) levels.

  2. E

    Western blot analysis of MS2 reporters with histone/Malat1 3′ end for two PABPN1‐regulated genes LARP7 and COPS5.

  3. F

    UCSC genome browser view showing the RNA‐seq signals of FBXO4 upon PABPN1 knockdown. The dotted box marks the retained intron, and it promotes the usage of an internal 3′ splice site (A3SS, red dot) with the maximum score in all intronic 3′ splice sites predicted by spliceAI.

  4. G

    Western blots of the C1‐C4 reporters and an extra control reporter (A3SS) for FBXO4. The A3SS reporter is similar to C1 control with the CDS from the isoform using the internal 3′ splice site in the retained intron.

  5. H

    Splicing analysis of the C3 and C4 reporters for FBXO4 by RT‐PCR.

Figure 3
Figure 3. Long polyA tail promotes the splicing of PABPN1‐enhanced last introns
  1. A, B

    Bar plot showing the enrichment of polyA sequences in PABPN1 immunoprecipitated RNAs using 8‐mer motif analysis (A) or examining the reads ending with 6 or 30 more As.

  2. B

    In (D), only the top motif enriched in PABPN1 IP or input control is shown. The PABPN1 eCLIP data was from ENCODE project (ENCSR820UYE).

  3. C

    Schematic diagram of the splicing reporters by replacing MS2 stem loops in Fig 3A with polyA sequences of different lengths.

  4. D

    RNA quantification by RT–PCR for the spliced and unspliced isoforms (top) and calculated IR scores (bottom) of the reporters in Fig 3F.

  5. E

    Quantification of the spliced (EEJ) over unspliced isoforms (EIJ) of the reporters in Fig 3F.

  6. F

    The polyA tail length distribution of regulated last intron‐retained genes and other genes by nanopore sequencing. The P‐values are calculated by the Wilcoxon test (***P < 0.001).

  7. G

    Splicing analysis of the C3, C4, and polyA reporters in Fig 3F for endogenous genes LARP7, COPS5, and NSUN5 by RT–PCR upon PABPN1 knockdown (top) and the quantified intron retention levels (IR score) of these reporters (bottom).

  8. H

    Protein domains of PABPN1 with known important mutation sites marked by triangles.

  9. I

    Schematic diagrams of PABPN1 mutants for downstream rescue assay.

  10. J

    Relative ratios of the unspliced isoforms (EIJ) to the spliced isoforms (EEJ) of the last introns for LARP7, DUSP12, and BRCA1 genes in HeLa cells treated with si‐PABPN1 and then rescuing PABPN1 mutants constructed in G.

Data information: (D, E, G, and J) Data are from n = 3 biologically independent experiments. The error bars represent SD. The P‐values are calculated by unpaired t‐test (****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns, nonsignificant). See also Fig EV3. Source data are available online for this figure.
Figure EV3
Figure EV3. PABPN1 binds to the polyA tails in vivo and Poly(A) polymerase contributes to PABPN1‐enhanced last intron splicing, related to Fig 3
  1. A

    The average levels of polyA read signals of PABPN1 eCLIP‐seq (two replicates: IP1, green; IP2, orange) and input sample (purple) around the polyA sites.

  2. B

    Pie chart of the fraction of last intron‐retained genes with or without PABPN1 binding downstream the last intron to the polyA site in PABPN1 eCLIP data.

  3. C

    The statistics of CLIP sites of PABPN1 binding to last intron retained genes at the 3′ UTR with or without A‐stretches, or at polyA tails.

  4. D

    UCSC genome browser view showing a representative gene example AMDHD2 with last intron retention upon PABPN1 knockdown and with PABPN1 binding from eCLIP‐seq. The gray dotted box marks the retained intron, and the red dotted box marks the region of PABPN1 binding. The intron retention (IR) values for control (NC‐1 and NC‐2) and PABPN1 knockdown (si‐PABPN1‐1 and si‐PABPN1‐2) samples are shown.

  5. E

    Double digestion validation of the inserted polyA sequences in the GFP reporters constructed in Fig 3E.

  6. F

    Western blot of the polyA reporters with histone 3′ end in Fig 3F for endogenous genes LARP7, COPS5, NSUN5.

  7. G–I

    Western blot of the polyA reporters with Malat1 3′ end in Fig 3F for endogenous genes LARP7 (G), COPS5 (H), NSUN5 (I).

  8. J

    Relative ratios of the unspliced isoforms (EIJ) to spliced isoforms (EEJ) of the last introns for LARP7, DUSP12, and BRCA1 genes in HeLa cells upon knocking down PAP and some other exosome components.

  9. K

    qPCR quantitation of the PAP and PABPN1 expression in HeLa cells treated in the same manner as in A.

  10. L

    Differential intron retention (IR) analysis upon PAP knockdown (siPAP) compared with control (NC). The RNA‐seq data are from Bresson et al (2015a). Significantly repressed and induced retained introns (RIs) are indicated by blue and red dots, respectively. The threshold of IR difference (ΔIR, siPAP ‐ NC) is 0.1, and the FDR cutoff is 0.01.

  11. M

    Venn diagram showing the overlap between genes with up‐regulated retained introns in response to PABPN1 and PAP knockdowns.

  12. N

    UCSC genome browser view showing the normalized RNA‐seq signals of LARP7 in 293T cells under PAP knockdown (si‐PAP‐1 and si‐PAP‐2) and control (NC‐1 and NC‐2) conditions. The quantified intron retention levels (IRs) of the last introns (dotted box) are labeled for different samples, respectively.

Data information: (J, K) Data are from n = 3 biologically independent experiments. The error bars represent SD. The P‐values are calculated by unpaired t‐test (****P < 0.0001; **P < 0.01).
Figure 4
Figure 4. Genomic features of PABPN1‐depletion induced up‐regulated retained introns (RIs)
  1. A

    Box plot showing the splicing score analysis of the 5′ ss (left) and 3′ ss (right) of PABPN1 loss‐induced up‐regulated retained introns (orange) and random last introns (gray) calculated by spliceAI.

  2. B

    Consensus sequences of the PABPN1 loss‐induced up‐regulated retained introns (top) and randomly control introns (bottom) aligned at 3′ ss (20 nt intron sequence and 3 nt exon sequence).

  3. C

    Mutants construct near the 3′ splice sites for NSUN5 (left) and LARP7 (right). All the mutants are based on the C4 reporters in Fig 2A.

  4. D

    3′ splice site score of NSUN5 (purple) and LARP7 (blue) mutated reporters predicted by SpliceAI.

  5. E, F

    Splicing analysis of NSUN5 and LARP7 reporter mutants by RT–PCR (E) and Western blot (F).

  6. G

    The exon GC contents of PABPN1‐regulated genes with up‐ or down‐regulated RIs and other groups of genes located in different nuclear substructures including the nucleolus, nuclear speckle, nuclear LAMINA, Cajal body, HLB, PML, and SAM68.

  7. H

    Distribution of PABPN1‐regulated genes with up‐regulated RIs (left) and those genes whose last intron is up‐regulated RI (right) in nuclear substructures.

Data information: (A) The central line of the box plot represents the median value, and the lower and upper whiskers of the box represent the first and third quartiles, respectively. The upper whisker extends from the hinge to the largest value no further than 1.5 * IQR from the hinge (where IQR is the inter‐quartile range). The lower whisker extends from the hinge to the smallest value at most 1.5 * IQR of the hinge. There are 283 up‐regulated RIs and 1,000 random control RIs for splice score calculation by 5′ ss and 3′ ss, respectively. The P‐values are calculated by the unpaired two‐sided Wilcoxon test. See also Fig EV4. Source data are available online for this figure.
Figure EV4
Figure EV4. PABPN1 loss‐induced up‐regulated retained introns (RIs) have weaker 3′ ss, related to Fig 5
  1. A

    Nucleotide frequency around 3′ ss (50 nt intron sequence and 50 nt exon sequence) of PABPN1 loss‐induced IR‐upregulated introns and randomly control introns.

  2. B

    The 3′ ss score analysis of PABPN1 loss‐induced IR‐upregulated introns grouped by their intron retention (IR) levels, low (0.15 ≤ IR ≤ 0.3) and high (IR > 0.3).

Data information: (B) The central line of the box plot represents the median value, and the lower and upper whiskers of the box represent the first and third quartiles, respectively. The upper whisker extends from the hinge to the largest value no further than 1.5 * IQR from the hinge (where IQR is the inter‐quartile range). The lower whisker extends from the hinge to the smallest value at most 1.5 * IQR of the hinge. The number of points is n = 202 for the ‘IR low’ RIs and n = 81 for the ‘IR high’ RIs. The P‐values are calculated by the Wilcoxon test.
Figure 5
Figure 5. Identification of PABPN1 interactome and RBM26&27 knockdown induces intron retention of PABPN1‐regulated last retained introns (RIs)
  1. A

    Flowchart of TurboID‐GFP‐PABPN1 proximity labeling coupled mass spectrometry (MS) assay.

  2. B

    Volcano plots of the enriched proteins identified by PABPN1 TurboID‐MS. Significantly enriched proteins are colored in orange, and a few representative proteins are highlighted with labels.

  3. C

    RT‐PCR analysis of the last intron splicing of five representative PABPN1‐dependent endogenous genes upon RBM26/27 knockdown with quantified IR values.

  4. D

    Relative ratios of the unspliced (EIJ) to spliced (EEJ) isoforms of the last introns for LARP7, COPS5, BRCA1, and DUSP12 genes in HeLa cells treated in the same manner as in (C).

  5. E

    Splicing analysis of MS2 reporters for two PABPN1‐regulated genes LARP7 and COPS5 when tethering RBM26/27 by RT‐PCR.

  6. F

    Splicing analysis of the polyA reporters for LARP7, COPS5, and NSUN5 under the condition of RBM26/27 knockdown.

  7. G

    Cumulative frequency distributions of up‐regulated RIs upon PABPN1 (red), RBM26&27 (blue), ZC3H3 (green), and ZFC3H1 (yellow) knockdown by their relative intronic rank in genes.

  8. H

    Western blot validation of endogenous PABPN1 interacting with the myc‐tagged RBM26 and RBM27.

  9. I

    Representative immunofluorescence images showing the localization of endogenous PABPN1 relative to SC35, and the mCherry‐tagged RBM26&27 relative to SRRM2 in vivo.

  10. J

    Protein domains of RBM27 and the schematic diagrams of constructed myc‐tagged RBM27 mutants.

  11. K, L

    Western blot of endogenous PABPN1 immunoprecipitated myc‐tagged RBM27 mutated proteins.

Data information: (C–D) Data are from n = 3 biologically independent experiments. The error bars represent SD. The P‐values are calculated by unpaired t‐test (****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns, nonsignificant). See also Fig EV5 and Appendix Fig S1. Source data are available online for this figure.
Figure EV5
Figure EV5. PABPN1 interacts with a large number of proteins associated with RNA spliceosome complex, related to Fig 5
  1. A

    Coomassie Brilliant Blue staining of captured proteins from PABPN1 Turbo‐ID proximity label.

  2. B

    Venn diagram showing the intersection of proteins identified from four independent studies, including the PABPN1 interactome from our TurboID‐MS, the spliceosome‐associated proteins, RNA 3′ end processing proteins, and the RNA degradation‐associated proteins.

  3. C

    GO‐terms enriched in PABPN1 interacting proteins. For Cellular Component (CC) categories, only the top 10 terms are listed; for the KEGG category, only the top 5 terms are listed.

  4. D

    Enriched GO‐terms in the cellular component category of PABPN1 interacting proteins. The enrichment and the count of proteins associated with the GO‐terms are represented by the color and size of the nodes, respectively. At the bottom, the PABPN1 interacting proteins associated with three representative GO‐terms and those not associated with shown terms are presented in four highlighting circles together with the protein–protein interactions annotated in the STRING database. The sizes of the blue nodes represent the enrichment level of the protein interacting with PABPN1.

  5. E

    Western blot validation of PABPN1 interacting with splicing factors and RNA 3′ end processing proteins identified by MS.

  6. F

    Splicing analysis of C3 reporters for LARP7 and COPS5 in Fig 2A and the endogenous genes under pladienolide B (Pla‐B) treatment in repressing U2 complex.

Figure 6
Figure 6. Graphical model for polyA tail‐PABPN1 axis in enhancing last intron splicing of specific RNAs
Within the nucleus, many introns are spliced co‐transcriptionally, while certain introns remain unspliced with 3′‐end already being polyadenylated. The added polyA tail can recruit nuclear polyA binding protein PABPN1 (N1), which may further cooperate with RBM26&27 to facilitate 3′ splice site (ss) recognition by U2 snRNP, and thus functions as an enhancer for the post‐transcriptional splicing of last introns with weaker 3′ ss.

Similar articles

Cited by

References

    1. Antoniou M, Geraghty F, Hurst J, Grosveld F (1998) Efficient 3′‐end formation of human beta‐globin mRNA in vivo requires sequences within the last intron but occurs independently of the splicing reaction. Nucleic Acids Res 26: 721–729 - PMC - PubMed
    1. Anvar SY, Raz Y, Verway N, van der Sluijs B, Venema A, Goeman JJ, Vissing J, van der Maarel SM, t Hoen PA, van Engelen BG et al (2013) A decline in PABPN1 induces progressive muscle weakness in oculopharyngeal muscle dystrophy and in muscle aging. Aging (Albany NY) 5: 412–426 - PMC - PubMed
    1. Barutcu AR, Wu M, Braunschweig U, Dyakov BJA, Luo Z, Turner KM, Durbic T, Lin ZY, Weatheritt RJ, Maass PG et al (2022a) Systematic mapping of nuclear domain‐associated transcripts reveals speckles and lamina as hubs of functionally distinct retained introns. Mol Cell 82: 1035–1052.e9 - PubMed
    1. Barutcu AR, Wu M, Braunschweig U, Dyakov BJA, Luo Z, Turner KM, Durbic T, Lin ZY, Weatheritt RJ, Maass PG et al (2022b) Gene Expression Omnibus GSE176439 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE176439). [DATASET] - PubMed
    1. Beaulieu YB, Kleinman CL, Landry‐Voyer AM, Majewski J, Bachand F (2012) Polyadenylation‐dependent control of long noncoding RNA expression by the poly(A)‐binding protein nuclear 1. PLoS Genet 8: e1003078 - PMC - PubMed

LinkOut - more resources