Mol Syst Biol. 2019 Apr 8;15(4):e8689.

doi: 10.15252/msb.20188689.

Defining the RNA interactome by total RNA-associated protein purification

Vadim Shchepachev¹, Stefan Bresson¹, Christos Spanos¹, Elisabeth Petfalski¹, Lutz Fischer², Juri Rappsilber²,³, David Tollervey³

Affiliations

¹ Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK.
² Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany.
³ Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK. juri.rappsilber@ed.ac.uk, D.Tollervey@ed.ac.uk

PMID: 30962360
PMCID: PMC6452921
DOI: 10.15252/msb.20188689

Vadim Shchepachev et al. Mol Syst Biol. 2019.

Abstract

The RNA binding proteome (RBPome) was previously investigated using UV crosslinking and purification of poly(A)-associated proteins. However, most cellular transcripts are not polyadenylated. We therefore developed total RNA-associated protein purification (TRAPP) based on 254 nm UV crosslinking and purification of all RNA-protein complexes using silica beads. In a variant approach (PAR-TRAPP), RNAs were labelled with 4-thiouracil prior to 350 nm crosslinking. PAR-TRAPP in yeast identified hundreds of RNA binding proteins, strongly enriched for canonical RBPs. In comparison, TRAPP identified many more proteins not expected to bind RNA, and this correlated strongly with protein abundance. Comparing TRAPP in yeast and E. coli showed apparent conservation of RNA binding by metabolic enzymes. Illustrating the value of total RBP purification, we discovered that the glycolytic enzyme enolase interacts with tRNAs. Exploiting PAR-TRAPP to determine the effects of brief exposure to weak acid stress revealed specific changes in late 60S ribosome biogenesis. Furthermore, we identified the precise sites of crosslinking for hundreds of RNA-peptide conjugates, using iTRAPP, providing insights into potential regulation. We conclude that TRAPP is a widely applicable tool for RBPome characterization.

Keywords: RNA binding sites; mass spectrometry; phase separation; protein–RNA interaction; yeast.

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

**Figure 1. TRAPP and PAR‐TRAPP reveal the yeast RBPome**
TRAPP and PAR‐TRAPP workflows used to identify RNA‐interacting proteins with SILAC MS‐MS. See the main text for details.
Scatter plot of Log₂ SILAC ratios +UVC/−UVC (1,360 mJ cm⁻²) for *Saccharomyces cerevisiae* proteins, quantified with TRAPP. Proteins were subdivided based on the indicated GO term categories. Proteins belonging to GO terms “membrane” and “DNA binding” do not contain proteins mapping to GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”. Black dots represent proteins that failed to pass statistical significance cut‐off (P‐value adjusted < 0.05).
Scatter plot of Log₂ SILAC ratios +UVA/−UVA for *S. cerevisiae* proteins, quantified with PAR‐TRAPP. Proteins were subdivided based on the indicated GO term categories. Proteins belonging to GO terms “small molecule metabolism”, “membrane” and “DNA binding” do not contain proteins mapping to GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”. Black dots represent proteins that failed to pass statistical significance cut‐off (P‐value adjusted < 0.05). See Methods and Protocols for calculation of significance.
Venn diagram showing the overlap between proteins identified in TRAPP and PAR‐TRAPP and proteins of intermediary metabolism annotated in the yeast metabolome database (YMDB).
5 most enriched GO terms amongst proteins identified only in TRAPP or exclusively in PAR‐TRAPP.
6 most significantly enriched domains (lowest P‐value) in PAR‐TRAPP‐identified proteins were selected if the same domain was enriched amongst TRAPP‐identified proteins. Domain fold enrichment in the recovered proteins is plotted on the x‐axis, while colour indicates log₁₀ Benjamini–Hochberg adjusted P‐value.
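
The domain enrichment panel above reports a fold enrichment and a Benjamini–Hochberg adjusted P‐value per domain. The following is a minimal sketch of those two quantities, assuming fold enrichment is the domain's frequency amongst recovered proteins divided by its frequency in a background proteome. The function names and example counts are hypothetical, and the statistical test that produced the raw P‐values is not specified here.

```python
from math import log10


def fold_enrichment(hits_with_domain, hits_total,
                    background_with_domain, background_total):
    """Domain frequency in recovered proteins over its background frequency."""
    return (hits_with_domain / hits_total) / (background_with_domain / background_total)


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so adjusted values stay monotonic.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw P-values for four domains.
raw = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(raw)
colour_scale = [log10(p) for p in adj]  # log10 adjusted P-value, as used for colour
```

The adjusted values can then be mapped to a colour scale while the fold enrichment is placed on the x‐axis.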

**Figure EV1. TRAPP and PAR‐TRAPP techniques and reproducibility**
Pairwise Pearson correlation coefficients for protein Log₂ +UV/−UV ratios obtained in *S. cerevisiae* with TRAPP and PAR‐TRAPP experiments. Forward isotopic labelling and reverse isotopic labelling are indicated as “FWD” and “RV”, respectively.
Data for *Escherichia coli* 1 forward and 2 reverse labelling TRAPP repeats were processed as in (A).
The analysis of *S. cerevisiae* forward and reverse PAR‐TRAPP experiments upon sorbic acid exposure performed as in (A).
Effect of 4tU and UVA treatments on the growth of yeast cells. Exponentially growing yeast cells were treated for 2 h with 4‐thiouracil at the indicated concentrations. The cultures were then irradiated with 350 nm UVA light in the eBox for 30 s, delivering 5.8 J cm⁻². The lag time of treated cultures was measured by monitoring sample growth curves with a Tecan Sunrise instrument. Samples: “4tU wash out +UVA”: growth delay of 4‐thiouracil‐treated, UVA‐irradiated cells compared to the UVA‐exposed sample, with 4tU removed prior to irradiation; “4tU w/o wash out +UVA”: as the first sample, but 4tU persisted in the media while cells were irradiated; “4tU alone”: growth delay of cells treated with 4tU for 2 h compared to untreated cells, without irradiation.
Front view of the eBox irradiation apparatus. The front door and the shutters are not shown in the picture. Red arrows indicate rails for the shutters, which are designed to prevent sample exposure to UV light while the lamps are warming up for stable UVA output. The UVA‐transparent sample tray, made of borosilicate glass, is placed between the two UVA lamp banks.
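
The reproducibility panels of this figure rest on pairwise Pearson correlation coefficients between replicate Log₂ +UV/−UV ratios. A minimal sketch of that comparison follows, assuming each replicate is a mapping from protein identifier to Log₂ ratio and that only proteins quantified in both replicates are compared. The replicate names and values are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def pairwise_pearson(replicates):
    """Correlate every pair of replicates over their shared proteins."""
    result = {}
    for a, b in combinations(sorted(replicates), 2):
        shared = sorted(set(replicates[a]) & set(replicates[b]))
        xs = [replicates[a][p] for p in shared]
        ys = [replicates[b][p] for p in shared]
        result[(a, b)] = pearson(xs, ys)
    return result


# Hypothetical Log2 +UV/-UV ratios for forward (FWD) and reverse (RV) labelling.
example = {
    "FWD1": {"A": 1.0, "B": 2.0, "C": 3.0},
    "FWD2": {"A": 3.0, "B": 2.0, "C": 1.0},
    "RV1": {"A": 2.0, "B": 4.0, "C": 6.0, "D": 0.0},
}
correlations = pairwise_pearson(example)
```

For reverse‐labelled replicates the ratio is assumed to have already been inverted so that every value is expressed as +UV/−UV.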

**Figure EV2. TRAPP protocol predominately recovers RNA‐bound proteins**
The experimental set‐up indicating the stages when samples are collected. Coloured circles designate treatment with the indicated enzyme.
Samples were purified following the TRAPP protocol as described in Materials and Methods. After RNase A and RNase T1 treatment to degrade the co‐purifying RNA, the sample was resolved on a polyacrylamide gel and silver stained.
TRAPP‐purified samples were treated with the indicated enzymes and loaded onto silica once again. After elution, nucleic acids were resolved with agarose gel electrophoresis (see Fig EV2D), while the remainder of the sample was treated with the indicated enzyme followed by polyacrylamide gel electrophoresis and silver staining.
Same as in (C), but the samples were collected before the second nuclease treatment and were then resolved on a SYBR Safe stained agarose gel. * denotes residual nucleic acid species in the cyanase‐treated sample.

**Figure 2. The effect of UVC dose in *Saccharomyces cerevisiae* on the proteins identified in TRAPP**
Venn diagram showing the overlap between proteins identified in TRAPP using the indicated UVC irradiation regime.
Scatter plot of Log₂ SILAC ratios +UVC/−UVC (for the indicated UV doses) for *S. cerevisiae* proteins, quantified with TRAPP. Proteins were subdivided based on the indicated GO term categories. Proteins belonging to GO terms “membrane” and “small molecule metabolism” do not contain proteins mapping to GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”. Black dots represent proteins that failed to pass statistical significance cut‐off (P‐value adjusted < 0.05).
Proteins identified in TRAPP and PAR‐TRAPP were subdivided into two categories: “RNA biology” proteins (GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”) (orange bars) and proteins not classified with any of the three GO terms above (blue bars). Numbers of proteins in each category are plotted per experiment.
Proteins quantified in both TRAPP and PAR‐TRAPP were filtered to remove proteins annotated with GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex” (blue bars in Fig 2C). The remaining proteins were split into 10 bins by abundance (see Materials and Methods). For each bin, the ratio of enriched to detected proteins was calculated, as well as the median protein abundance as reported by PaxDb.
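
The abundance binning in the last panel can be sketched as follows. This is an illustration only: it assumes equal‐sized bins by abundance rank, which may differ from the binning described in Materials and Methods, and the function name and example data are hypothetical.

```python
from statistics import median


def enrichment_by_abundance(proteins, n_bins=10):
    """Split proteins into rank-based abundance bins.

    proteins: list of (abundance, is_enriched) tuples.
    Returns one (median_abundance, enriched_fraction) tuple per non-empty bin.
    """
    ranked = sorted(proteins, key=lambda p: p[0])
    n = len(ranked)
    bins = []
    for b in range(n_bins):
        chunk = ranked[b * n // n_bins:(b + 1) * n // n_bins]
        if not chunk:
            continue
        # Ratio of enriched to detected proteins within the bin.
        fraction = sum(1 for _, enriched in chunk if enriched) / len(chunk)
        bins.append((median(a for a, _ in chunk), fraction))
    return bins


# Hypothetical data: 20 proteins, the six most abundant flagged as enriched.
example = [(float(i), i >= 15) for i in range(1, 21)]
bins = enrichment_by_abundance(example)
```

A rising enriched fraction across bins would correspond to the correlation with protein abundance described in the abstract.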

**Figure EV3. Quantification of TRAPP and PAR‐TRAPP data**
The percentage of peptides with reported intensity in +UV sample, but not in −UV sample (superenriched peptides) by MaxQuant in *Saccharomyces cerevisiae* TRAPP (at 1,360 mJ cm⁻²) SILAC quantification experiments without (black bars) or with (grey bars) “requantify” option enabled. Three biological repeats had light‐labelled cells UV irradiated (1F, 2F, 3F), while three other repeats (1R, 2R, 3R) had heavy‐labelled cells UV irradiated.
The data of *S. cerevisiae* PAR‐TRAPP experiments were analysed the same way as in (A).
The data of *E. coli* TRAPP experiments were analysed the same way as in (A), except the 2 biological repeats, which had light‐labelled cells UV irradiated, were labelled 1R and 2R.
The percentage of peptides with reported intensity in −UV sample, but not in +UV sample (superdepleted peptides) by MaxQuant in *S. cerevisiae* TRAPP (at 1,360 mJ cm⁻²) without (dotted) or with (chequered) “requantify” option enabled. Sample labelling as in (A).
The data of *S. cerevisiae* PAR‐TRAPP experiments were analysed the same way as in (D).
The data of *E. coli* TRAPP (at 1,360 mJ cm⁻²) experiments were analysed the same way as in (D), sample labelling was as in panel (C).
Box plot of Log₁₀ peptide intensity of −UV peptides from *S. cerevisiae* TRAPP (at 1,360 mJ cm⁻²) (blue) samples (labelling as in (A)), plotted together with Log₁₀ peptide intensity values imputed by imputeLCMD R package for −UV samples (orange). Box represents values between 25th and 75th percentiles, while whiskers represent 10th and 90th percentiles. All other data are represented as points below or above 10th or 90th percentiles, respectively. Line inside the box shows median value.
Histogram of peptide intensity frequency obtained from −UV sample (1F), plotted for intensities from 0 to 5 × 10⁵ units. Colour labelling is as in (G).
Same as panel (H), performed for sample 1R which had reversed SILAC labelling, compared to the sample analysed in panel (H).
Box plot of Log₁₀ peptide intensity of −UV peptides from *S. cerevisiae* PAR‐TRAPP (blue) samples (labelling as in (B)), plotted together with Log₁₀ peptide intensity values imputed by imputeLCMD R package for −UV samples (orange).
Histogram of peptide intensity frequency obtained from −UV sample (1F), plotted for intensities from 0 to 5 × 10⁵ units. Colour labelling is as in (J).
Same analysis as panel (K), performed for sample 1R which had reversed SILAC labelling, compared to the sample analysed in panel (K).
Box plot of Log₁₀ peptide intensity of −UV peptides from *E. coli* TRAPP (at 1,360 mJ cm⁻²) (blue) samples (labelling as in (C)), plotted together with Log₁₀ peptide intensity values imputed by imputeLCMD R package for −UV samples (orange).
Histogram of peptide intensity frequency obtained from −UV sample (1F), plotted for intensities from 0 to 5 × 10⁵ units. Colour labelling is as in (M).
Same analysis as panel (N), performed for sample 1R which had reversed SILAC labelling, compared to the sample analysed in panel (N).
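
The superenriched and superdepleted percentages defined in this figure can be illustrated with a short sketch. It assumes each peptide carries a +UV and a −UV intensity, with `None` standing for a missing value, and that the denominator is every peptide with an intensity in at least one channel. That denominator is an assumption, as are the function name and example values.

```python
def one_sided_percentages(peptides):
    """Return (% superenriched, % superdepleted) peptides.

    peptides: list of (plus_uv_intensity, minus_uv_intensity); None means
    that no intensity was reported for that channel.
    """
    quantified = [(p, m) for p, m in peptides if p is not None or m is not None]
    if not quantified:
        raise ValueError("no peptide has a reported intensity")
    total = len(quantified)
    # Intensity in +UV only.
    superenriched = sum(1 for p, m in quantified if p is not None and m is None)
    # Intensity in -UV only.
    superdepleted = sum(1 for p, m in quantified if p is None and m is not None)
    return 100 * superenriched / total, 100 * superdepleted / total


# Hypothetical peptide intensities.
example = [(1e6, None), (2e5, 1e5), (None, 3e4), (5e5, None)]
pct_enriched, pct_depleted = one_sided_percentages(example)
```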

**Figure EV4. Volcano plots for protein enrichment in TRAPP with different UV exposure**
(A–G) Volcano plot showing Log₂ UV fold enrichment plotted against – Log₁₀ P‐value per protein for the following experiments: (A) *Saccharomyces cerevisiae* TRAPP at 1,360 mJ cm⁻²; (B) *S. cerevisiae* PAR‐TRAPP at 7.2 J cm⁻²; (C) *S. cerevisiae* TRAPP at 400 mJ cm⁻²; (D) *S. cerevisiae* TRAPP at 800 mJ cm⁻²; (E) *E. coli* TRAPP at 1,360 mJ cm⁻²; (F) *E. coli* TRAPP at 800 mJ cm⁻²; (G) *E. coli* TRAPP at 400 mJ cm⁻².

**Figure EV5. Glycolytic enzymes identified in TRAPP**
Glycolysis pathway in yeast *Saccharomyces cerevisiae*, indicating intermediate metabolites and participating enzymes. Proteins identified as enriched by TRAPP (1.4 J cm⁻²) and PAR‐TRAPP are shown with grey and white stars, respectively.

**Figure 3. The yeast RBPome identified by TRAPP compared to poly(A) RNA RBPome**
Venn diagram showing the overlap between proteins identified in PAR‐TRAPP, poly(A) capture and TRAPP.
Scatter plot of Log₂ PAR‐TRAPP SILAC ratios +UVA/−UVA for *Saccharomyces cerevisiae* proteins and Log₂ +UVA/−UVA fold enrichment for poly(A) capture technique. Only proteins identified in both methods as RBPs are shown. Proteins were subdivided based on the indicated GO term categories. Proteins belonging to GO terms “membrane” and “small molecule metabolism” do not contain proteins mapping to GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”. Black dots represent proteins that failed to pass statistical significance cut‐off (P‐value adjusted < 0.05).
Scatter plot of Log₂ SILAC ratios +UVA/−UVA for all *S. cerevisiae* proteins identified in PAR‐TRAPP plotted together with Log₂ +UVA/−UVA fold enrichment for all proteins, reported as RBPs in poly(A) capture technique. Labelling is as in panel (B).

**Figure 4. TRAPP reveals RNA binding proteins conserved from *Escherichia coli* to *Saccharomyces cerevisiae***
Venn diagram showing the overlap between proteins identified in TRAPP using the indicated UVC irradiation regime.
Scatter plot of Log₂ SILAC ratios +UVC/−UVC for *E. coli* proteins, quantified with TRAPP. Proteins were subdivided based on the indicated GO term categories. Proteins belonging to GO terms “membrane” and “small molecule metabolism” do not contain proteins mapping to GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”. Black dots represent proteins that failed to pass statistical significance cut‐off (P‐value adjusted < 0.05).
Proteins identified in *E. coli* TRAPP were subdivided into two categories: “RNA biology” proteins (GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex”) (orange bars) and proteins not classified with any of the three GO terms above (blue bars). Numbers of proteins in each category are plotted per experiment.
Most significantly enriched protein domains (lowest P‐value) in *E. coli* TRAPP‐identified proteins at 1,360 mJ cm⁻². Fold enrichment of the indicated domain amongst the recovered proteins is plotted on the x‐axis, while colour indicates log₁₀ Benjamini–Hochberg adjusted P‐value. Domains found enriched in the yeast TRAPP data (at 1,360 mJ cm⁻²) are labelled with red colour.
Proteins quantified in all of the *E. coli* TRAPP experiments were filtered to remove proteins annotated with GO terms “RNA metabolic process”, “RNA binding”, “ribonucleoprotein complex” (blue bars in Fig 4C). The remaining proteins were split into 10 bins by abundance (see Materials and Methods). For each bin, the ratio of enriched to detected proteins was calculated, as well as the median protein abundance as reported by PaxDb.
Pie chart of Inparanoid 8.0 database orthologous clusters between *S. cerevisiae* and *E. coli*. For a cluster to be labelled as conserved RNA interacting (“conserved, RBPs”), it was required to contain at least one bacterial and one yeast protein enriched in TRAPP (at 1,360 mJ cm⁻²). “Conserved, RBPs metabolic” are clusters where at least one protein in yeast or bacteria is identified in the YMDB or in ECMDB databases, respectively (see Materials and Methods).
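
The cluster labelling rule in the pie chart panel can be written out as a small decision function. This sketch assumes that “conserved, RBPs metabolic” is a subset of the RNA‐interacting clusters and that clusters failing the TRAPP criterion are simply labelled “conserved”. Both points, along with the data layout and identifiers, are assumptions made for illustration.

```python
def classify_cluster(cluster, enriched, metabolic):
    """Label one orthologous cluster.

    cluster: dict with 'yeast' and 'ecoli' lists of protein identifiers.
    enriched: set of proteins enriched in TRAPP in either organism.
    metabolic: set of proteins listed in the metabolome databases.
    """
    yeast_hit = any(p in enriched for p in cluster["yeast"])
    ecoli_hit = any(p in enriched for p in cluster["ecoli"])
    # At least one yeast and one bacterial protein must be enriched.
    if not (yeast_hit and ecoli_hit):
        return "conserved"
    members = cluster["yeast"] + cluster["ecoli"]
    if any(p in metabolic for p in members):
        return "conserved, RBPs metabolic"
    return "conserved, RBPs"


# Hypothetical identifiers, not taken from the data set.
enriched = {"yeast_1", "ecoli_1", "yeast_2", "ecoli_2", "yeast_3"}
metabolic = {"ecoli_2"}
labels = [
    classify_cluster({"yeast": ["yeast_1"], "ecoli": ["ecoli_1"]}, enriched, metabolic),
    classify_cluster({"yeast": ["yeast_2"], "ecoli": ["ecoli_2"]}, enriched, metabolic),
    classify_cluster({"yeast": ["yeast_3"], "ecoli": ["ecoli_3"]}, enriched, metabolic),
]
```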

**Figure 5. TRAPP reveals the dynamics of RBPome upon stress**
Volcano plot showing Log₂ protein abundance fold change in RBPome plotted against – Log₁₀ P‐value. Black points represent proteins showing no statistically significant change upon sorbic acid exposure in PAR‐TRAPP, while proteins changing significantly (P‐value adjusted < 0.05) are labelled with blue. Only proteins observed as RNA interacting in PAR‐TRAPP were included in the analysis.
Scatter plot of Log₂ SILAC ratios +Sorbic/−Sorbic for *Saccharomyces cerevisiae* proteins, quantified with PAR‐TRAPP. Grey points represent proteins showing no statistically significant change upon sorbic acid exposure in PAR‐TRAPP, while proteins changing significantly (P‐value adjusted < 0.05) are labelled with other colours. Only proteins observed as RNA interacting in PAR‐TRAPP were included in the analysis, except for proteins in the category “protein of intermediary metabolism extended”, for which this criterion was dropped. Proteins annotated with GO term categories “RNA binding”, “translation”, “translation initiation”, “P‐body” and “small molecule metabolism” are displayed together with proteins annotated in literature‐curated lists: “ribosome biogenesis 40S”, “ribosome biogenesis 60S” (Woolford & Baserga, 2013). Proteins belonging to categories “protein of intermediary metabolism” and “protein of intermediary metabolism extended” are yeast enzymes and transporters of intermediary metabolism, obtained from YMDB and further filtered to remove aminoacyl‐tRNA synthetases. “P‐body core” category contains proteins identified as core components of P‐bodies in yeast (Buchan *et al*, 2010). Numbers label the following proteins on the chart: 1 – Rtc3; 2 – Tif3; 3 – Rpg1; 4 – Tif35; 5 – Gcd11; 6 – Rlp24; 7 – Ssd1; 8 – Rbp7; 9 – Nmd3.
Volcano plot showing Log₂ fold change in protein abundance upon sorbic acid exposure plotted against – Log₁₀ P‐value. Black points represent proteins showing no statistically significant change in abundance upon sorbic acid exposure (P‐value adjusted > 0.05).
The cytoplasmic phase of large subunit maturation in yeast (Lo *et al*, 2010). Proteins altered in abundance in PAR‐TRAPP data upon sorbic acid exposure are indicated with arrows. Blue arrow denotes decrease, while red arrows indicate increase in PAR‐TRAPP recovery upon stress. For proteins passing the statistical significance cut‐off (P‐value adjusted < 0.05), fold change is indicated.
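
The volcano plots in this figure place each protein at Log₂ fold change against −Log₁₀ P‐value and colour it by whether the adjusted P‐value is below 0.05. A minimal sketch of computing those coordinates follows. The input layout, function name and example numbers are hypothetical, and how the P‐values themselves were derived is left to Methods and Protocols.

```python
from math import log2, log10


def volcano_points(proteins, alpha=0.05):
    """Map each protein to (log2 fold change, -log10 P-value, significant).

    proteins: dict of name -> (fold_change, p_value, adjusted_p_value).
    """
    return {
        name: (log2(fold_change), -log10(p_value), adjusted < alpha)
        for name, (fold_change, p_value, adjusted) in proteins.items()
    }


# Hypothetical values for two proteins.
points = volcano_points({
    "protein_a": (4.0, 0.001, 0.01),
    "protein_b": (0.5, 0.5, 0.6),
})
```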

**Figure 6. Identifying the RNA‐crosslinked peptides with iTRAPP**
iTRAPP workflow to directly observe crosslinked RNA–peptides species by mass spectrometry. See the main text for details.
Pie chart of RNA species observed crosslinked to peptides by the Xi search engine.
The analysis of amino acids, reported as crosslinked by the Xi search engine. Amino acids are represented by single letter IUPAC codes. Black bars—crosslink efficiency, defined as ratio between the frequency of the crosslinked amino acid and the frequency of the amino acid in all crosslinked peptides.
Venn diagram showing the overlap between proteins identified in PAR‐TRAPP, RNPxl and Xi. Protein groups, reported by RNPxl and Xi, were expanded to single proteins so as to maximize the resulting overlap.
Domain structure of selected proteins, identified as crosslinked by the Xi search engine. Domains (coloured rectangles) and sites of phosphorylation (light green rhombi) from the UniProt database were plotted onto proteins represented by grey rectangles. Crosslink sites, identified by Xi, are indicated with red pentagons. See also Appendix Supplementary Methods.
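
The crosslink efficiency defined in the amino acid panel of this figure can be worked through in a few lines. The sketch reads the definition as the share of crosslink sites falling on an amino acid divided by that amino acid's share of all residues in the crosslinked peptides. This reading, the input layout and the example peptides are assumptions.

```python
from collections import Counter


def crosslink_efficiency(peptides):
    """Per-amino-acid crosslink efficiency.

    peptides: list of (sequence, zero-based index of the crosslinked residue).
    """
    site_counts = Counter(seq[i] for seq, i in peptides)
    residue_counts = Counter(aa for seq, _ in peptides for aa in seq)
    n_sites = sum(site_counts.values())
    n_residues = sum(residue_counts.values())
    return {
        aa: (site_counts[aa] / n_sites) / (residue_counts[aa] / n_residues)
        for aa in residue_counts
    }


# Hypothetical crosslinked peptides in single-letter IUPAC code.
efficiency = crosslink_efficiency([("WAK", 0), ("GWK", 1), ("AGK", 2)])
```

Under this reading, a value above 1 means the amino acid is crosslinked more often than its frequency in the peptides would suggest.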

**Figure EV6. iTRAPP including Lambda phosphatase treatment**
The number of protein phosphorylation sites, reported by MaxQuant software for the λ phosphatase‐treated sample and untreated control, demonstrating that the treatment effectively removed phosphorylation from amino acids.
Pie chart of RNA species observed crosslinked to peptides by the Xi search engine in the sample treated with Lambda phosphatase.

**Figure 7. RNA binding by Eno1**
Representative gel showing the recovery of radiolabelled RNA after CRAC purification. Lane 1: Untagged control strain (BY4741). Lane 2: Strain expressing Eno1‐HTP from its endogenous locus.
Bar charts showing the relative distribution of reads amongst different classes of RNA.
The binding of Eno1 to the representative tRNA tA(UGC). The four upper tracks show the distribution of entire reads, while the four lower tracks show putative crosslinking sites (deletions). Tracks are scaled by reads (or deletions) per million, and this value is denoted in the upper left corner of each track. Two independent replicates are shown for the untagged BY control and Eno1‐HTP. The tRNA sequence is shown below with the T‐loop sequence highlighted in orange.
Global view of Eno1 binding to cytoplasmic tRNAs (left) or mitochondrial tRNAs (right). For ease of viewing, tRNAs across the genome were concatenated into a single “chromosome”, with each tRNA gene annotation shown in blue. Two independent replicates are shown.
Metagene plots showing the distribution of reads or deletions summed across all tRNAs. tRNA genes were aligned from either the 5′ end (left) or the 3′ end (right).
Heat map showing the distribution of putative crosslinking sites (deletions) across all Eno1‐bound tRNAs. tRNA genes are sorted by increasing length, and the 3′ end for each gene is denoted in black. The domain structures of a typical short (tH(GUG)) and long (tS(AGA)) tRNA are included for comparison.
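
Two operations recur in these panels: scaling tracks to reads (or deletions) per million, and summing per‐nucleotide counts across tRNAs aligned at the 5′ or 3′ end. The sketch below illustrates both, assuming each tRNA is given as a list of per‐position counts. The function names and numbers are hypothetical and are not taken from the analysis pipeline used for the figure.

```python
def per_million(counts, total_mapped_reads):
    """Scale raw per-position counts to counts per million mapped reads."""
    return [c * 1_000_000 / total_mapped_reads for c in counts]


def metagene(profiles, length, align="5p"):
    """Sum per-position counts across genes aligned at one end.

    profiles: list of per-nucleotide count lists, one per tRNA gene.
    length: number of positions to keep from the aligned end (at least 1).
    align: "5p" aligns the 5' ends, anything else aligns the 3' ends.
    """
    summed = [0.0] * length
    for profile in profiles:
        if align == "5p":
            window, offset = profile[:length], 0
        else:
            window = profile[-length:]
            offset = length - len(window)  # right-justify shorter genes
        for i, value in enumerate(window):
            summed[offset + i] += value
    return summed


# Hypothetical deletion counts for a short and a longer tRNA.
profiles = [[1, 2, 3], [4, 5, 6, 7]]
from_5p = metagene(profiles, 3, align="5p")
from_3p = metagene(profiles, 3, align="3p")
scaled = per_million([5, 10], 2_000_000)
```

Because tRNAs differ in length, the two alignments give different sums, which is why the metagene plots show both views.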
