Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 4;47(10):4958-4969.
doi: 10.1093/nar/gkz250.

Discovering sequence and structure landscapes in RNA interaction motifs

Affiliations

Discovering sequence and structure landscapes in RNA interaction motifs

Marta Adinolfi et al. Nucleic Acids Res. .

Abstract

RNA molecules are able to bind proteins, DNA and other small or long RNAs using information at primary, secondary or tertiary structure level. Recent techniques that use cross-linking and immunoprecipitation of RNAs can detect these interactions and, if followed by high-throughput sequencing, molecules can be analysed to find recurrent elements shared by interactors, such as sequence and/or structure motifs. Many tools are able to find sequence motifs from lists of target RNAs, while others focus on structure using different approaches to find specific interaction elements. In this work, we make a systematic analysis of RBP-RNA and RNA-RNA datasets to better characterize the interaction landscape with information about multi-motifs on the same RNAs. To achieve this goal, we updated our BEAM algorithm to combine both sequence and structure information to create pairs of patterns that model motifs of interaction. This algorithm was applied to several RNA binding proteins and ncRNAs interactors, confirming already known motifs and discovering new ones. This landscape analysis on interaction variability reflects the diversity of target recognition and underlines that often both primary and secondary structure are involved in molecular recognition.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Absolute frequency of co-localized pairs of motifs in the CLIP (left) and eCLIP (right) datasets. The majority of sequence-structural pairs are centered around 19% with extreme cases of ∼40% (orange line). More cases of co-localization on RNAs are present in the sequence-sequence pairs while the structure-structure pairs are relatively low, with a mean of 10%. eCLIP results show a generalized higher co-localization because of the lower experimental noise produced by the assay.
Figure 2.
Figure 2.
RBM5 interaction landscape. Top: sequence and structure motifs shared by 45% of the RNA molecules. Bottom: location of sequence (left) and structure (right) motifs; the residue in position 0 corresponds to the start of the binding site reported by the experiment. Lower right: score distributions of input sequences and background sequences with respect to the motif model.
Figure 3.
Figure 3.
TIAL1 interaction landscape. Top: sequence and structure motifs shared by 49% of the sequences. Bottom: location of sequence (left) and structure (center) motifs; the residue in position 0 corresponds to the start of the binding site reported by the experiment. Lower right: score distributions of input sequences and background sequences with respect to the motif model.
Figure 4.
Figure 4.
Structural trend and sequence motifs for miRNA targets. Red lines correspond to PSSS mean score; the weblogo shows the sequence motif identified (top) in that specific regions compared to background (bottom). Here, position 0 corresponds to miRNA binding site as described in methods.
Figure 5.
Figure 5.
Overview of motifs results for miRNA targets. Each spot is a sequence-structure motif pair found in one of the 383 miRNA set of targets. Ten representative pairs are shown in red circles. The size of the spot is proportional to the size of the dataset and the color represents the co-coverage, the position along x- and y- axes are respectively the reverse log10P-value for structure and sequence motifs.
Figure 6.
Figure 6.
TINCR interaction landscape. Top: sequence and structure motifs shared by 16% of the sequences. Bottom: location of sequence (left) and structure (center) motifs; the residue in position #0 corresponds to the start of the binding site reported by the experiment. Lower right: score distributions of input sequences and background sequences with respect to the motif model.
Figure 7.
Figure 7.
XIST interaction landscape. Top: sequence and structure motifs shared by 42% of the sequences. Bottom: location of sequence (left) and structure (center) motifs; the residue in position 0 corresponds to the start of the binding site reported by the experiment. Lower right: score distributions of input sequences and background sequences with respect to the motif model.

References

    1. Dominguez D., Freese P., Alexis M.S., Su A., Hochman M., Palden T., Bazile C., Lambert N.J., Van Nostrand E.L., Pratt G.A. et al. .. Sequence, structure, and context preferences of human RNA binding proteins. Mol. Cell. 2018; 70:854–867. - PMC - PubMed
    1. Gerstberger S., Hafner M., Tuschl T.. A census of human RNA-binding proteins. Nat. Rev. Genet. 2014; 15:829–845. - PMC - PubMed
    1. Hannigan M.M., Zagore L.L., Licatalosi D.D.. Mapping transcriptome-wide protein–RNA interactions to elucidate RNA regulatory programs. Quant. Biol. 2018; 6:228–238. - PMC - PubMed
    1. Ferrè F., Colantoni A., Helmer-Citterich M.. Revealing protein–lncRNA interaction. Brief. Bioinform. 2016; 17:106–116. - PMC - PubMed
    1. Licatalosi D.D., Mele A., Fak J.J., Ule J., Kayikci M., Chi S.W., Clark T.A., Schweitzer A.C., Blume J.E., Wang X. et al. .. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature. 2008; 456:464–469. - PMC - PubMed

Publication types

MeSH terms