Using RNA secondary structures to guide sequence motif finding towards single-stranded regions
- PMID: 16987907
- PMCID: PMC1903381
- DOI: 10.1093/nar/gkl544
Using RNA secondary structures to guide sequence motif finding towards single-stranded regions
Abstract
RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.
Figures








Similar articles
-
Finding the target sites of RNA-binding proteins.Wiley Interdiscip Rev RNA. 2014 Jan-Feb;5(1):111-30. doi: 10.1002/wrna.1201. Epub 2013 Nov 11. Wiley Interdiscip Rev RNA. 2014. PMID: 24217996 Free PMC article. Review.
-
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data.Nucleic Acids Res. 2017 Nov 2;45(19):11004-11018. doi: 10.1093/nar/gkx756. Nucleic Acids Res. 2017. PMID: 28977546 Free PMC article.
-
Recognizing RNA structural motifs in HT-SELEX data for ribosomal protein S15.BMC Bioinformatics. 2017 Jun 6;18(1):298. doi: 10.1186/s12859-017-1704-y. BMC Bioinformatics. 2017. PMID: 28587636 Free PMC article.
-
A combined sequence and structure based method for discovering enriched motifs in RNA from in vivo binding data.Methods. 2017 Apr 15;118-119:73-81. doi: 10.1016/j.ymeth.2017.03.003. Epub 2017 Mar 6. Methods. 2017. PMID: 28274760
-
Recognition modes of RNA tetraloops and tetraloop-like motifs by RNA-binding proteins.Wiley Interdiscip Rev RNA. 2014 Jan-Feb;5(1):49-67. doi: 10.1002/wrna.1196. Epub 2013 Oct 3. Wiley Interdiscip Rev RNA. 2014. PMID: 24124096 Free PMC article. Review.
Cited by
-
Finding the target sites of RNA-binding proteins.Wiley Interdiscip Rev RNA. 2014 Jan-Feb;5(1):111-30. doi: 10.1002/wrna.1201. Epub 2013 Nov 11. Wiley Interdiscip Rev RNA. 2014. PMID: 24217996 Free PMC article. Review.
-
ProbeRating: a recommender system to infer binding profiles for nucleic acid-binding proteins.Bioinformatics. 2020 Sep 15;36(18):4797-4804. doi: 10.1093/bioinformatics/btaa580. Bioinformatics. 2020. PMID: 32573679 Free PMC article.
-
DynaMIT: the dynamic motif integration toolkit.Nucleic Acids Res. 2016 Jan 8;44(1):e2. doi: 10.1093/nar/gkv807. Epub 2015 Aug 7. Nucleic Acids Res. 2016. PMID: 26253738 Free PMC article.
-
Introduction to Bioinformatics Resources for Post-transcriptional Regulation of Gene Expression.Methods Mol Biol. 2022;2404:3-41. doi: 10.1007/978-1-0716-1851-6_1. Methods Mol Biol. 2022. PMID: 34694601
-
RNALigands: a database and web server for RNA-ligand interactions.RNA. 2022 Feb;28(2):115-122. doi: 10.1261/rna.078889.121. Epub 2021 Nov 3. RNA. 2022. PMID: 34732566 Free PMC article.
References
-
- Messias A.C., Sattler M. Structural basis of single-stranded RNA recognition. Acc. Chem. Res. 2004;37:279–287. - PubMed
-
- Hall K.B. RNA-protein interactions. Curr. Opin. Struct. Biol. 2002;12:283–288. - PubMed
-
- Thisted T., Lyakhov D.L., Liebhaber S.A. Optimized RNA targets of two closely related triple KH domain proteins, heterogeneous nuclear ribonucleoprotein K and alphaCP-2KL, suggest distinct modes of RNA recognition. J. Biol. Chem. 2001;276:17484–17496. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources