Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Mar 16;8(19):10582-10592.
doi: 10.1039/c8ra00598b. eCollection 2018 Mar 13.

A structural dissection of protein-RNA interactions based on different RNA base areas of interfaces

Affiliations

A structural dissection of protein-RNA interactions based on different RNA base areas of interfaces

Wen Hu et al. RSC Adv. .

Abstract

Protein-RNA interactions are very common cellular processes, but the mechanisms of interactions are not fully understood, mainly due to the complicated RNA structures. By the elaborate investigation on RNA structures of protein-RNA complexes, it was firstly found in this paper that RNAs in these complexes could be clearly classified into three classes (high, medium and low) based on the different levels of P base (the percentage of base area buried in the RNA interface). In view of the three RNA classes, more detailed analyses on protein-RNA interactions were comprehensively performed from various aspects, including interface area, structure, composition and interaction force, so as to achieve a deeper understanding of the recognition specificity for the three classes of protein-RNA interactions. According to our classification strategy, the three complex classes have significant differences in terms of almost all properties. Complexes in the high class have short and extended RNA structures and behave like protein-ssDNA interactions. Their hydrogen bonds and hydrophobic interactions are strong. For complexes in low class, their RNA structures are mainly double-stranded, like protein-dsDNA interactions, and electrostatic interactions frequently occur. The complexes in medium class have the longest RNA chains and largest average interface area. Meanwhile, they do not show any preference for the interaction force. On average, in terms of composition, secondary structures and intermolecular physicochemical properties, significant feature preferences can be observed in high and low complexes, but no highly specific features are found for medium complexes. We found that our proposed P base is an important parameter which can be used as a new determinant to distinguish protein-RNA complexes. For high and low complexes, we can more easily understand the specificity of the recognition process from the interface features than for medium complexes. In the future, medium complexes should be our research focus to further structurally analyze from more feature aspects. Overall, this study may contribute to further understanding of the mechanism of protein-RNA interactions on a more detailed level.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts to declare.

Figures

Fig. 1
Fig. 1. Violin plot combining the box plot and density trace for Pribose, Pphosphate and Pbase in the initial dataset.
Fig. 2
Fig. 2. Distribution of Pribose, Pphosphate and Pbase in 137 non-redundant protein–RNA complexes. (A) The percentage of Pribose, Pphosphate and Pbase in different types of protein–RNA complexes. (B) The box plot for Pbase in each class.
Fig. 3
Fig. 3. Bubble chart of Rpair in the three classes. The bigger bubble and the deeper color indicate higher frequency of Rpair in each class.
Fig. 4
Fig. 4. Size of the interfaces and interface area ratio of our dataset. (A) The frequency histogram of interface area size. (B) The box plot for interface area ratio of the three classes.
Fig. 5
Fig. 5. Analysis of the correlation coefficients between Natoms (or Nresidues/Nnucleotides) and interface area. (A) Number of interface atoms against the interface area for the three classes. (B) Interface residues or nucleotides against the interface area for the three classes.
Fig. 6
Fig. 6. Amino acid composition and propensity of each class. (A) The average percentage of 20 amino acids in the interface of each class. (B) The average propensity of 20 amino acids in each class.
Fig. 7
Fig. 7. Protein secondary structure composition and propensity of each class. (A) The average percentage of secondary structures in the interface of each class. (B) The average propensity of secondary structures in each class.
Fig. 8
Fig. 8. The distribution of the overlapped percentage between the largest electrostatic positive patches and interface.
Fig. 9
Fig. 9. Pie chart of the distribution of the five categories of h (the percent overlap between hydrophobic interface and RNA-binding interfaces) in each class. The first to the fifth categories are designated as the h ranges of 0–20%, 20–40%, 40–60%, 60–80% and 80–100%, respectively, as indicated by dark blue, red, green, dark purple and cyan, respectively.

Similar articles

Cited by

References

    1. Rohs R. West S. M. Sosinsky A. Liu P. Mann R. S. Honig B. Nature. 2009;461:1248–1253. doi: 10.1038/nature08473. - DOI - PMC - PubMed
    1. Charoensawan V. Wilson D. Teichmann S. A. Nucleic Acids Res. 2010;38:7364–7377. doi: 10.1093/nar/gkq617. - DOI - PMC - PubMed
    1. Cai Y. H. Huang H. Amino Acids. 2012;43:1141–1146. doi: 10.1007/s00726-012-1377-9. - DOI - PubMed
    1. Re A. Joshi T. Kulberkyte E. Morris Q. Workman C. T. Methods Mol. Biol. 2014;1097:491–521. - PubMed
    1. Rinn J. L. Ule J. Genome Biol. 2014;15:401. doi: 10.1186/gb4158. - DOI - PMC - PubMed