Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Jun;76(11):2043-2058.
doi: 10.1007/s00018-019-03096-3. Epub 2019 Apr 12.

The role of the protein-RNA recognition code in neurodegeneration

Affiliations
Review

The role of the protein-RNA recognition code in neurodegeneration

Jozef Nahalka. Cell Mol Life Sci. 2019 Jun.

Abstract

MicroRNAs are small endogenous RNAs that pair and bind to sites on mRNAs to direct post-transcriptional repression. However, there is a possibility that microRNAs directly influence protein structure and activity, and this influence can be termed post-translational riboregulation. This conceptual review explores the literature on neurodegenerative disorders. Research on the association between neurodegeneration and RNA-repeat toxicity provides data that support a protein-RNA recognition code. For example, this code explains why hnRNP H and SFPQ proteins, which are involved in amyotrophic lateral sclerosis, are sequestered by the (GGGGCC)n repeat sequence. Similarly, it explains why MNBL proteins and (CTG)n repeats in RNA, which are involved in myotonic dystrophy, are sequestered into RNA foci. Using this code, proteins involved in diseases can be identified. A simple protein BLAST search of the human genome for amino acid repeats that correspond to the nucleotide repeats reveals new proteins among already known proteins that are involved in diseases. For example, the (CAG)n repeat sequence, when transcribed into possible peptide sequences, leads to the identification of PTCD3, Rem2, MESP2, SYPL2, WDR33, COL23A1, and others. After confirming this approach on RNA repeats, in the next step, the code was used in the opposite manner. Proteins that are involved in diseases were compared with microRNAs involved in those diseases. For example, a reasonable correspondence of microRNA 9 and 107 with amyloid-β-peptide (Aβ42) was identified. In the last step, a miRBase search for micro-nucleotides, obtained by transcription of a prion amino acid sequence, revealed new microRNAs and microRNAs that have previously been identified as involved in prion diseases. This concept provides a useful key for designing RNA or peptide probes.

Keywords: Alzheimer’s disease; Huntington’s disease; Molecular recognition; Non-coding RNA; Parkinson’s disease; Prion diseases.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Protein–RNA recognition code. CCA trinucleotide is recognised as two dinucleotides—two-letter code (P-CC, Q-CA), or each nucleotide is recognised separately—one-letter code (T-C, T-C, N-A). One-letter code—second nucleotide in codons; two-letter code—first two nucleotides in codons. 3D5A crystal structure which contain release factor 1 (RF1) interacting with tRNA, as example [27]
Fig. 2
Fig. 2
Proteins sequestered into RNA foci (ac); and the recognition of polyadenylation signal by polyadenylation factor subunit 2 (d WDR33). a H component of the heterogeneous nuclear RNP (hnRNP H), recognising toxic (GGGGCC)n repeat; b muscleblind-like protein 1 (MNBL1), recognising toxic (CTG)n repeat; c the heat shock transcription factor 1 (HSF1), recognising toxic (CAG)n repeat; and d WDR33 recognising polyadenylation signal
Fig. 3
Fig. 3
Illustrated protein sequences that are recognised by a MIR34a, b MIR9, and c MIR107; and d cartoon of the Aβ42 fibril core, structure (5KK3)
Fig. 4
Fig. 4
Illustrated protein sequences that are recognised by a MIR7, b MIR153-1, and c MIR221-3p
Fig. 5
Fig. 5
Prion diseases. a Pathogenic mutations (red, [98]) shown in the sequence and crystal structure of human wild-type PrPC (5YJ5), short important amino acid sequences can be transcribed into nucleotide sequences and searched via the online miRBase. b, c Illustrated protein sequences that are recognised by MIR361 and MIR146a respectively

References

    1. Beadle GW, Tatum EL. Genetic control of biochemical reactions in Neurospora. Genetics. 1941;27:499–506. - PMC - PubMed
    1. Gerstein MB, Bruce C, Rozowsky JS, Zheng D, Du J, Korbel JO, Emanuelsson O, Zhang ZD, Weissman S, Snyder M. What is a gene, post-ENCODE? History and updated definition. Genome Res. 2007;17:669–681. doi: 10.1101/gr.6339607. - DOI - PubMed
    1. Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, Epstein CB, Frietze S, Harrow J, Kaul R, Khatun J, Lajoie BR, Landt SG, Lee B-, Pauli F, Rosenbloom KR, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. - DOI - PMC - PubMed
    1. Felsenfeld G. A brief history of epigenetics. Cold Spring Harbor Perspect Biol. 2014;6:a018200. doi: 10.1101/cshperspect.a018200. - DOI - PMC - PubMed
    1. Yao B, Christian KM, He C, Jin P, Ming G-, Song H. Epigenetic mechanisms in neurogenesis. Nat Rev Neurosci. 2016;17:537–549. doi: 10.1038/nrn.2016.70. - DOI - PMC - PubMed

MeSH terms

Substances