Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Feb 28:13:843014.
doi: 10.3389/fgene.2022.843014. eCollection 2022.

Trinucleotide CGG Repeat Diseases: An Expanding Field of Polyglycine Proteins?

Affiliations
Review

Trinucleotide CGG Repeat Diseases: An Expanding Field of Polyglycine Proteins?

Manon Boivin et al. Front Genet. .

Abstract

Microsatellites are repeated DNA sequences of 3-6 nucleotides highly variable in length and sequence and that have important roles in genomes regulation and evolution. However, expansion of a subset of these microsatellites over a threshold size is responsible of more than 50 human genetic diseases. Interestingly, some of these disorders are caused by expansions of similar sequences, sizes and localizations and present striking similarities in clinical manifestations and histopathological features, which suggest a common mechanism of disease. Notably, five identical CGG repeat expansions, but located in different genes, are the causes of fragile X-associated tremor/ataxia syndrome (FXTAS), neuronal intranuclear inclusion disease (NIID), oculopharyngodistal myopathy type 1 to 3 (OPDM1-3) and oculopharyngeal myopathy with leukoencephalopathy (OPML), which are neuromuscular and neurodegenerative syndromes with overlapping symptoms and similar histopathological features, notably the presence of characteristic eosinophilic ubiquitin-positive intranuclear inclusions. In this review we summarize recent finding in neuronal intranuclear inclusion disease and FXTAS, where the causing CGG expansions were found to be embedded within small upstream ORFs (uORFs), resulting in their translation into novel proteins containing a stretch of polyglycine (polyG). Importantly, expression of these polyG proteins is toxic in animal models and is sufficient to reproduce the formation of ubiquitin-positive intranuclear inclusions. These data suggest the existence of a novel class of human genetic pathology, the polyG diseases, and question whether a similar mechanism may exist in other diseases, notably in OPDM and OPML.

Keywords: RAN translation; microsatellite; neurodegeneration; protein aggregates toxicity; trinucleotide repeat disorders.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
CGG repeat expansions cause a spectrum of disease. (A) Identical CGG repeat expansions embedded within the 5′UTR of different genes cause various neurodevelopmental, neuromuscular and neurodegenerative disorders. (B) Brain sections of individuals with FXTAS or NIID show identical p62-or sumo-positive intranuclear inclusions. (C) FXTAS, NIID, OPML, and OPDM may belong to a continuum of neuromuscular and neurodegenerative disorders.
FIGURE 2
FIGURE 2
CGG repeat expansions are translated into polyG proteins in FXTAS and NIID. (A) Scheme of FMR1 indicating FMRpolyG upstream ORF and FMRP main ORF localization. (B) Scheme of NOTCH2NLC indicating uN2CpolyG upstream ORF and NOTCH2NLC (abbreviated N2C) main ORF localization. Initiation codons and stop codons are indicated in red for uORFs and in blue for the main ORFs. Near-cognate initiations codons are indicated in bold yellow. (C) Protein sequences of FMRpolyG and uN2CpolyG show no similitude beyond their polyglycine stretch. (D) Sequences of the putative uORFs embedded within NOTCH2, NOTCHNLA, B and C 5′UTRs. Variant amino acids are indicated in bold. The sequence required for uN2C to interact with KU70/KU80 is indicated in blue.
FIGURE 3
FIGURE 3
Repeat expansions located in “non-coding” regions are nonetheless translated. (A) CGG repeat expansions embedded in FMR1 and NOTCH2NLC 5′UTR are translated into polyglycine-containing proteins in NIID and FXTAS, respectively. (B) G3C2T and G4C2 repeats embedded in the first unspliced intron of the NOP56 and C9ORF72 genes are translated into poly(glycine-proline) and poly(glycine-alanine)-containing proteins in SCA36 and ALS/FTD, respectively. (C) CAG repeats and G2C4 repeats embedded in ATXN8, JPH3AS and C9ORF72AS long “non-coding” RNAs are translated into polyglutamine- or poly(glycine-proline)-containing proteins in SCA8, HDL2, and ALS/FTD, respectively. Initiation codons and stop codons of the ORFs containing the pathogenic expansion are indicated in red, while they are indicated in blue for the main ORFs. Near-cognate initiations codons are indicated in bold yellow.
FIGURE 4
FIGURE 4
Repeat expansion range of toxicity in different microsatellite diseases. CGG repeat expansions are pathogenic between ∼60–70 and 200–300 repeats when they are expressed into toxic polyglycine-containing proteins, but over 200 repeats when they promote DNA epigenetic changes, promoter silencing and a loss of function mechanism, such as in FXS. This range of protein toxicity is to be compared to other translated microsatellite diseases such as polyalanine-containing proteins that are generally toxic with 10–35 GCN repeats, while polyQ proteins are generally pathogenic with longer CAG expansions (∼40–80–200 repeats), with the exception of short polyglutamine stretch altering functions of the calcium voltage-gated channel subunit alpha1A (CACNA1A) in SCA6. In contrast, an RNA gain of function mechanism such as titration of the MBNL RNA binding proteins in DM1, require much longer repeat expansions.

Similar articles

Cited by

References

    1. Akarsu A., Stoilov I., Yilmaz E., Sayli B. S., Sarfarazi M. (1996). Genomic Structure of HOXD13 Gene: a Nine Polyalanine Duplication Causes Synpolydactyly in Two Unrelated Families. Hum. Mol. Genet. 5 (7), 945–952. 10.1093/hmg/5.7.945 - DOI - PubMed
    1. Albrecht A., Mundlos S. (2005). The Other Trinucleotide Repeat: Polyalanine Expansion Disorders. Curr. Opin. Genet. Develop. 15 (3), 285–293. 10.1016/j.gde.2005.04.003 - DOI - PubMed
    1. Almeida S., Krishnan G., Rushe M., Gu Y., Kankel M. W., Gao F.-B. (2019). Production of Poly(GA) in C9ORF72 Patient Motor Neurons Derived from Induced Pluripotent Stem Cells. Acta Neuropathol. 138 (6), 1099–1101. 10.1007/s00401-019-02083-z - DOI - PMC - PubMed
    1. Amiel J., Laudier B., Attié-Bitach T., Trang H., de Pontual L., Gener B., et al. (2003). Polyalanine Expansion and Frameshift Mutations of the Paired-like Homeobox Gene PHOX2B in Congenital central Hypoventilation Syndrome. Nat. Genet. 33 (4), 459–461. 10.1038/ng1130 - DOI - PubMed
    1. Ariza J., Rogers H., Monterrubio A., Reyes-Miranda A., Hagerman P. J., Martínez-Cerdeño V. (2016). A Majority of FXTAS Cases Present with Intranuclear Inclusions within Purkinje Cells. Cerebellum 15 (5), 546–551. 10.1007/s12311-016-0776-y - DOI - PubMed