Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct:48:568-580.
doi: 10.1016/j.ebiom.2019.09.020. Epub 2019 Oct 10.

A genetic association study of glutamine-encoding DNA sequence structures, somatic CAG expansion, and DNA repair gene variants, with Huntington disease clinical outcomes

Affiliations

A genetic association study of glutamine-encoding DNA sequence structures, somatic CAG expansion, and DNA repair gene variants, with Huntington disease clinical outcomes

Marc Ciosi et al. EBioMedicine. 2019 Oct.

Abstract

Background: Huntington disease (HD) is caused by an unstable CAG/CAA repeat expansion encoding a toxic polyglutamine tract. Here, we tested the hypotheses that HD outcomes are impacted by somatic expansion of, and polymorphisms within, the HTT CAG/CAA glutamine-encoding repeat, and DNA repair genes.

Methods: The sequence of the glutamine-encoding repeat and the proportion of somatic CAG expansions in blood DNA from participants inheriting 40 to 50 CAG repeats within the TRACK-HD and Enroll-HD cohorts were determined using high-throughput ultra-deep-sequencing. Candidate gene polymorphisms were genotyped using kompetitive allele-specific PCR (KASP). Genotypic associations were assessed using time-to-event and regression analyses.

Findings: Using data from 203 TRACK-HD and 531 Enroll-HD participants, we show that individuals with higher blood DNA somatic CAG repeat expansion scores have worse HD outcomes: a one-unit increase in somatic expansion score was associated with a Cox hazard ratio for motor onset of 3·05 (95% CI = 1·94 to 4·80, p = 1·3 × 10-6). We also show that individual-specific somatic expansion scores are associated with variants in FAN1 (pFDR = 4·8 × 10-6), MLH3 (pFDR = 8·0 × 10-4), MLH1 (pFDR = 0·004) and MSH3 (pFDR = 0·009). We also show that HD outcomes are best predicted by the number of pure CAGs rather than total encoded-glutamines.

Interpretation: These data establish pure CAG length, rather than encoded-glutamine, as the key inherited determinant of downstream pathophysiology. These findings have implications for HD diagnostics, and support somatic expansion as a mechanistic link for genetic modifiers of clinical outcomes, a driver of disease, and potential therapeutic target in HD and related repeat expansion disorders.

Funding: CHDI Foundation.

Keywords: DNA repair; Genetic association study; Huntington disease; Somatic expansion.

PubMed Disclaimer

Conflict of interest statement

MC, AM, SAC, DJHM, AMA, MDF, AD, BRL, PH, and LJ have nothing to disclose. RACR reports personal fees from UniQure, outside the submitted work. DRL reports personal fees from Roche Pharmaceutical, Voyager, Teva Pharmaceutical, Wave Life Sciences, Takeda Pharmaceutical Company and Axon Advisors, and other from CHDI, outside the submitted work. SK reports and is employed by CHDI Management, Inc., as an advisor to the CHDI Foundation. SJT reports personal fees from Alnylam Pharmaceuticals, DDF Discovery, F. Hoffmann-La Roche, Genentech, GSK, Heptares Therapeutics, Takeda Pharmaceutical Company, Teva Pharmaceuticals, Triplet Therapeutics, UCB Pharma and Vertex Pharmaceuticals, outside the submitted work. DGM reports other from CHDI Foundation, during the conduct of the study and personal fees from Vertex Pharmaceuticals, LoQus23 Therapeutics and Triplet Therapeutics, outside the submitted work.

Figures

Fig. 1
Fig. 1
Study design and sample exclusions.
Fig. 2
Fig. 2
Allelic variation at the HTT exon one repeat locus. The schematic diagram shows the typical reference allele structure of the CAG/CCG repeat region in exon one of HTT (top). We here define the number of pure CAGs as Q1 and the number of additional downstream glutamine-encoding CAA/CAG repeats as Q2. Thus, total encoded-glutamine QT = Q1 + Q2. We here define the number of copies of the proline codons before the pure CCG repeat as P1, the number of pure CCGs as P2, and the number of copies of the CCT proline codons as P3. Thus, total encoded-proline PT = P1 + P2 + P3. The figure shows schematic representations of the atypical alleles observed. Repeat codons are depicted: CAG glutamine codons as red boxes; CAA glutamine codons as green boxes; CCG proline codons as blue boxes; CCA proline codons as yellow boxes; and CCT proline codons as pink boxes. Disease-associated alleles with either zero, three or four downstream glutamine codons (Q2 = 0, 3 or 4), are indicated with a red upward triangle, green diamond, or green downward triangle, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 3
Fig. 3
Somatic expansion in Huntington disease. A) The ratio of somatic expansions of the HTT exon one CAG repeat is allele length and age-dependent. The graph shows the ratio of somatic expansions (numberofsomaticexpansionproductsnumberofprogenitoralleleproducts) in blood DNA plotted against the age at sampling for 746 participants in the TRACK-HD and Enroll-HD cohorts. Each point is colour coded with respect to the length of the inherited progenitor allele length (Q1). The scatterplot also shows the sex- and cohort-adjusted fitted regression lines for each pure CAG repeat length (Q1) derived from model SEQ1 which includes an interaction between age at sampling and pure CAG repeat length (Q1) (Table S2, appendix). B) Somatic expansion scores in blood DNA predict age at onset in Huntington disease. The graph shows adjusted time to event curves from a Cox proportional hazard model of HD motor onset according to the sign of the pure CAG somatic expansion score (determined using model SEQ1, Table S2, appendix). The solid lines represent the age-dependent probability of HD motor onset: orange for participants with positive expansion scores; and green for participants with negative expansion scores. The light shaded regions represent the 95% CI for the onset probability curves. A vertical tick mark on the curves indicates that a participant was censored at this time. Vertical dashed lines indicate median survival for each somatic expansion score category: individuals with a positive blood DNA somatic expansion score had a median age at onset = 46 years (95% CI 45 to 48); whilst individuals with a negative blood DNA somatic expansion score had a median age at onset = 49 years (95% CI 47 to 50). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4
Fig. 4
Associations between HTT exon one CAG repeat structures and HD clinical outcomes and biases associated with the prediction of HD clinical outcomes based on fragment length estimates of CAG. The adjusted probability curves of age at HD motor onset (A/B) show the probability of motor onset predicted on the basis of the pure CAG length (Q1, A) or the fragment length estimate of CAG (QFL, B) for each number of additional glutamine codons genotype (Q2 = 0, red; Q2 = 2, black; Q2 = 4, green). The light shaded regions represent the 95% CI for the onset probability curves. A vertical tick mark on the curves indicate that a participant was censored at this time. Vertical dashed lines indicate median survival for each Q2 genotype. For the pure CAG (Q1) estimates (A): individuals Q2 = 0 had a median age at onset = 38 years (95% CI = 34 to 45); individuals Q2 = 2 had a median age at onset = 48 years (95% CI = 46 to 49); and individuals Q2 = 4 had a median age at onset = 48 years (95% CI = 45 to 54). For the CAG fragment length (QFL) estimates (B): individuals Q2 = 0 had a median age at onset = 32 years (95% CI = 29 to 39); individuals Q2 = 2 had a median age at onset = 48 years (95% CI = 46 to 49); and individuals Q2 = 4 had a median age at onset = 51 years (95% CI = 48 to 59). The scatterplots (C/E) and the whisker plots (D/F) show the ranked TRACK-HD progression score predicted on the basis of the fragment length estimate of CAG (QFL, C/D) or pure CAG (Q1, E/F) dependent on the number of additional glutamine codons (Q2). The progression scores are normally distributed around zero, with zero representing the population average disease progression. Negative values represent individuals with slower than average disease progression and positive values represent individuals with faster than average disease progression. The whisker plots show the mean (diamond) and 95% CI (whiskers) on the basis of the fragment length estimate of CAG (QFL, E) or pure CAG length (Q1, F) dependent on the number of additional glutamine codons (Q2 = 0, red; Q2 = 2, black; Q2 = 4, green). The whisker plots (G/H/I/J) show the least-square means for the rate of change of TMS (TMS rate, G) and TFC (TFC rate, H) predicted on the basis of the pure CAG length (Q1), and TMS predicted on the basis of the pure CAG (Q1, G) or fragment length estimate of CAG (QFL, H) for each number of additional glutamine codons (Q2) genotype (i.e. the means associated with each number of additional glutamine codons (Q2) genotype adjusted for all other variables in the linear regression).

References

    1. McGinty R.J., Mirkin S.M. Cis- and trans-modifiers of repeat expansions: blending model systems with human genetics. Trends Genet. 2018;34:448–465. - PMC - PubMed
    1. Caron N.S., Wright G.E.B., Hayden M.R. Huntington disease. In: Adam M., Ardinger H., Pagon R., editors. GeneReviews® [Internet] University of Washington, Seattle; Seattle (WA): 1998. Updated 2018 Jul 5.
    1. Kay C., Collins J.A., Miedzybrodzka Z. Huntington disease reduced penetrance alleles occur at high frequency in the general population. Neurology. 2016;87:282–288. - PMC - PubMed
    1. Kennedy L., Evans E., Chen C.M. Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis. Hum Mol Genet. 2003;12:3359–3367. - PubMed
    1. Shelbourne P.F., Keller-McGandy C., Bi W.L. Triplet repeat mutation length gains correlate with cell-type specific vulnerability in Huntington disease brain. Hum Mol Genet. 2007;16:1133–1142. - PubMed

MeSH terms