Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Mar;27(3):400-407.
doi: 10.1038/s41431-018-0302-4. Epub 2018 Nov 19.

Diagnostics of short tandem repeat expansion variants using massively parallel sequencing and componential tools

Affiliations

Diagnostics of short tandem repeat expansion variants using massively parallel sequencing and componential tools

Rick H de Leeuw et al. Eur J Hum Genet. 2019 Mar.

Abstract

Short tandem repeats (STRs) are scattered throughout the human genome. Some STRs, like trinucleotide repeat expansion (TRE) variants, cause hereditable disorders. Unambiguous molecular diagnostics of TRE disorders is hampered by current technical limitations imposed by traditional PCR and DNA sequencing methods. Here we report a novel pipeline for TRE variant diagnosis employing the massively parallel sequencing (MPS) combined with an opensource software package (FDSTools), which together are designed to distinguish true STR sequences from STR sequencing artifacts. We show that this approach can improve TRE diagnosis, such as Oculopharyngeal muscular dystrophy (OPMD). OPMD is caused by a trinucleotide expansion in the PABPN1 gene. A short GCN expansion, (GCN[10]), coding for a 10 alanine repeat is not pathogenic, but an alanine expansion is pathogenic. Applying this novel procedure in a Dutch OPMD patient cohort, we found expansion variants from GCN[11] to GCN[16], with the GCN[16] as the most abundant variant. The repeat expansion length did not correlate with clinical features. However, symptom severity was found to correlate with age and with the initial affected muscles, suggesting that aging and muscle-specific factors can play a role in modulating OPMD.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
A flowchart of the DNAseq procedure
Fig. 2
Fig. 2
Examples of bioanalyses and FDSTools sequence results in OPMD and controls. Each row shows the result from a single subject. The left panel shows DNA histograms, which were generated by the DNA Bioanalyzer application. The right panel shows the graphical output of FDStools sequence analysis for an anonymous control and OPMD patients with five different GCN expansion lengths. The genuine alleles are highlighted in bold. The wild-type allele is depicted in black and the expanded allele in red. Non-genuine reads (PCR stutters, PCR-generated point mutations and sequencing errors) are shown in gray
Fig. 3
Fig. 3
Stutters in trinucleotide GCN expansion. a Analysis of the wild-type allele (GCN[10]) in control and OPMD patients. The number of reads (left panel) or the percentage of reads with respect to the highest allele in the sample (right panel) are plotted against GCN length results. In the control group, results are from one PCR experiment. In the OPMD group, results are from two experiments: 20 amplification cycles (PCR1) or 40 amplification cycles (PCR2). The stutter percentage was significantly higher in the second experiment. Genuine allele length is depicted in bold and stutter allele length is in plain text. P-value was calculated by the Student’s T-test. b Analysis of sequencing results in OPMD subjects from two experiments. Plots show percentage of reads with respect to the highest allele in the sample for subjects with the same genotype (heterozygous: GCN[12], GCN[13], GCN[14], GCN[15], GCN[16], and homozygous GCN[11]). In each plot, the left side shows the wild-type GCN[10] and corresponding stutter GCN[9], the right side shows the expanded allele and its corresponding stutters. Genuine GCN length is depicted in bold and stutters are plain. Each dot represents an individual. Median is marked with a black line
Fig. 4
Fig. 4
The frequency of GCN expansion length in the Dutch OPMD patients. Bar chart shows the frequency of GCN expansion variants in familial and sporadic OPMD. The GCN[16] expansion is enriched in the Dutch familial OPMD. The GCN[15] expansion is found only in sporadic OPMD, the GCN[11] expansion is only familial

References

    1. Tilanus MGJ. Short tandem repeat markers in diagnostics: what's in a repeat? Leukemia. 2006;20:1353. doi: 10.1038/sj.leu.2404273. - DOI - PubMed
    1. Budworth H, McMurray CT. A brief history of triplet repeat diseases. Methods Mol Biol. 2013;1010:3–17. doi: 10.1007/978-1-62703-411-1_1. - DOI - PMC - PubMed
    1. Cummings CJ, Zoghbi HY. Fourteen and counting: unraveling trinucleotide repeat diseases. Hum Mol Genet. 2000;9:909–16. doi: 10.1093/hmg/9.6.909. - DOI - PubMed
    1. Tan EC, Lai PS. Molecular diagnosis of neurogenetic disorders involving trinucleotide repeat expansions. Expert Rev Mol Diagn. 2005;5:101–9. doi: 10.1586/14737159.5.1.101. - DOI - PubMed
    1. Groen JL, de Bie RMA, Foncke EMJ, Roos RAC, Leenders KL, Tijssen MAJ. Late-onset Huntington disease with intermediate CAG repeats: true or false? J Neurol, Neurosurg. 2010;81:228–30. doi: 10.1136/jnnp.2008.170902. - DOI - PubMed

Publication types

Grants and funding