The accuracy, feasibility and challenges of sequencing short tandem repeats using next-generation sequencing platforms
- PMID: 25436869
- PMCID: PMC4250034
- DOI: 10.1371/journal.pone.0113862
Abstract
To date we have little knowledge of how accurately next-generation sequencing (NGS) technologies sequence repetitive sequences, beyond their known limitations in sequencing homopolymers. Only a handful of previous reports have evaluated the potential of NGS for sequencing short tandem repeats (microsatellites), and no empirical study has compared and evaluated the performance of more than one NGS platform with the same dataset. Here we examined yeast microsatellite variants from both long-read (454-sequencing) and short-read (Illumina) NGS platforms and compared these to data derived through Sanger sequencing. In addition, we investigated any locus-specific biases and differences that might have resulted from variability in microsatellite repeat number, repeat motif or type of mutation. Of the 112 insertion/deletion variants identified among 45 microsatellite amplicons in our study, we found 87.5% agreement between the 454-platform and Sanger sequencing in frequency of variant detection after Benjamini-Hochberg correction for multiple tests. For a subset of 21 microsatellite amplicons derived from Illumina sequencing, the results of the short-read platform were highly consistent with the other two platforms, with 100% agreement with 454-sequencing and 93.6% agreement with the Sanger method after Benjamini-Hochberg correction. We found that the microsatellite attributes copy number, repeat motif and type of mutation did not have a significant effect on the differences seen between the sequencing platforms. We show that both long-read and short-read NGS platforms can be used to sequence short tandem repeats accurately, which makes it feasible to consider the use of these platforms in high-throughput genotyping. The major requirement for achieving both high accuracy and rare variant detection in microsatellite genotyping appears to be sufficient read depth. This might be a challenge because each platform generates a consistent pattern of non-uniform sequence coverage, which, as our study suggests, may affect some types of tandem repeats more than others.
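The agreement figures above are reported after Benjamini-Hochberg correction for multiple tests. The abstract does not say which per-variant test was used or how agreement was tallied, so the following is only a minimal sketch under stated assumptions: each variant has one p-value from some test comparing its detection frequency between two platforms, and a variant counts as "in agreement" when that test is not significant after correction. The function names `benjamini_hochberg` and `agreement_rate`, the 0.05 threshold, and the example p-values are hypothetical and not taken from the study. Under this reading, 87.5% agreement over 112 variants would correspond to 98 variants.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Benjamini-Hochberg step-up procedure.

    Returns a list of booleans, in the input order, that is True where
    the null hypothesis is rejected at false discovery rate `alpha`.
    """
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_k = rank

    # Reject every hypothesis ranked at or below max_k.
    rejected = [False] * m
    for idx in order[:max_k]:
        rejected[idx] = True
    return rejected


def agreement_rate(p_values, alpha=0.05):
    """Fraction of variants with no significant platform difference.

    Assumption (not from the source): one p-value per variant, and a
    variant "agrees" when it is not rejected after BH correction.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    rejected = benjamini_hochberg(p_values, alpha)
    return 1.0 - sum(rejected) / len(rejected)


# Made-up p-values for eight hypothetical variants.
example = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
print(benjamini_hochberg(example))
print(agreement_rate(example))
```

The step-up rule matters here: a p-value that fails its own rank threshold is still rejected if a larger p-value passes a later threshold, which is why the procedure searches for the largest passing rank rather than stopping at the first failure.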