Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2014 Dec 1;9(12):e113862.
doi: 10.1371/journal.pone.0113862. eCollection 2014.

The accuracy, feasibility and challenges of sequencing short tandem repeats using next-generation sequencing platforms

Affiliations
Comparative Study

The accuracy, feasibility and challenges of sequencing short tandem repeats using next-generation sequencing platforms

Monika Zavodna et al. PLoS One. .

Abstract

To date we have little knowledge of how accurate next-generation sequencing (NGS) technologies are in sequencing repetitive sequences beyond known limitations to accurately sequence homopolymers. Only a handful of previous reports have evaluated the potential of NGS for sequencing short tandem repeats (microsatellites) and no empirical study has compared and evaluated the performance of more than one NGS platform with the same dataset. Here we examined yeast microsatellite variants from both long-read (454-sequencing) and short-read (Illumina) NGS platforms and compared these to data derived through Sanger sequencing. In addition, we investigated any locus-specific biases and differences that might have resulted from variability in microsatellite repeat number, repeat motif or type of mutation. Out of 112 insertion/deletion variants identified among 45 microsatellite amplicons in our study, we found 87.5% agreement between the 454-platform and Sanger sequencing in frequency of variant detection after Benjamini-Hochberg correction for multiple tests. For a subset of 21 microsatellite amplicons derived from Illumina sequencing, the results of short-read platform were highly consistent with the other two platforms, with 100% agreement with 454-sequencing and 93.6% agreement with the Sanger method after Benjamini-Hochberg correction. We found that the microsatellite attributes copy number, repeat motif and type of mutation did not have a significant effect on differences seen between the sequencing platforms. We show that both long-read and short-read NGS platforms can be used to sequence short tandem repeats accurately, which makes it feasible to consider the use of these platforms in high-throughput genotyping. It appears the major requirement for achieving both high accuracy and rare variant detection in microsatellite genotyping is sufficient read depth coverage. This might be a challenge because each platform generates a consistent pattern of non-uniform sequence coverage, which, as our study suggests, may affect some types of tandem repeats more than others.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: RB is affiliated to AgResearch Limited and contributed to this study as a consultant. NJG was previously a PLOS ONE Editorial Board member until early 2014. These do not alter the authors' adherence to PLOS ONE Editorial policies and criteria.

Figures

Figure 1
Figure 1. Summary of detected microsatellite InDel mutations from three sequencing platforms.
Plot of the microsatellite InDel variants per amplicon (m) detected by Illumina- (white bars), 454-sequencing (grey bars) and Sanger sequencing (black bars). Insertions are denoted by bars above the X-axis and deletions are denoted by bars below the X-axis. The length of each bar segment is proportional to the length of the InDel in repeat units inserted or deleted. The amplicons' characteristics are summarized in Table S1.

Similar articles

Cited by

References

    1. Abdelkrim J, Robertson BC, Stanton J-AL, Gemmell NJ (2009) Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. Biotechniques 46:185–192. - PubMed
    1. Zavodna M, Grueber CE, Gemmell NJ (2013) Parallel tagged next-generation sequencing on pooled samples – a new approach for population genetics in ecology and conservation. PloS ONE 8:e61471. - PMC - PubMed
    1. Schmidt D, Wilson MD, Ballester B, Schwalie PC, Brown GD, et al. (2010) Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding. Science 328:1036–1040. - PMC - PubMed
    1. Santure AW, Gratten J, Mossman JA, Sheldon BC, Slate J (2011) Characterization of the transcriptome of a wild great tit Parus major population by next generation sequencing. BMC Genomics 12:283. - PMC - PubMed
    1. Dunning LT, Dennis AB, Park DC, Sinclair BJ, Newcomb RD, et al. (2013) Identification of cold-responsive genes in a New Zealand alpine stick insect using RNA-Seq. Comp Biochem Physiol D - Genomics Proteomics 8:24–31. - PubMed

Publication types

MeSH terms