Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr:2012:1-6.

Short-read, high-throughput sequencing technology for STR genotyping

Affiliations

Short-read, high-throughput sequencing technology for STR genotyping

Daniel M Bornman et al. Biotech Rapid Dispatches. 2012 Apr.

Abstract

DNA-based methods for human identification principally rely upon genotyping of short tandem repeat (STR) loci. Electrophoretic-based techniques for variable-length classification of STRs are universally utilized, but are limited in that they have relatively low throughput and do not yield nucleotide sequence information. High-throughput sequencing technology may provide a more powerful instrument for human identification, but is not currently validated for forensic casework. Here, we present a systematic method to perform high-throughput genotyping analysis of the Combined DNA Index System (CODIS) STR loci using short-read (150 bp) massively parallel sequencing technology. Open source reference alignment tools were optimized to evaluate PCR-amplified STR loci using a custom designed STR genome reference. Evaluation of this approach demonstrated that the 13 CODIS STR loci and amelogenin (AMEL) locus could be accurately called from individual and mixture samples. Sensitivity analysis showed that as few as 18,500 reads, aligned to an in silico referenced genome, were required to genotype an individual (>99% confidence) for the CODIS loci. The power of this technology was further demonstrated by identification of variant alleles containing single nucleotide polymorphisms (SNPs) and the development of quantitative measurements (reads) for resolving mixed samples.

Keywords: Bridge PCR; Illumina; SNP; STR; forensic; genotyping; high-throughput sequencing; next-generation sequencing.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare no competing interests. The authors and their institutions do not specifically endorse any of the third-party products or technologies described in this report.

Figures

Figure 1
Figure 1
Visualization of mapped reads to specific alleles of the TPOX locus for two individual samples (index 2 and index 6) and a 1:1 mixture (index 12).
Figure 2
Figure 2. Probability of correct allele assignment for each CODIS STR locus
For each model, 10,000 draws were randomly obtained from a grid of (i) total reads, (ii) reads aligning to the in silico genome, and (iii) the number of reads mapping to the entire locus. The double x-axis represents a cumulative estimate of the (i) number of total raw sequencing reads and (ii) total number of reads aligning to the full STR in silico genome.

References

    1. Budowle B, Masibay A, Anderson SJ, Barna C, Biega L, Brenneke S, Brown BL, Cramer J, et al. STR primer concordance study. Forensic Sci Int. 2001;124:47–54. - PubMed
    1. Jobling MA, Gill P. Encoded evidence: DNA in forensic analysis. Nat Rev Genet. 2004;5:739–751. - PubMed
    1. Butler JM. Short tandem repeat typing technologies used in human identity testing. BioTechniques. 2007;43:ii–v. - PubMed
    1. Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 2004;5:435–445. - PubMed
    1. Budowle B, Moretti TR, Niezgoda SJ, Brown BL. CODIS and PCR-based short tandem repeat loci: law enforcement tools. Proceedings of the Second European Symposium on Human Identification; June 1998; Innsbruck, Austria. Madison, WI: Promega Corporation; 1998. pp. 73–88.

LinkOut - more resources