Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 5;37(5):731-733.
doi: 10.1093/bioinformatics/btaa736.

TRTools: a toolkit for genome-wide analysis of tandem repeats

Affiliations

TRTools: a toolkit for genome-wide analysis of tandem repeats

Nima Mousavi et al. Bioinformatics. .

Abstract

Summary: A rich set of tools have recently been developed for performing genome-wide genotyping of tandem repeats (TRs). However, standardized tools for downstream analysis of these results are lacking. To facilitate TR analysis applications, we present TRTools, a Python library and suite of command line tools for filtering, merging and quality control of TR genotype files. TRTools utilizes an internal harmonization module, making it compatible with outputs from a wide range of TR genotypers.

Availability and implementation: TRTools is freely available at https://github.com/gymreklab/TRTools. Detailed documentation is available at https://trtools.readthedocs.io.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
TRTools visualizations. (a) Allele frequency distribution at an example pentanucleotide TR output by statSTR based on GangSTR genotypes for two sample sets (YRI population consisting of Yorubans from Nigeria and CEU population of Northwestern European descent). (b) Example TR genotype comparison output by compareSTR. The plot compares genotypes (in terms of number of repeats difference from hg19) from HipSTR (x-axis) to those from ExpansionHunter (y-axis) on 5000 tetranucleotide TRs. Bubble sizes give the number of calls included in each point. (c) Example reference bias plot output by qcSTR using popSTR2 genotypes. The plot shows the average deviation of TR alleles called versus the reference length of the TR (in bp). The red line shows the cumulative percentage of allele calls below each reference length threshold

References

    1. Bakhtiari M. et al. (2018) Targeted genotyping of variable number tandem repeats with adVNTR. Genome Res., 28, 1709–1719. - PMC - PubMed
    1. Casbon J. (2012) PyVCF – A Variant Call Format Parser for Python. Available from https://pyvcf.readthedocs.io/
    1. Dolzhenko E. et al.; The US–Venezuela Collaborative Research Group. (2017) Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res., 27, 1895–1903. - PMC - PubMed
    1. Hannan A.J. (2018) Tandem repeats mediating genetic plasticity in health and disease. Nat. Rev. Genet., 19, 286–298. - PubMed
    1. Kristmundsdottir S. et al. (2020) popSTR2 enables clinical and population-scale genotyping of microsatellites. Bioinformatics, 36, 2269–2271. - PMC - PubMed

Publication types