Targeted genotyping of variable number tandem repeats with adVNTR
- PMID: 30352806
- PMCID: PMC6211647
- DOI: 10.1101/gr.235119.118
Abstract
Whole-genome sequencing is increasingly used to identify Mendelian variants in clinical pipelines. These pipelines focus on single-nucleotide variants (SNVs) and structural variants, while ignoring more complex repeat sequence variants. Here, we consider the problem of genotyping variable number tandem repeats (VNTRs), which are composed of inexact tandem duplications of short (6-100 bp) repeating units. VNTRs span 3% of the human genome, are frequently present in coding regions, and have been implicated in multiple Mendelian disorders. Although existing tools recognize VNTR-carrying sequence, genotyping VNTRs (determining repeat unit count and sequence variation) from whole-genome sequencing reads remains challenging. We describe a method, adVNTR, that uses hidden Markov models to model each VNTR, count repeat units, and detect sequence variation. adVNTR models can be developed for short-read (Illumina) and single-molecule (Pacific Biosciences [PacBio]) whole-genome and whole-exome sequencing, and show good results on multiple simulated and real data sets.
© 2018 Bakhtiari et al.; Published by Cold Spring Harbor Laboratory Press.
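To make the repeat-unit counting task in the abstract concrete, the following is a minimal sketch in Python. It is not adVNTR's method: instead of a hidden Markov model, it uses a much simpler greedy scan that accepts each window whose edit distance to a consensus unit is within a small tolerance. The function names (`edit_distance`, `count_repeat_units`), the `max_errors` parameter, and the toy sequences are all hypothetical and chosen only for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def count_repeat_units(sequence: str, unit: str, max_errors: int = 1) -> int:
    """Greedily count consecutive inexact copies of `unit` from the start of `sequence`.

    Illustrative assumption: a window counts as one repeat unit if its edit
    distance to `unit` is at most `max_errors`. This is NOT the HMM used by adVNTR.
    """
    if not unit:
        raise ValueError("unit must be a non-empty string")
    count = 0
    pos = 0
    while pos < len(sequence):
        best = None  # (distance, |length difference|, window length)
        for length in range(len(unit) - max_errors, len(unit) + max_errors + 1):
            if length <= 0 or pos + length > len(sequence):
                continue
            d = edit_distance(sequence[pos:pos + length], unit)
            if d <= max_errors:
                candidate = (d, abs(length - len(unit)), length)
                if best is None or candidate < best:
                    best = candidate
        if best is None:
            break  # the next window no longer resembles the repeat unit
        count += 1
        pos += best[2]
    return count


if __name__ == "__main__":
    unit = "ACGTAC"
    # Three copies (the second carries one substitution) followed by flanking sequence.
    read = "ACGTAC" + "ACGTTC" + "ACGTAC" + "GGGGGGGG"
    print(count_repeat_units(read, unit))  # prints 3
```

A greedy scan like this can miscount when errors cluster or when the read begins mid-unit; a probabilistic model such as the HMM named in the abstract is one way to handle those cases more robustly.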