Characterization and visualization of tandem repeats at genome scale
- PMID: 38168995
- PMCID: PMC11921810
- DOI: 10.1038/s41587-023-02057-3
Characterization and visualization of tandem repeats at genome scale
Abstract
Tandem repeat (TR) variation is associated with gene expression changes and numerous rare monogenic diseases. Although long-read sequencing provides accurate full-length sequences and methylation of TRs, there is still a need for computational methods to profile TRs across the genome. Here we introduce the Tandem Repeat Genotyping Tool (TRGT) and an accompanying TR database. TRGT determines the consensus sequences and methylation levels of specified TRs from PacBio HiFi sequencing data. It also reports reads that support each repeat allele. These reads can be subsequently visualized with a companion TR visualization tool. Assessing 937,122 TRs, TRGT showed a Mendelian concordance of 98.38%, allowing a single repeat unit difference. In six samples with known repeat expansions, TRGT detected all expansions while also identifying methylation signals and mosaicism and providing finer repeat length resolution than existing methods. Additionally, we released a database with allele sequences and methylation levels for 937,122 TRs across 100 genomes.
© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
Competing interests
E.D., G.D.S.B., T.M., W.J.R., C.K., Z.K., K.P.C., A.W. and M.A.E. are employees and shareholders of Pacific Biosciences. F.J.S. received research support from Illumina, Pacific Biosciences, Nanopore and Genentech. The remaining authors declare no competing interests.
Figures





References
-
- English A. et al. Benchmarking of small and large variants across tandem repeats. Preprint at bioRxiv 10.1101/2023.10.29.564632 (2023). - DOI
-
- Caron NS, Wright GEB & Hayden MR Huntington disease. In GeneReviews® (eds. Adam MP et al.) (Univ. Washington, 1998).
-
- Siddique N. & Siddique T. Amyotrophic lateral sclerosis overview. In GeneReviews® (eds. Adam MP et al.) (Univ. Washington, 2001). - PubMed
-
- Hunter JE, Berry-Kravis E, Hipp H. & Todd PK FMR1 disorders. In GeneReviews® (eds. Adam MP et al.) (Univ. Washington, 1998). - PubMed
MeSH terms
Grants and funding
- R21 HG013397/HG/NHGRI NIH HHS/United States
- P50 HD104458/HD/NICHD NIH HHS/United States
- R01 HG012252/HG/NHGRI NIH HHS/United States
- F31 HG011205/HG/NHGRI NIH HHS/United States
- P50 HD104463/HD/NICHD NIH HHS/United States
- K08 HG008986/HG/NHGRI NIH HHS/United States
- T32 HG008962/HG/NHGRI NIH HHS/United States
- RC2 TR004391/TR/NCATS NIH HHS/United States
- K99 HG012796/HG/NHGRI NIH HHS/United States
- OT2 OD002751/OD/NIH HHS/United States
- R01 HG010757/HG/NHGRI NIH HHS/United States
- R00 HG012796/HG/NHGRI NIH HHS/United States
- R35 NS111602/NS/NINDS NIH HHS/United States
- R01 NS051630/NS/NINDS NIH HHS/United States
- UL1 TR002366/TR/NCATS NIH HHS/United States
- UG3 NS132105/NS/NINDS NIH HHS/United States
- U01 HG011758/HG/NHGRI NIH HHS/United States
- R01 NS072248/NS/NINDS NIH HHS/United States
- P50 HD103555/HD/NICHD NIH HHS/United States
LinkOut - more resources
Full Text Sources