Linked-read sequencing for detecting short tandem repeat expansions
- PMID: 35672336
- PMCID: PMC9174224
- DOI: 10.1038/s41598-022-13024-4
Abstract
Detection of short tandem repeat (STR) expansions with standard short-read sequencing is challenging due to the difficulty in mapping multicopy repeat sequences. In this study, we explored how the long-range sequence information of barcode linked-read sequencing (BLRS) can be leveraged to improve repeat-read detection. We also devised a novel algorithm using BLRS barcodes for distance estimation and evaluated its application for STR genotyping. Both approaches were designed for genotyping large expansions (> 1 kb) that cannot be sized accurately by existing methods. Using simulated and experimental data of genomes with STR expansions from multiple BLRS platforms, we validated the utility of barcode and phasing information in attaining better STR genotypes compared to standard short-read sequencing. Although the coverage bias of extremely GC-rich STRs is an important limitation of BLRS, BLRS is an effective strategy for genotyping many other STR loci.
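The abstract does not describe how the barcode-based distance estimate works, so the following is only a minimal illustrative sketch of the general idea, not the authors' algorithm. It assumes that every DNA molecule has the same fixed length L, that molecule start positions are uniform, and that distinct molecules never share a barcode. Under those assumptions, the fraction f of barcodes shared between two flanking windows separated by distance d is roughly f = 1 - d / L, so a shared fraction lower than the reference distance predicts suggests extra sequence between the flanks. All function names, parameters, and the toy data are hypothetical.

```python
from typing import Iterable


def shared_fraction(left: Iterable[str], right: Iterable[str]) -> float:
    """Dice coefficient of the barcode sets observed in two flanking windows."""
    left_set, right_set = set(left), set(right)
    total = len(left_set) + len(right_set)
    if total == 0:
        raise ValueError("no barcodes observed in either flank")
    return 2 * len(left_set & right_set) / total


def estimate_distance(fraction: float, molecule_length: float) -> float:
    """Invert f = 1 - d / L (assumption: fixed-length molecules, uniform starts)."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie in [0, 1]")
    return molecule_length * (1.0 - fraction)


def estimate_expansion(
    left: Iterable[str],
    right: Iterable[str],
    ref_distance: float,
    molecule_length: float,
) -> float:
    """Estimated extra sequence (bp) between the flanks relative to the reference."""
    distance = estimate_distance(shared_fraction(left, right), molecule_length)
    # A negative difference is treated as "no expansion detected".
    return max(0.0, distance - ref_distance)


# Toy data (hypothetical): 10 barcodes per flank, 8 of them shared.
left_flank = [f"BC{i}" for i in range(10)]
right_flank = [f"BC{i}" for i in range(2, 12)]

# Assumed molecule length of 50 kb and a 2 kb flank separation in the reference.
expansion_bp = estimate_expansion(left_flank, right_flank, 2_000, 50_000)
print(round(expansion_bp))
```

A real implementation would need to account for variable molecule lengths, barcode collisions, and sampling noise in the barcode counts, which this sketch deliberately ignores.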
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.