Characterization of genome-wide STR variation in 6487 human genomes
- PMID: 37045857
- PMCID: PMC10097659
- DOI: 10.1038/s41467-023-37690-8
Characterization of genome-wide STR variation in 6487 human genomes
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures







Similar articles
-
Sequencing and characterizing short tandem repeats in the human genome.Nat Rev Genet. 2024 Jul;25(7):460-475. doi: 10.1038/s41576-024-00692-3. Epub 2024 Feb 16. Nat Rev Genet. 2024. PMID: 38366034 Review.
-
Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates.Am J Hum Genet. 2016 May 5;98(5):919-933. doi: 10.1016/j.ajhg.2016.04.001. Epub 2016 Apr 25. Am J Hum Genet. 2016. PMID: 27126583 Free PMC article.
-
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30. Genome Res. 2015. PMID: 25823460 Free PMC article.
-
A comparison of software for analysis of rare and common short tandem repeat (STR) variation using human genome sequences from clinical and population-based samples.PLoS One. 2024 Apr 1;19(4):e0300545. doi: 10.1371/journal.pone.0300545. eCollection 2024. PLoS One. 2024. PMID: 38558075 Free PMC article.
-
"New turns from old STaRs": enhancing the capabilities of forensic short tandem repeat analysis.Electrophoresis. 2014 Nov;35(21-22):3173-87. doi: 10.1002/elps.201400095. Epub 2014 Jul 16. Electrophoresis. 2014. PMID: 24888494 Review.
Cited by
-
Sequencing and characterizing short tandem repeats in the human genome.Nat Rev Genet. 2024 Jul;25(7):460-475. doi: 10.1038/s41576-024-00692-3. Epub 2024 Feb 16. Nat Rev Genet. 2024. PMID: 38366034 Review.
-
RNA gain-of-function mechanisms in short tandem repeat diseases.RNA. 2025 Feb 19;31(3):349-358. doi: 10.1261/rna.080277.124. RNA. 2025. PMID: 39725460 Free PMC article. Review.
-
Short tandem repeat mutations regulate gene expression in colorectal cancer.Sci Rep. 2024 Feb 9;14(1):3331. doi: 10.1038/s41598-024-53739-0. Sci Rep. 2024. PMID: 38336885 Free PMC article.
-
Diagnostic uplift through the implementation of short tandem repeat analysis using exome sequencing.Eur J Hum Genet. 2024 May;32(5):584-587. doi: 10.1038/s41431-024-01542-w. Epub 2024 Feb 2. Eur J Hum Genet. 2024. PMID: 38308084 Free PMC article.
-
Building a catalogue of short tandem repeats in diverse populations.Nat Rev Genet. 2024 Jul;25(7):457. doi: 10.1038/s41576-024-00726-w. Nat Rev Genet. 2024. PMID: 38538746 No abstract available.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources