This is a preprint.
ULTRA-Effective Labeling of Repetitive Genomic Sequence
- PMID: 38895435
- PMCID: PMC11185745
- DOI: 10.1101/2024.06.03.597269
Update in
- ULTRA-effective labeling of tandem repeats in genomic sequence. Bioinform Adv. 2024 Oct 9;4(1):vbae149. doi: 10.1093/bioadv/vbae149. eCollection 2024. PMID: 39575229.
Abstract
In the age of long-read sequencing, genomics researchers now have access to accurate repetitive DNA sequence (including satellites) that, due to the limitations of short-read sequencing, could previously be observed only as unmappable fragments. Tools that annotate repetitive sequence are now more important than ever, both to better understand newly uncovered repetitive sequences and to mitigate errors in bioinformatic software caused by those sequences. To that end, we introduce the 1.0 release of our tool for identifying and annotating locally repetitive sequence, ULTRA (ULTRA Locates Tandemly Repetitive Areas). ULTRA is fast enough to use as part of an efficient annotation pipeline, produces state-of-the-art reliable coverage of repetitive regions containing many mutations, and provides interpretable statistics and labels for repetitive regions. It is released under an open license and is available for download at https://github.com/TravisWheelerLab/ULTRA.