Efficient genome monomer higher-order structure annotation and identification using the GRMhor algorithm
- PMID: 39659587
- PMCID: PMC11630843
- DOI: 10.1093/bioadv/vbae191
Abstract
Motivation: Tandem monomeric units, integral components of eukaryotic genomes, form higher-order repeat (HOR) structures that play crucial roles in maintaining chromosome integrity and regulating gene expression and protein abundance. Given their significant influence on processes such as evolution, chromosome segregation, and disease, developing a sensitive and automated tool for identifying HORs across diverse genomic sequences is essential.
Results: In this study, we applied the GRMhor (Global Repeat Map hor) algorithm to analyse the centromeric region of chromosome 20 in three individual human genomes, as well as the centromeric regions of three higher primates. In all three human genomes, we identified six distinct HOR arrays. These arrays differ in the number of canonical and variant copies, and in their overall structure, significantly more than would be expected given the 99.9% genetic similarity among humans. Furthermore, our analysis of higher primate genomes, which revealed entirely different HOR sequences, indicates a much larger genomic divergence between humans and higher primates than previously recognized. These results underscore the suitability of the GRMhor algorithm for studying specificities in individual genomes, particularly those involving repetitive monomers in centromere structure, which is essential for proper chromosome segregation during cell division. They also highlight its utility in exploring centromere evolution and other repetitive genomic regions.
Availability and implementation: Source code and example binaries are freely available for download at github.com/gluncic/GRM2023.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
All authors of this article declare that they have no conflicts of interest.