Profound Non-Randomness in Dinucleotide Arrangements within Ultra-Conserved Non-Coding Elements and the Human Genome
- PMID: 37627009
- PMCID: PMC10452674
- DOI: 10.3390/biology12081125
Profound Non-Randomness in Dinucleotide Arrangements within Ultra-Conserved Non-Coding Elements and the Human Genome
Abstract
Long human ultra-conserved non-coding elements (UCNEs) do not have any sequence similarity to each other or other characteristics that make them unalterable during vertebrate evolution. We hypothesized that UCNEs have unique dinucleotide (DN) composition and arrangements compared to the rest of the genome. A total of 4272 human UCNE sequences were analyzed computationally and compared with the whole genomes of human, chicken, zebrafish, and fly. Statistical analysis was performed to assess the non-randomness in DN spacing arrangements within the entire human genome and within UCNEs. Significant non-randomness in DN spacing arrangements was observed in the entire human genome. Additionally, UCNEs exhibited distinct patterns in DN arrangements compared to the rest of the genome. Approximately 83% of all DN pairs within UCNEs showed significant (>10%) non-random genomic arrangements at short distances (2-6 nucleotides) relative to each other. At the extremes, non-randomness in DN spacing distances deviated up to 40% from expected values and were frequently associated with GpC, CpG, ApT, and GpG/CpC dinucleotides. The described peculiarities in DN arrangements have persisted for hundreds of millions of years in vertebrates. These distinctive patterns may suggest that UCNEs have specific DNA conformations.
Keywords: DNA structure; bioinformatics; genomics; inhomogeneity; polymorphism; stacking.
Conflict of interest statement
The authors declare that they have no competing interest.
Figures




Similar articles
-
Computational identification of ultra-conserved elements in the human genome: a hypothesis on homologous DNA pairing.NAR Genom Bioinform. 2024 Jul 2;6(3):lqae074. doi: 10.1093/nargab/lqae074. eCollection 2024 Sep. NAR Genom Bioinform. 2024. PMID: 38962254 Free PMC article.
-
Nucleotide Composition of Ultra-Conserved Elements Shows Excess of GpC and Depletion of GG and CC Dinucleotides.Genes (Basel). 2022 Nov 7;13(11):2053. doi: 10.3390/genes13112053. Genes (Basel). 2022. PMID: 36360290 Free PMC article.
-
Genomic context analysis reveals dense interaction network between vertebrate ultraconserved non-coding elements.Bioinformatics. 2012 Sep 15;28(18):i395-i401. doi: 10.1093/bioinformatics/bts400. Bioinformatics. 2012. PMID: 22962458 Free PMC article.
-
Mid-range inhomogeneity of eukaryotic genomes.ScientificWorldJournal. 2011 Apr 5;11:842-54. doi: 10.1100/tsw.2011.82. ScientificWorldJournal. 2011. PMID: 21479353 Free PMC article. Review.
-
Genomic regulatory blocks in vertebrates and implications in human disease.Brief Funct Genomic Proteomic. 2009 Jul;8(4):333-42. doi: 10.1093/bfgp/elp019. Epub 2009 Jun 26. Brief Funct Genomic Proteomic. 2009. PMID: 19561171 Review.
Cited by
-
Computational identification of ultra-conserved elements in the human genome: a hypothesis on homologous DNA pairing.NAR Genom Bioinform. 2024 Jul 2;6(3):lqae074. doi: 10.1093/nargab/lqae074. eCollection 2024 Sep. NAR Genom Bioinform. 2024. PMID: 38962254 Free PMC article.
-
DNA N-gram analysis framework (DNAnamer): A generalized N-gram frequency analysis framework for the supervised classification of DNA sequences.Heliyon. 2024 Aug 24;10(17):e36914. doi: 10.1016/j.heliyon.2024.e36914. eCollection 2024 Sep 15. Heliyon. 2024. PMID: 39281454 Free PMC article.
References
-
- Trifonov E.N. In: The Codes of Life. Barbieri M., editor. Springer; Dordrecht, The Netherlands: 2007. pp. 4–17.
LinkOut - more resources
Full Text Sources