Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance
- PMID: 23896648
- PMCID: PMC3761090
- DOI: 10.1007/s10577-013-9371-y
Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance
Abstract
Since oligonucleotide composition in the genome sequence varies significantly among species even among those possessing the same genome G + C%, the composition has been used to distinguish a wide range of genomes and called as "genome signature". Oligonucleotides often represent motif sequences responsible for sequence-specific protein binding (e.g., transcription-factor binding). Occurrences of such motif oligonucleotides in the genome should be biased compared to those observed in random sequences and may differ among genomes and genomic portions. Self-Organizing Map (SOM) is a powerful tool for clustering high-dimensional data such as oligonucleotide composition on one plane. We previously modified the conventional SOM for genome informatics to batch learning SOM or "BLSOM". When we constructed BLSOMs to analyze pentanucleotide composition in 20-, 50-, and 100-kb sequences derived from the human genome, BLSOMs did not classify human sequences according to chromosome but revealed several specific zones composed primarily of sequences derived from pericentric regions. Interestingly, various transcription-factor-binding motifs were characteristically overrepresented in pericentric regions but underrepresented in most genomic sequences. When we focused on much shorter sequences (e.g., 1 kb), the clustering of transcription-factor-binding motifs was evident in pericentric, subtelomeric and sex chromosome pseudoautosomal regions. The biological significance of the clustering in these regions was discussed in connection with cell-type and -stage-dependent chromocenter formation and nuclear organization.
Figures




















































Similar articles
-
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).Microorganisms. 2013 Nov 20;1(1):137-157. doi: 10.3390/microorganisms1010137. Microorganisms. 2013. PMID: 27694768 Free PMC article. Review.
-
CG-containing oligonucleotides and transcription factor-binding motifs are enriched in human pericentric regions.Genes Genet Syst. 2015;90(1):43-53. doi: 10.1266/ggs.90.43. Genes Genet Syst. 2015. PMID: 26119665
-
A novel bioinformatics method for efficient knowledge discovery by BLSOM from big genomic sequence data.Biomed Res Int. 2014;2014:765648. doi: 10.1155/2014/765648. Epub 2014 Apr 3. Biomed Res Int. 2014. PMID: 24804244 Free PMC article.
-
Self-Organizing Map (SOM) unveils and visualizes hidden sequence characteristics of a wide range of eukaryote genomes.Gene. 2006 Jan 3;365:27-34. doi: 10.1016/j.gene.2005.09.040. Epub 2005 Dec 20. Gene. 2006. PMID: 16364569
-
AI for the collective analysis of a massive number of genome sequences: various examples from the small genome of pandemic SARS-CoV-2 to the human genome.Genes Genet Syst. 2021 Dec 16;96(4):165-176. doi: 10.1266/ggs.21-00025. Epub 2021 Sep 27. Genes Genet Syst. 2021. PMID: 34565757 Review.
Cited by
-
The developmental and evolutionary characteristics of transcription factor binding site clustered regions based on an explainable machine learning model.Nucleic Acids Res. 2024 Jul 22;52(13):7610-7626. doi: 10.1093/nar/gkae441. Nucleic Acids Res. 2024. PMID: 38813828 Free PMC article.
-
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.Front Genet. 2014 May 1;5:114. doi: 10.3389/fgene.2014.00114. eCollection 2014. Front Genet. 2014. PMID: 24822057 Free PMC article. Review.
-
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).Microorganisms. 2013 Nov 20;1(1):137-157. doi: 10.3390/microorganisms1010137. Microorganisms. 2013. PMID: 27694768 Free PMC article. Review.
-
Evolutionary changes in vertebrate genome signatures with special focus on coelacanth.DNA Res. 2014 Oct;21(5):459-67. doi: 10.1093/dnares/dsu012. Epub 2014 May 6. DNA Res. 2014. PMID: 24800745 Free PMC article.
-
Unsupervised AI reveals insect species-specific genome signatures.PeerJ. 2024 Mar 6;12:e17025. doi: 10.7717/peerj.17025. eCollection 2024. PeerJ. 2024. PMID: 38464746 Free PMC article.
References
-
- Abe T, Wada K, Iwasaki Y, Ikemura T. Novel bioinformatics for inter- and intraspecies comparison of genome signatures in plant genomes. Plant Biotech. 2009;26:469–477. doi: 10.5511/plantbiotechnology.26.469. - DOI
-
- Bernardi G. Structural and evolutionary genomics: natural selection in genome evolution. New York: Elsevier; 2004.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources