Clusters of ancestrally related genes that show paralogy in whole or in part are a major feature of the genomes of humans and other species
- PMID: 22563380
- PMCID: PMC3338513
- DOI: 10.1371/journal.pone.0035274
Abstract
Arrangements of genes along chromosomes are a product of evolutionary processes, and we can expect that preferable arrangements will prevail over the span of evolutionary time, often being reflected in the non-random clustering of structurally and/or functionally related genes. Such non-random arrangements can arise by two distinct evolutionary processes: duplications of DNA sequences, which give rise to clusters of genes sharing both sequence similarity and common sequence features, and the migration together of genes that are related by function but not by common descent. Distinguishing between the two is important for future efforts to unravel the evolutionary processes involved. To provide a background for making that distinction, we here describe the extent to which ancestrally related genes are found in proximity. To this end, we combined information from five genomic datasets: InterPro, SCOP, PANTHER, Ensembl protein families, and Ensembl gene paralogs. The results are provided in publicly available datasets (http://cgd.jax.org/datasets/clustering/paraclustering.shtml) describing the extent to which ancestrally related genes are in proximity beyond what is expected by chance (i.e., form paraclusters) in the human and nine other vertebrate genomes, as well as the D. melanogaster, C. elegans, A. thaliana, and S. cerevisiae genomes. With the exception of Saccharomyces, paraclusters are a common feature of the genomes we examined. In the human genome they are estimated to include at least 22% of all protein-coding genes. Paraclusters are far more prevalent among some gene families than others, are highly species- or clade-specific, and can evolve rapidly, sometimes in response to environmental cues. Altogether, they account for a large portion of the functional clustering previously reported in several genomes.
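The notion of related genes lying "in proximity beyond what is expected by chance" can be illustrated with a simple permutation test. The sketch below is not the authors' procedure, which the abstract does not specify. It assumes a hypothetical input: a list of gene family labels in chromosomal order, with `None` for genes that belong to no family. The function names, the window size, and the number of permutations are all illustrative choices.

```python
import random


def count_close_pairs(families, window=1):
    """Count gene pairs at most `window` positions apart that share a family label.

    `families` is a list of labels in chromosomal order; None means "no family".
    """
    n = len(families)
    count = 0
    for i in range(n):
        # Compare gene i with the next `window` genes along the chromosome.
        for j in range(i + 1, min(i + window, n - 1) + 1):
            if families[i] is not None and families[i] == families[j]:
                count += 1
    return count


def permutation_p_value(families, window=1, n_perm=1000, seed=0):
    """Estimate how often shuffled gene orders show at least the observed clustering.

    Returns (observed_count, empirical_p_value). The +1 terms keep the
    estimated p-value from ever being exactly zero.
    """
    rng = random.Random(seed)
    observed = count_close_pairs(families, window)
    shuffled = list(families)
    at_least = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # random gene order with the same family sizes
        if count_close_pairs(shuffled, window) >= observed:
            at_least += 1
    return observed, (at_least + 1) / (n_perm + 1)


# Hypothetical chromosome: two tandem families separated by unrelated genes.
example = ["A"] * 5 + [None] * 20 + ["B"] * 5
observed, p_value = permutation_p_value(example)
print(observed, p_value)
```

Shuffling labels preserves family sizes while destroying gene order, so a small p-value indicates that family members sit closer together than a random arrangement would produce. A real analysis would also need to account for physical distance, chromosome boundaries, and multiple testing across families.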