Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 May;13(5):875-82.
doi: 10.1101/gr.737703. Epub 2003 Apr 14.

Genomic gene clustering analysis of pathways in eukaryotes

Affiliations
Comparative Study

Genomic gene clustering analysis of pathways in eukaryotes

Jennifer M Lee et al. Genome Res. 2003 May.

Erratum in

  • Genome Res. 2004 Dec;14(12):2510

Abstract

Genomic clustering of genes in a pathway is commonly found in prokaryotes due to transcriptional operons, but these are not present in most eukaryotes. Yet, there might be clustering to a lesser extent of pathway members in eukaryotic genomes, that assist coregulation of a set of functionally cooperating genes. We analyzed five sequenced eukaryotic genomes for clustering of genes assigned to the same pathway in the KEGG database. Between 98% and 30% of the analyzed pathways in a genome were found to exhibit significantly higher clustering levels than expected by chance. In descending order by the level of clustering, the genomes studied were Saccharomyces cerevisiae, Homo sapiens, Caenorhabditis elegans, Arabidopsis thaliana, and Drosophila melanogaster. Surprisingly, there is not much agreement between genomes in terms of which pathways are most clustered. Only seven of 69 pathways found in all species were significantly clustered in all five of them. This species-specific pattern of pathway clustering may reflect adaptations or evolutionary events unique to a particular lineage. We note that although operons are common in C. elegans, only 58% of the pathways showed significant clustering, which is less than in human. Virtually all pathways in S. cerevisiae showed significant clustering.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Clustering results for four pathways, illustrating different levels in clustering score observed among different genomes, pathways, and merged/unmerged data. Pathway 00361 is significantly clustered in all species, whereas the other pathways show varying degrees of clustering. In S. cerevisiae, all but two pathways showed significant clustering (using merged data). Note that the scale of the clustering score is logarithmic.
Figure 2.
Figure 2.
Distribution of pathway clustering scores (using merged data). The x-axis represents how much higher from the median the score is in terms of difference between the median and the 3rd quartile. Also shown is the score distribution from randomly chosen pathways (from all organisms) in which the genes were picked randomly from all genes in the particular organism.
Figure 3.
Figure 3.
Distribution of genes in pathways with and without significant clustering in A. thaliana. Pathway 00360 has a highly significant clustering score, whereas pathway 03020 does not. Merged data were used. The boxes represent putative loose gene clusters.
Figure 4.
Figure 4.
Relationship between clustering score per gene and number of genes in the pathway (merged data).
Figure 5.
Figure 5.
Comparison of percent identity in BLAST matches of enzymes with the same EC number to those with different EC numbers, across all organisms and pathways in this study. Less than 0.6% of all pairs with different EC numbers had an identity above 60%.

References

    1. Altschul S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410. - PubMed
    1. Blumenthal T. 1998. Gene clusters and polycistronic transcription in eukaryotes. BioEssays 20: 480-487. - PubMed
    1. Caron H., van Schaik, B., van der Mee, M., Baas, F., Riggins, G., van Sluis, P., Hermus, M.C., van Asperen, R., Boon, K., Voute, P.A., et al. 2001. The human transcriptome map: Clustering of highly expressed genes in chromosomal domains. Science 291: 1289-1292. - PubMed
    1. Friedman R. and Hughes, A.L. 2001. Gene duplication and the structure of eukaryotic genomes. Genome Res. 11: 373-381. - PMC - PubMed
    1. Glansdorff N. 1999. On the origin of operons and their possible role in evolution toward thermophily. J. Mol. Evol. 49: 432-438. - PubMed

Publication types

MeSH terms