Studying genomes through the aeons: protein families, pseudogenes and proteome evolution
- PMID: 12083509
- DOI: 10.1016/s0022-2836(02)00109-2
Studying genomes through the aeons: protein families, pseudogenes and proteome evolution
Abstract
Protein families can be used to understand many aspects of genomes, both their "live" and their "dead" parts (i.e. genes and pseudogenes). Surveys of genomes have revealed that, in every organism, there are always a few large families and many small ones, with the overall distribution following a power-law. This commonality is equally true for both genes and pseudogenes, and exists despite the fact that the specific families that are enlarged differ greatly between organisms. Furthermore, because of family structure there is great redundancy in proteomes, a fact linked to the large number of dispensable genes for each organism and the small size of the minimal, indispensable sub-proteome. Pseudogenes in prokaryotes represent families that are in the process of being dispensed with. In particular, the genome sequences of certain pathogenic bacteria (Mycobacterium leprae, Yersinia pestis and Rickettsia prowazekii) show how an organism can undergo reductive evolution on a large scale (i.e. the dying out of families) as a result of niche change. There appears to be less pressure to delete pseudogenes in eukaryotes. These can be divided into two varieties, duplicated and processed, where the latter involves reverse transcription from an mRNA intermediate. We discuss these collectively in yeast, worm, fly, and human. The fly has few pseudogenes apparently because of its high rate of genomic DNA deletion. In the other three organisms, the distribution of pseudogenes on the chromosome and amongst different families is highly non-uniform. Pseudogenes tend not to occur in the middle of chromosome arms, and tend to be associated with lineage-specific (as opposed to highly conserved) families that have environmental-response functions. This may be because, rather than being dead, they may form a reservoir of diverse "extra parts" that can be resurrected to help an organism adapt to its surroundings. In yeast, there may be a novel mechanism involving the [PSI+] prion that potentially enables this resurrection. In worm, the pseudogenes tend to arise out of families (e.g. chemoreceptors) that are greatly expanded in it compared to the fly. The human genome stands out in having many processed pseudogenes. These have a character very different from those of the duplicated variety, to a large extent just representing random insertions. Thus, their occurrence tends to be roughly in proportion to the amount of mRNA for a particular protein and to reflect the extent of the intergenic sequences. Further information about pseudogenes is available at http://genecensus.org/pseudogene
Similar articles
-
Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome.Nucleic Acids Res. 2001 Feb 1;29(3):818-30. doi: 10.1093/nar/29.3.818. Nucleic Acids Res. 2001. PMID: 11160906 Free PMC article.
-
Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22.Genome Res. 2002 Feb;12(2):272-80. doi: 10.1101/gr.207102. Genome Res. 2002. PMID: 11827946 Free PMC article.
-
Identification of pseudogenes in the Drosophila melanogaster genome.Nucleic Acids Res. 2003 Feb 1;31(3):1033-7. doi: 10.1093/nar/gkg169. Nucleic Acids Res. 2003. PMID: 12560500 Free PMC article.
-
Processed pseudogenes: characteristics and evolution.Annu Rev Genet. 1985;19:253-72. doi: 10.1146/annurev.ge.19.120185.001345. Annu Rev Genet. 1985. PMID: 3909943 Review.
-
Comparative analysis of processed pseudogenes in the mouse and human genomes.Trends Genet. 2004 Feb;20(2):62-7. doi: 10.1016/j.tig.2003.12.005. Trends Genet. 2004. PMID: 14746985 Review.
Cited by
-
Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability.Nucleic Acids Res. 2005 Apr 28;33(8):2374-83. doi: 10.1093/nar/gki531. Print 2005. Nucleic Acids Res. 2005. PMID: 15860774 Free PMC article.
-
Fine mapping of the tomato I-3 gene for fusarium wilt resistance and elimination of a co-segregating resistance gene analogue as a candidate for I-3.Theor Appl Genet. 2004 Jul;109(2):409-18. doi: 10.1007/s00122-004-1646-4. Epub 2004 Mar 26. Theor Appl Genet. 2004. PMID: 15045176
-
Segmental duplications in the human genome reveal details of pseudogene formation.Nucleic Acids Res. 2010 Nov;38(20):6997-7007. doi: 10.1093/nar/gkq587. Epub 2010 Jul 8. Nucleic Acids Res. 2010. PMID: 20615899 Free PMC article.
-
Apparent Power Laws Can Occur without Criticality.Entropy (Basel). 2021 Nov 10;23(11):1486. doi: 10.3390/e23111486. Entropy (Basel). 2021. PMID: 34828184 Free PMC article.
-
Programmed Deviations of Ribosomes From Standard Decoding in Archaea.Front Microbiol. 2021 Jun 4;12:688061. doi: 10.3389/fmicb.2021.688061. eCollection 2021. Front Microbiol. 2021. PMID: 34149676 Free PMC article. Review.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases