Comparative analysis of protein domain organization
- PMID: 14993202
- PMCID: PMC535408
- DOI: 10.1101/gr.1610504
Abstract
We have developed a set of graph theory-based tools, which we call Comparative Analysis of Protein Domain Organization (CADO), to survey and compare protein domain organizations of different organisms. In the language of CADO, the organization of protein domains in a given organism is shown as a domain graph in which protein domains are represented as vertices, and domain combinations, defined as instances of two domains found in one protein, are represented as edges. CADO provides a new way to analyze and compare whole proteomes, including identifying the consensus and difference of domain organization between organisms. CADO was used to analyze and compare >50 bacterial, archaeal, and eukaryotic genomes. Examples and overviews presented here include the analysis of the modularity of domain graphs and the functional study of domains based on the graph topology. We also report on the results of comparing domain graphs of two organisms, Pyrococcus horikoshii (an extremophile) and Haemophilus influenzae (a parasite with reduced genome) with other organisms. Our comparison provides new insights into the genome organization of these organisms. Finally, we report on the specific domain combinations characterizing the three kingdoms of life, and the kingdom "signature" domain organizations derived from those specific domain combinations.
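The domain-graph representation described in the abstract can be illustrated with a minimal Python sketch. This is not the CADO implementation: the function names `domain_graph` and `compare_graphs`, the input layout (protein ID mapped to a list of domain names), and the toy domains `D1` to `D4` are all assumptions made for illustration. The sketch also assumes that any two distinct domains found in the same protein form an edge, regardless of their order or adjacency, which the abstract does not specify.

```python
from itertools import combinations


def domain_graph(proteome):
    """Build a domain graph from a mapping of protein ID -> list of domain names.

    Returns (vertices, edges). Vertices are domain names; each edge is a
    frozenset of two distinct domains that occur together in at least one
    protein. Assumption: repeats of the same domain within a protein do not
    create a self-edge.
    """
    vertices, edges = set(), set()
    for domains in proteome.values():
        unique = sorted(set(domains))
        vertices.update(unique)
        for a, b in combinations(unique, 2):
            edges.add(frozenset((a, b)))
    return vertices, edges


def compare_graphs(edges_a, edges_b):
    """Split domain combinations into shared and organism-specific sets."""
    return {
        "shared": edges_a & edges_b,   # consensus of the two domain graphs
        "only_a": edges_a - edges_b,   # combinations seen only in organism A
        "only_b": edges_b - edges_a,   # combinations seen only in organism B
    }


# Hypothetical toy proteomes; the domain names are placeholders, not real data.
organism_a = {"p1": ["D1", "D2", "D3"], "p2": ["D4"]}
organism_b = {"q1": ["D1", "D2"], "q2": ["D3", "D4"]}

vertices_a, edges_a = domain_graph(organism_a)
vertices_b, edges_b = domain_graph(organism_b)
result = compare_graphs(edges_a, edges_b)
```

Storing each edge as a `frozenset` keeps the graph undirected, so the consensus and difference between two organisms reduce to plain set intersection and set difference.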