A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database
- PMID: 16608522
- PMCID: PMC1523372
- DOI: 10.1186/1471-2105-7-201
A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database
Abstract
Background: We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.
Discussion: The PhIGs database currently contains 23 completely sequenced genomes of fungi and metazoans, containing 409,653 genes that have been grouped into 42,645 gene clusters. Each gene cluster is built such that the gene sequence distances are consistent with the known organismal relationships and in so doing, maximizing the likelihood for the clusters to represent truly orthologous genes. The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, such as GO and InterPro assignments, and sequence similarity searches by BLAST and HMM. In addition to displaying the evolutionary relationships of the genes in each cluster, the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes.
Summary: Accurate analyses of genes and genomes can only be done within their full phylogenetic context. The PhIGs database and corresponding website http://phigs.org address this problem for the scientific community. Our goal is to expand the content as more genomes are sequenced and use this framework to incorporate more analyses.
Figures




Similar articles
-
DETECTING EVOLUTIONARY TRANSFER OF GENES USING PhIGs(1).J Phycol. 2008 Feb;44(1):19-22. doi: 10.1111/j.1529-8817.2007.00436.x. J Phycol. 2008. PMID: 27041035
-
GeneTools--application for functional annotation and statistical hypothesis testing.BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470. BMC Bioinformatics. 2006. PMID: 17062145 Free PMC article.
-
PhyloPat: phylogenetic pattern analysis of eukaryotic genes.BMC Bioinformatics. 2006 Sep 1;7:398. doi: 10.1186/1471-2105-7-398. BMC Bioinformatics. 2006. PMID: 16948844 Free PMC article.
-
Advances in the Exon-Intron Database (EID).Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
-
VEGA, the genome browser with a difference.Brief Bioinform. 2005 Jun;6(2):189-93. doi: 10.1093/bib/6.2.189. Brief Bioinform. 2005. PMID: 15975227 Review.
Cited by
-
The HGNC Database in 2008: a resource for the human genome.Nucleic Acids Res. 2008 Jan;36(Database issue):D445-8. doi: 10.1093/nar/gkm881. Epub 2007 Nov 4. Nucleic Acids Res. 2008. PMID: 17984084 Free PMC article.
-
Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution.Genome Biol Evol. 2010 Jul 12;2:225-39. doi: 10.1093/gbe/evq012. Genome Biol Evol. 2010. PMID: 20624728 Free PMC article.
-
FastBLAST: homology relationships for millions of proteins.PLoS One. 2008;3(10):e3589. doi: 10.1371/journal.pone.0003589. Epub 2008 Oct 31. PLoS One. 2008. PMID: 18974889 Free PMC article.
-
The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.BMC Genomics. 2009 Dec 14;10:604. doi: 10.1186/1471-2164-10-604. BMC Genomics. 2009. PMID: 20003425 Free PMC article.
-
Beyond linear sequence comparisons: the use of genome-level characters for phylogenetic reconstruction.Philos Trans R Soc Lond B Biol Sci. 2008 Apr 27;363(1496):1445-51. doi: 10.1098/rstb.2007.2234. Philos Trans R Soc Lond B Biol Sci. 2008. PMID: 18192190 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials