PhyloPat: phylogenetic pattern analysis of eukaryotic genes
- PMID: 16948844
- PMCID: PMC1570148
- DOI: 10.1186/1471-2105-7-398
PhyloPat: phylogenetic pattern analysis of eukaryotic genes
Abstract
Background: Phylogenetic patterns show the presence or absence of certain genes or proteins in a set of species. They can also be used to determine sets of genes or proteins that occur only in certain evolutionary branches. Phylogenetic patterns analysis has routinely been applied to protein databases such as COG and OrthoMCL, but not upon gene databases. Here we present a tool named PhyloPat which allows the complete Ensembl gene database to be queried using phylogenetic patterns.
Description: PhyloPat is an easy-to-use webserver, which can be used to query the orthologies of all complete genomes within the EnsMart database using phylogenetic patterns. This enables the determination of sets of genes that occur only in certain evolutionary branches or even single species. We found in total 446,825 genes and 3,164,088 orthologous relationships within the EnsMart v40 database. We used a single linkage clustering algorithm to create 147,922 phylogenetic lineages, using every one of the orthologies provided by Ensembl. PhyloPat provides the possibility of querying with either binary phylogenetic patterns (created by checkboxes) or regular expressions. Specific branches of a phylogenetic tree of the 21 included species can be selected to create a branch-specific phylogenetic pattern. Users can also input a list of Ensembl or EMBL IDs to check which phylogenetic lineage any gene belongs to. The output can be saved in HTML, Excel or plain text format for further analysis. A link to the FatiGO web interface has been incorporated in the HTML output, creating easy access to functional information. Finally, lists of omnipresent, polypresent and oligopresent genes have been included.
Conclusion: PhyloPat is the first tool to combine complete genome information with phylogenetic pattern querying. Since we used the orthologies generated by the accurate pipeline of Ensembl, the obtained phylogenetic lineages are reliable. The completeness and reliability of these phylogenetic lineages will further increase with the addition of newly found orthologous relationships within each new Ensembl release.
Figures




Similar articles
-
PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood.Nucleic Acids Res. 2009 Jan;37(Database issue):D731-7. doi: 10.1093/nar/gkn645. Epub 2008 Oct 2. Nucleic Acids Res. 2009. PMID: 18832367 Free PMC article.
-
OrthologID: automation of genome-scale ortholog identification within a parsimony framework.Bioinformatics. 2006 Mar 15;22(6):699-707. doi: 10.1093/bioinformatics/btk040. Epub 2006 Jan 12. Bioinformatics. 2006. PMID: 16410324
-
GeneTools--application for functional annotation and statistical hypothesis testing.BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470. BMC Bioinformatics. 2006. PMID: 17062145 Free PMC article.
-
Advances in the Exon-Intron Database (EID).Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
-
Homology assessment and molecular sequence alignment.J Biomed Inform. 2006 Feb;39(1):18-33. doi: 10.1016/j.jbi.2005.11.005. Epub 2005 Dec 9. J Biomed Inform. 2006. PMID: 16380300 Review.
Cited by
-
PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood.Nucleic Acids Res. 2009 Jan;37(Database issue):D731-7. doi: 10.1093/nar/gkn645. Epub 2008 Oct 2. Nucleic Acids Res. 2009. PMID: 18832367 Free PMC article.
-
Testicular cell adhesion molecule 1 (TCAM1) is not essential for fertility.Mol Cell Endocrinol. 2010 Feb 5;315(1-2):246-53. doi: 10.1016/j.mce.2009.09.010. Epub 2009 Sep 17. Mol Cell Endocrinol. 2010. PMID: 19766163 Free PMC article.
-
Relaxed purifying selection and possibly high rate of adaptation in primate lineage-specific genes.Genome Biol Evol. 2010 Jul 12;2:393-409. doi: 10.1093/gbe/evq019. Genome Biol Evol. 2010. PMID: 20624743 Free PMC article.
-
Preservation of genes involved in sterol metabolism in cholesterol auxotrophs: facts and hypotheses.PLoS One. 2008 Aug 6;3(8):e2883. doi: 10.1371/journal.pone.0002883. PLoS One. 2008. PMID: 18682733 Free PMC article.
-
Genomics and bioinformatics resources for crop improvement.Plant Cell Physiol. 2010 Apr;51(4):497-523. doi: 10.1093/pcp/pcq027. Epub 2010 Mar 5. Plant Cell Physiol. 2010. PMID: 20208064 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources