Comparative Study
Database (Oxford). 2019 Jan 1:2019:baz029.
doi: 10.1093/database/baz029.

Tetrahymena Comparative Genomics Database (TCGD): a community resource for Tetrahymena

Wentao Yang et al. Database (Oxford).

Abstract

Ciliates are a large and diverse group of unicellular organisms characterized by two distinct types of nuclei within a single cell: the micronucleus (MIC) and the macronucleus (MAC). Although the genomes of several ciliates in different groups have been sequenced, comparative genomics data for multiple species within a ciliate genus are not yet available. Here we collected the genome information and comparative genomics analysis results for 10 species in the Tetrahymena genus, including the previously sequenced model organism Tetrahymena thermophila and 9 newly sequenced species, and constructed a genus-level comparative analysis platform, the Tetrahymena Comparative Genomics Database (TCGD). Genome sequences, transcriptomic data, gene models, functional annotation, ortholog groups and synteny maps were built into this database, and a user-friendly interface was developed for searching, visualizing and analyzing these data. In summary, the TCGD (http://ciliate.ihb.ac.cn) will be an important and useful resource for the ciliate research community.

Figures

Figure 1
Schematic structure of the TCGD. A flow diagram shows the database architecture. Genome sequences, CDS and protein sequences were formatted as a BLAST database. Sequences, annotation information, comparative genomics data and transcriptomic data were stored in the MySQL database. GBrowse and mGSV were used for visualization of genome data and synteny maps. Search and visualization allow users to easily access the data resources in TCGD.
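As a rough illustration of the relational storage described in this caption, the sketch below keeps gene records and ortholog-group membership in two tables and looks up the homologs of a gene. It is not the TCGD schema: the table names, column names, gene IDs, species labels and group ID are all hypothetical placeholders, and Python's built-in sqlite3 module stands in for MySQL so that the example runs without a server.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE gene (
            gene_id    TEXT PRIMARY KEY,
            species    TEXT NOT NULL,
            annotation TEXT,
            location   TEXT
        );
        CREATE TABLE ortholog_group (
            group_id TEXT NOT NULL,
            gene_id  TEXT NOT NULL REFERENCES gene(gene_id)
        );
    """)
    # Placeholder rows; none of these values come from TCGD.
    conn.executemany("INSERT INTO gene VALUES (?, ?, ?, ?)", [
        ("GENE_A1", "species_A", "placeholder annotation", "scaffold_1:100-900"),
        ("GENE_B1", "species_B", "placeholder annotation", "scaffold_7:50-800"),
        ("GENE_B2", "species_B", "unrelated placeholder", "scaffold_9:10-400"),
    ])
    conn.executemany("INSERT INTO ortholog_group VALUES (?, ?)", [
        ("OG_0001", "GENE_A1"),
        ("OG_0001", "GENE_B1"),
    ])
    conn.commit()
    return conn


def find_homologs(conn, gene_id):
    """Return (gene_id, species) for other genes in the same ortholog group."""
    rows = conn.execute("""
        SELECT g.gene_id, g.species
        FROM ortholog_group AS mine
        JOIN ortholog_group AS other ON other.group_id = mine.group_id
        JOIN gene AS g ON g.gene_id = other.gene_id
        WHERE mine.gene_id = ? AND other.gene_id <> ?
        ORDER BY g.gene_id
    """, (gene_id, gene_id))
    return rows.fetchall()


conn = build_demo_db()
print(find_homologs(conn, "GENE_A1"))
```

A self-join on the group table is one simple way to answer "which genes in other species share a group with this gene"; a production database might instead store group membership directly on the gene record.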
Figure 2
Search functions implemented in TCGD. (A) An integrated search box. (B) Screenshot of the search results interface for gene TPYRIF00114600 through the integrated search box. (C) A brief gene description of TPYRIF00114600, including the species the gene belongs to, putative annotation based on NCBI BLAST hits and the gene location.
Figure 3
A gene details page for TPYRIF00114600, showing five types of data. (A) Basic information on the gene, such as the species, putative annotation based on NCBI BLAST hits, and the gene location. (B) A snapshot of the gene structure with a hyperlink to GBrowse. (C) Annotation with InterProScan for protein domains, GO and KEGG function. (D) Homolog information for all 10 species based on OrthoMCL ortholog groups. (E) The predicted CDS and protein sequences.
Figure 4
Visualization of a synteny map in TCGD. (A) The ‘collinear block ID’ or ‘Gene ID’ is entered into the search box to retrieve a synteny map for 10 Tetrahymena species. (B) A summary page shows genome associations and the number of genes for each genome pair for collinear block ID chr4-27. A circular diagram shows a general overview of the associations. To obtain the full synteny display, users can choose to enter either ‘Pairwise view’ or ‘Multiple view’ mode.
Figure 5
Visualization of a synteny map in ‘Pairwise view’ mode and ‘Multiple view’ mode. (A) In ‘Pairwise view’ mode, genes are shown as lines between adjacent genomes. Genomes can be rearranged, removed or shown more than once. Genome control panels on the left side of the interface allow the genome viewing range to be adjusted. Master controls at the top apply to all genomes. By using the control panel on the left, users can choose the shape and color of genes. Regions of visible synteny can be filtered based on the numerical criteria specified for genes. (B) In ‘Multiple view’ mode, conserved genes across all selected genomes are shown. The regions associated with one or more specific genome pairs can be hidden using the buttons above the synteny display. Genomes can also be rearranged or removed, and each genome is displayed only once.
