Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Jan 1;30(1):294-8.
doi: 10.1093/nar/30.1.294.

GTOP: a database of protein structures predicted from genome sequences

Affiliations

GTOP: a database of protein structures predicted from genome sequences

Takeshi Kawabata et al. Nucleic Acids Res. .

Abstract

Large-scale genome projects generate an unprecedented number of protein sequences, most of them are experimentally uncharacterized. Predicting the 3D structures of sequences provides important clues as to their functions. We constructed the Genomes TO Protein structures and functions (GTOP) database, containing protein fold predictions of a huge number of sequences. Predictions are mainly carried out with the homology search program PSI-BLAST, currently the most popular among high-sensitivity profile search methods. GTOP also includes the results of other analyses, e.g. homology and motif search, detection of transmembrane helices and repetitive sequences. We have completed analyzing the sequences of 41 organisms, with the number of proteins exceeding 120 000 in total. GTOP uses a graphical viewer to present the analytical results of each ORF in one page in a 'color-bar' format. The assigned 3D structures are presented by Chime plug-in or RasMol. The binding sites of ligands are also included, providing functional information. The GTOP server is available at http://spock.genes.nig.ac.jp/~genome/gtop.html.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Fractions of structure-predicted ORFs in GTOP database. The digits in parentheses are the numbers of ORFs in the corresponding organism.
Figure 2
Figure 2
Snapshots of GTOP web pages. (A) The summary page of citA, an ORF in E.coli. Results of various kinds of analysis are shown in colored bar formats. If SCOP structure domains are assigned, small structure icons are shown above the colored bars. The structure icons were prepared for each superfamily of SCOP. The string indicated by ‘OrgPattern’ is the number of homologs of the given protein in the organisms stored in GTOP database. (B) An alignment of citA and the 3D structure of its homolog 1bxdA found by PSI-BLAST. The characters under the alignment (shown in a blue box) indicate the numbers of atomic contacts between a given residue and the ligand, ANP. (C) A Chime plug-in view of the 3D structure of 1bxdA, a homolog of citA. The regions of insertion and deletion are colored blue.

Similar articles

Cited by

References

    1. Berman H.M., Westbrook,J., Feng,Z., Gilliland,G., Bhat,T.N., Weissig,H., Shindyalov,I.N. and Bourne,P.E. (2000) The Protein Data Bank. Nucleic Acids Res., 28, 235–242. Updated article in this issue: Nucleic Acids Res. (2002), 30, 245–248. - PMC - PubMed
    1. Park J., Karplus,K., Barret,C., Hughey,R., Haussler,D., Hubbard,T. and Chothia,C. (1998) Sequence comparison using multiple sequences detect three times as many homologues as pairwise methods. J. Mol. Biol., 284, 1201–1210. - PubMed
    1. Wolf Y.I., Brenner,S.E., Bash,P.A. and Koonin,E.V. (1999) Distribution of protein folds in the three superkingdoms of life. Genome Res., 9, 17–26. - PubMed
    1. Teichmann S.A., Chothia,C. and Gerstein,M. (1999) Advances in structural genomics. Curr. Opin. Struct. Biol., 9, 390–399. - PubMed
    1. Burley S.K. (2001) An overview of structural genomics. Nature Struct. Biol., 7 (suppl.), 932–934. - PubMed

Publication types