Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jan;40(Database issue):D76-83.
doi: 10.1093/nar/gkr1179. Epub 2011 Dec 1.

neXtProt: a knowledge platform for human proteins

Affiliations

neXtProt: a knowledge platform for human proteins

Lydie Lane et al. Nucleic Acids Res. 2012 Jan.

Abstract

neXtProt (http://www.nextprot.org/) is a new human protein-centric knowledge platform. Developed at the Swiss Institute of Bioinformatics (SIB), it aims to help researchers answer questions relevant to human proteins. To achieve this goal, neXtProt is built on a corpus containing both curated knowledge originating from the UniProtKB/Swiss-Prot knowledgebase and carefully selected and filtered high-throughput data pertinent to human proteins. This article presents an overview of the database and the data integration process. We also lay out the key future directions of neXtProt that we consider the necessary steps to make neXtProt the one-stop-shop for all research projects focusing on human proteins.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The neXtProt home page grants access to the database via a search tool. Users can sign-in (top right) to create a personal account that allows them to personalize their usage of the platform by keeping a history of their queries and favoring or tagging search results. The home page also links to pages with more information about the current content of the platform in term of integrated resources (‘release details’ link at the bottom right).
Figure 2.
Figure 2.
The Search Results page. neXtProt is indexed across several biological areas, corresponding to the different views found on each entry (some data are present in multiple indexes). This allows users to make complex searches, for example finding all proteins localized to the lysosomes and expressed in the brain. From the search results page, it is possible to do various exports: obtain the list of proteins as an Excel file; the protein sequences as FASTA or PEFF, or the entire entry in XML format. When a user is logged in, checkboxes appear next to each entry, so that s/he is able to select specific entries for which the data are exported.
Figure 3.
Figure 3.
The Expression Data view displays data via a browser of the neXtProt human anatomy ontology. Currently, the data presented come from two different sources: Bgee and HPA (see text). Data are captured and displayed at the original granularity level (Loupe symbols), and is propagated to higher levels using the ontology to be comparable across the data sets. The tool provides a menu to toggle between showing only Gold data and showing both Gold and Silver data. An icon next to the annotation indicates the silver data. Unmarked data is gold.
Figure 4.
Figure 4.
The Sequence Viewer, accessible from the Sequence view, displays - in addition to the sequence itself, the different features of the sequence (processing, regions, modified residues, topological information, variants, sequence conflicts, etc.) as a graphical overview and a table view. When a feature is selected, either from the graphical viewer or from the feature table, the corresponding sequence is immediately highlighted on the right. The sequence viewer also provides direct access to the BLAST tool, which has the option of using the full sequence, a selection corresponding to a sequence feature or any other sub-sequence selected by the user.
Figure 5.
Figure 5.
The Exon view, available from the Gene perspective, gives the precise coordinates of all protein isoforms that can be mapped on Ensembl transcripts, based on exons. For each exon, its position on the gene is shown, as well as the length if the exon in nucleotides. The coding regions are represented by a small glyph, in which coding fragments are shown with a large green line, and non-coding sequences by a thin gray line. (Strictly non-coding exons are not shown.) The first and the last amino acid of each exon are shown, and the reading frame of each exon is indicated by red labeling of the amino acids. For example, Val 96–Ile 110 means that the first amino acid of that exon (Val) is completely encoded within that exon, while only the first nucleotide of the last one (Ile) is encoded in that exon.
Figure 6.
Figure 6.
For each term within the ontologies and hierarchical controlled vocabularies, a dedicated page shows its definition, synonyms and cross-references to other ontologies. In addition, the Graphical Ontology Viewer shows the relationship of that term to all its parents.

References

    1. The UniProt Consortium. The Universal Protein Resource (UniProt) 2009. Nucleic Acids Res. 2009;37:D169–D174. - PMC - PubMed
    1. The UniProt Consortium. Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res. 2011;39:D214–D219. - PMC - PubMed
    1. Sigrist CJ, Cerutti L, de Castro E, Langendijk-Genevaux PS, Bulliard V, Bairoch A, Hulo N. PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Res. 2010;38:D161–D166. - PMC - PubMed
    1. Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000;28:304–305. - PMC - PubMed
    1. Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P, et al. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011;39:D561–D568. - PMC - PubMed

Publication types