Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 8;48(D1):D328-D334.
doi: 10.1093/nar/gkz995.

The neXtProt knowledgebase in 2020: data, tools and usability improvements

Affiliations

The neXtProt knowledgebase in 2020: data, tools and usability improvements

Monique Zahn-Zabal et al. Nucleic Acids Res. .

Abstract

The neXtProt knowledgebase (https://www.nextprot.org) is an integrative resource providing both data on human protein and the tools to explore these. In order to provide comprehensive and up-to-date data, we evaluate and add new data sets. We describe the incorporation of three new data sets that provide expression, function, protein-protein binary interaction, post-translational modifications (PTM) and variant information. New SPARQL query examples illustrating uses of the new data were added. neXtProt has continued to develop tools for proteomics. We have improved the peptide uniqueness checker and have implemented a new protein digestion tool. Together, these tools make it possible to determine which proteases can be used to identify trypsin-resistant proteins by mass spectrometry. In terms of usability, we have finished revamping our web interface and completely rewritten our API. Our SPARQL endpoint now supports federated queries. All the neXtProt data are available via our user interface, API, SPARQL endpoint and FTP site, including the new PEFF 1.0 format files. Finally, the data on our FTP site is now CC BY 4.0 to promote its reuse.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Tabular view showing the expression data for insulin (NX_P01308). Expression data at the mRNA and protein level are displayed in the same semi-quantitative manner for easier comparison. Four levels of expression (undetected, low, medium, high) are possible. Mousing-over the data point displays the expression level textually.
Figure 2.
Figure 2.
Allele frequency information in the Sequence view for BRCA1 (NX_P38398) variants. To find a gnomAD variant, search in the feature table with the gnomAD ID. A link to the corresponding variant in gnomAD is found in the description of the variant and the evidence. The allele frequency, with the allele count and allele number in brackets, as well as the homozygote count, are displayed in the evidence.
Figure 3.
Figure 3.
Protein digestion tool. (A) Input form requiring the neXtProt isoform accession number for the protein to be digested. Default digestion parameters (maximum number of miscleavages, minimum peptide length and maximum peptide length) can be modified by the user. (B) Peptide count and unique peptide count for the digestion with 27 proteases or conditions. Select a protease to see the peptides obtained. (C) Table displaying information about the peptides obtained with the selected digestion conditions. The peptide sequence, length, number of missed cleavages, position in the sequence, whether the peptide is unique or not (without taking into account variants) and whether the peptide is found in neXtProt, as a natural and synthetic (SRM peptide) are displayed. A link to the neXtProt Peptide view of the entry is provided.
Figure 4.
Figure 4.
Federated SPARQL query examples. Screenshot showing all queries tagged ‘federated’ in the neXtProt SNORQL interface.

References

    1. Gaudet P., Michel P.-A., Zahn-Zabal M., Britan A., Cusin I., Domagalski M., Duek P.D., Gateau A., Gleizes A., Hinard V. et al. .. The neXtProt knowledgebase on human proteins: 2017 update. Nucleic Acids Res. 2017; 45:D177–D182. - PMC - PubMed
    1. The UniProt Consortium UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017; 45:D158–D169. - PMC - PubMed
    1. Schaeffer M., Gateau A., Teixeira D., Michel P.-A., Zahn-Zabal M., Lane L.. The neXtProt peptide uniqueness checker: a tool for the proteomics community. Bioinformatics. 2017; 33:3471–3472. - PMC - PubMed
    1. Deutsch E.W., Overall C.M., Van Eyk J.E., Baker M.S., Paik Y.-K., Weintraub S.T., Lane L., Martens L., Vandenbrouck Y., Kusebauch U. et al. .. Human proteome project mass spectrometry data interpretation guidelines 2.1. J. Proteome Res. 2016; 15:3961–3970. - PMC - PubMed
    1. Uhlén M., Fagerberg L., Hallström B.M., Lindskog C., Oksvold P., Mardinoglu A., Sivertsson Å., Kampf C., Sjöstedt E., Asplund A. et al. .. Proteomics. Tissue-based map of the human proteome. Science. 2015; 347:1260419. - PubMed

Publication types