Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 6;51(D1):D523-D531.
doi: 10.1093/nar/gkac1052.

UniProt: the Universal Protein Knowledgebase in 2023

Collaborators

UniProt: the Universal Protein Knowledgebase in 2023

UniProt Consortium. Nucleic Acids Res. .

Abstract

The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication we describe enhancements made to our data processing pipeline and to our website to adapt to an ever-increasing information content. The number of sequences in UniProtKB has risen to over 227 million and we are working towards including a reference proteome for each taxonomic group. We continue to extract detailed annotations from the literature to update or create reviewed entries, while unreviewed entries are supplemented with annotations provided by automated systems using a variety of machine-learning techniques. In addition, the scientific community continues their contributions of publications and annotations to UniProt entries of their interest. Finally, we describe our new website (https://www.uniprot.org/), designed to enhance our users' experience and make our data easily accessible to the research community. This interface includes access to AlphaFold structures for more than 85% of all entries as well as improved visualisations for subcellular localisation of proteins.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(A) Growth of UniProt databases over the last 10 years and (B) Growth of Reference Proteomes and taxonomic breakdown.
Figure 2.
Figure 2.
Statistics of UniProt crowdsourcing activity. (A) Cumulative number of submissions, unique publications and proteins covered, and number of contributors for selected releases. Release 2019_08 was the first release where community submissions appeared, and 2022_03 is the latest release at the time of this manuscript preparation. (B) Taxonomic distribution of unique protein entries that have at least one publication submitted by the community. (C) Block chart showing the relative distribution of annotations by categories.
Figure 3
Figure 3
(A) Card view of results following a search of the UniProt website and (B). Table view of results following the same search.
Figure 4.
Figure 4.
UniProtKB entry CL18A_HUMAN (UniProtKB: A5D8T8) shows an embedded SwissBioPics image: the generic animal cell (Eumetazoa) is selected based on organism taxonomy. It contains 71 interactive locations, of which the endoplasmic reticulum, Golgi apparatus, endosome and secretory space are highlighted using annotations from the UniProt entry.

Similar articles

Cited by

References

    1. Varadi M., Anyango S., Deshpande M., Nair S., Natassia C., Yordanova G., Yuan D., Stroe O., Wood G., Laydon A.et al. .. AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022; 50:D439–D444. - PMC - PubMed
    1. Arita M., Karsch-Mizrachi I., Cochrane G.. The international nucleotide sequence database collaboration. Nucleic Acids Res. 2021; 49:D121–D124. - PMC - PubMed
    1. Cummins C., Ahamed A., Aslam R., Burgin J., Devraj R., Edbali O., Gupta D., Harrison P.W., Haseeb M., Holt S.et al. .. The European Nucleotide Archive in 2021. Nucleic Acids Res. 2022; 50:D106–D110. - PMC - PubMed
    1. Sayers E.W., Bolton E.E., Brister J.R., Canese K., Chan J., Comeau D.C., Connor D.C., Funk K., Kelly C., Kim S.. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2022; 50:D20–D26. - PMC - PubMed
    1. Fukuda A., Kodama Y., Mashima J., Fujisawa T., Ogasawara O.. DDBJ update: streamlining submission and access of human data. Nucleic Acids Res. 2021; 49:D71–D75. - PMC - PubMed

Publication types