UniProt: the universal protein knowledgebase in 2021
- PMID: 33237286
- PMCID: PMC7778908
- DOI: 10.1093/nar/gkaa1100
UniProt: the universal protein knowledgebase in 2021
Abstract
The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.
© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures






Similar articles
-
UniProt: the Universal Protein Knowledgebase in 2023.Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052. Nucleic Acids Res. 2023. PMID: 36408920 Free PMC article.
-
UniProt: a worldwide hub of protein knowledge.Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049. Nucleic Acids Res. 2019. PMID: 30395287 Free PMC article.
-
UniProt: the Universal Protein Knowledgebase in 2025.Nucleic Acids Res. 2025 Jan 6;53(D1):D609-D617. doi: 10.1093/nar/gkae1010. Nucleic Acids Res. 2025. PMID: 39552041 Free PMC article.
-
From the research laboratory to the database: the Caenorhabditis elegans kinome in UniProtKB.Biochem J. 2017 Feb 15;474(4):493-515. doi: 10.1042/BCJ20160991. Biochem J. 2017. PMID: 28159896 Free PMC article. Review.
-
UniProt and Mass Spectrometry-Based Proteomics-A 2-Way Working Relationship.Mol Cell Proteomics. 2023 Aug;22(8):100591. doi: 10.1016/j.mcpro.2023.100591. Epub 2023 Jun 8. Mol Cell Proteomics. 2023. PMID: 37301379 Free PMC article. Review.
Cited by
-
Proteogenomic analysis of air-pollution-associated lung cancer reveals prevention and therapeutic opportunities.Elife. 2024 Oct 21;13:RP95453. doi: 10.7554/eLife.95453. Elife. 2024. PMID: 39432560 Free PMC article.
-
CAMPR4: a database of natural and synthetic antimicrobial peptides.Nucleic Acids Res. 2023 Jan 6;51(D1):D377-D383. doi: 10.1093/nar/gkac933. Nucleic Acids Res. 2023. PMID: 36370097 Free PMC article.
-
NPInter v5.0: ncRNA interaction database in a new era.Nucleic Acids Res. 2023 Jan 6;51(D1):D232-D239. doi: 10.1093/nar/gkac1002. Nucleic Acids Res. 2023. PMID: 36373614 Free PMC article.
-
Fine-tuning protein embeddings for functional similarity evaluation.Bioinformatics. 2024 Aug 2;40(8):btae445. doi: 10.1093/bioinformatics/btae445. Bioinformatics. 2024. PMID: 38985218 Free PMC article.
-
DMP8 and 9 regulate HAP2/GCS1 trafficking for the timely acquisition of sperm fusion competence.Proc Natl Acad Sci U S A. 2022 Nov 8;119(45):e2207608119. doi: 10.1073/pnas.2207608119. Epub 2022 Nov 2. Proc Natl Acad Sci U S A. 2022. PMID: 36322734 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources