Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2014 Aug;35(8):927-35.
doi: 10.1002/humu.22594. Epub 2014 Jun 24.

Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation

Affiliations
Review

Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation

Maria Livia Famiglietti et al. Hum Mutat. 2014 Aug.

Abstract

During the last few years, next-generation sequencing (NGS) technologies have accelerated the detection of genetic variants resulting in the rapid discovery of new disease-associated genes. However, the wealth of variation data made available by NGS alone is not sufficient to understand the mechanisms underlying disease pathogenesis and manifestation. Multidisciplinary approaches combining sequence and clinical data with prior biological knowledge are needed to unravel the role of genetic variants in human health and disease. In this context, it is crucial that these data are linked, organized, and made readily available through reliable online resources. The Swiss-Prot section of the Universal Protein Knowledgebase (UniProtKB/Swiss-Prot) provides the scientific community with a collection of information on protein functions, interactions, biological pathways, as well as human genetic diseases and variants, all manually reviewed by experts. In this article, we present an overview of the information content of UniProtKB/Swiss-Prot to show how this knowledgebase can support researchers in the elucidation of the mechanisms leading from a molecular defect to a disease phenotype.

Keywords: UniProtKB/Swiss-Prot; controlled vocabulary; database; disease; functional annotation; genetic variants; manual curation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart of variant annotation in UniProtKB/Swiss-Prot. Variant annotation is based on published experimental data. The first step is to select a relevant article. This is achieved using several complementary approaches, including browsing specialized journals, alerts from literature databases and text mining. Users are also invited to take part in this process by contacting us to draw our attention on obsolete entries and/or to interesting publications. Articles linking protein information with medical disorders are critically reviewed by expert curators, and variant identification, disease description, and/or protein functional characterization are annotated based on supporting evidence. This annotation is submitted to various manual and automated checks before final integration into UniProtKB/Swiss-Prot. The disease nomenclature is based on OMIM, if available. If the disorder is not reported in OMIM, names and acronyms are created by the UniProtKB/Swiss-Prot staff on the basis of published reports.
Figure 2
Figure 2
Excerpt from UniProtKB/Swiss-Prot entry Q5HYA8 representing human Meckelin (TMEM67). The “Sequence annotation (Features)” section describes the sequence and sequence variants at the single residue level. Note the presence of three types of variants: a neutral polymorphism at position 261, disease variants associated with ciliopathies MKS3, COACHS, and NPHP11, and VUS at positions 245 and 296. Note that disease-linked variant p.Asn242Thr affects a predicted N-glycosylation site (see subsection “Amino acid modifications”). Disease-linked variant p.Gln376Pro perturbs protein subcellular location.
Figure 3
Figure 3
UniProtKB/Swiss-Prot page for human Meckelin (TMEM67) variant p.Asn242Thr.
Figure 4
Figure 4
Excerpt from the “General annotation (Comments)” section in Q5HYA8, containing functional annotations based on publications. TMEM67 mutations are involved in several ciliopathies, including Meckel syndrome 3 (MKS3), Joubert syndrome 6 (JBTS6), Bardet–Biedl syndrome (BBS), COACH syndrome (COACHS), and nephronophthisis 11 (NPHP11). The precise type of association with the disease, i.e. confirmed or probable pathological role, susceptibility to disease or disease modification, is indicated in the “Note” using a controlled vocabulary.

References

    1. 1000 Genomes Project Consortium. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. - PMC - PubMed
    1. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7:248–249. - PMC - PubMed
    1. Barabási AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12:56–68. - PMC - PubMed
    1. Beales PL, Badano JL, Ross AJ, Ansley SJ, Hoskins BE, Kirsten B, Mein CA, Froguel P, Scambler PJ, Lewis RA, Lupski JR, Katsanis N. Genetic interaction of BBS1 mutations with alleles at other BBS loci can result in non-Mendelian Bardet-Biedl syndrome. Am J Hum Genet. 2003;72:1187–1199. - PMC - PubMed
    1. Capriotti E, Calabrese R, Fariselli P, Martelli PL, Altman RB, Casadio R. WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation. BMC Genomics. 2013;14(Suppl 3):S6. - PMC - PubMed

Publication types