Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2012 Dec;10(6):317-25.
doi: 10.1016/j.gpb.2012.06.006. Epub 2012 Nov 29.

The curation of genetic variants: difficulties and possible solutions

Affiliations
Review

The curation of genetic variants: difficulties and possible solutions

Kapil Raj Pandey et al. Genomics Proteomics Bioinformatics. 2012 Dec.

Abstract

The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale thus necessitating the need for automation. Curation of variants described in biomedical literature may not be straightforward mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants is inclined to the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation, can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods.

PubMed Disclaimer

References

    1. Bale S., Devisscher M., Van Criekinge W., Rehm H.L., Decouttere F., Nussbaum R. MutaDATABASE: a centralized and standardized DNA variation database. Nat Biotech. 2011;29:117–118. - PubMed
    1. Wildeman M., van Ophuizen E., den Dunnen J.T., Taschner P.E. Improving sequence variant descriptions in variant databases and literature using the Mutalyzer sequence variation nomenclature checker. Hum Mutat. 2008;29:6–13. - PubMed
    1. Gieger C., Deneke H., Fluck J. The future of text mining in genome-based clinical research. Biosilico. 2003;1:97–102.
    1. Shatkay H., Feldman R. Mining the biomedical literature in the genomic era: an overview. J Comput Biol. 2003;10:821–855. - PubMed
    1. Van Auken K., Jaffery J., Chan J., Muller H.M., Sternberg P.W. Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation. BMC Bioinformatics. 2009;10:228. - PMC - PubMed