Somatic mutation databases as tools for molecular epidemiology and molecular pathology of cancer: proposed guidelines for improving data collection, distribution, and integration
- PMID: 19006239
- DOI: 10.1002/humu.20832
Somatic mutation databases as tools for molecular epidemiology and molecular pathology of cancer: proposed guidelines for improving data collection, distribution, and integration
Abstract
There are currently less than 40 locus-specific databases (LSDBs) and one large general database that curate data on somatic mutations in human cancer genes. These databases have different scope and use different annotation standards and database systems, resulting in duplicated efforts in data curation, and making it difficult for users to find clear and consistent information. As data related to somatic mutations are generated at an increasing pace it is urgent to create a framework for improving the collecting of this information and making it more accessible to clinicians, scientists, and epidemiologists to facilitate research on biomarkers. Here we propose a data flow for improving the connectivity between existing databases and we provide practical guidelines for data reporting, database contents, and annotation standards. These proposals are based on common standards recommended by the Human Genome Variation Society (HGVS) with additions related to specific requirements of somatic mutations in cancer. Indeed, somatic mutations may be used in molecular pathology and clinical studies to characterize tumor types, help treatment choice, predict response to treatment and patient outcome, or in epidemiological studies as markers for tumor etiology or exposure assessment. Thus, specific annotations are required to cover these diverse research topics. This initiative is meant to promote collaboration and discussion on these issues and the development of adequate resources that would avoid the loss of extremely valuable information generated by years of basic and clinical research.
2008 Wiley-Liss, Inc.
Similar articles
-
Locus-specific database domain and data content analysis: evolution and content maturation toward clinical use.Hum Mutat. 2010 Oct;31(10):1109-16. doi: 10.1002/humu.21332. Hum Mutat. 2010. PMID: 20672379
-
Time for a unified system of mutation description and reporting: a review of locus-specific mutation databases.Genome Res. 2002 May;12(5):680-8. doi: 10.1101/gr.217702. Genome Res. 2002. PMID: 11997335
-
Guidelines for establishing locus specific databases.Hum Mutat. 2012 Feb;33(2):298-305. doi: 10.1002/humu.21646. Epub 2011 Dec 9. Hum Mutat. 2012. PMID: 22052659
-
The UMD-p53 database: new mutations and analysis tools.Hum Mutat. 2003 Mar;21(3):176-81. doi: 10.1002/humu.10187. Hum Mutat. 2003. PMID: 12619103 Review.
-
TP53 mutations in human cancer: database reassessment and prospects for the next decade.Adv Cancer Res. 2011;110:107-39. doi: 10.1016/B978-0-12-386469-7.00005-0. Adv Cancer Res. 2011. PMID: 21704230 Review.
Cited by
-
CanProVar: a human cancer proteome variation database.Hum Mutat. 2010 Mar;31(3):219-28. doi: 10.1002/humu.21176. Hum Mutat. 2010. PMID: 20052754 Free PMC article.
-
Exploring the cancer genome in the era of next-generation sequencing.Front Med. 2012 Mar;6(1):48-55. doi: 10.1007/s11684-012-0182-x. Epub 2012 Mar 31. Front Med. 2012. PMID: 22460448 Review.
-
Our changing view of the genomic landscape of cancer.J Pathol. 2010 Jan;220(2):231-43. doi: 10.1002/path.2645. J Pathol. 2010. PMID: 19918804 Free PMC article. Review.
-
A bioinformatics workflow for variant peptide detection in shotgun proteomics.Mol Cell Proteomics. 2011 May;10(5):M110.006536. doi: 10.1074/mcp.M110.006536. Epub 2011 Mar 9. Mol Cell Proteomics. 2011. PMID: 21389108 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials