The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes
- PMID: 19498102
- PMCID: PMC2704439
- DOI: 10.1101/gr.080531.108
The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes
Erratum in
- Genome Res. 2009 Aug;19(8):1506
Abstract
Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.
Figures




Similar articles
-
Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.Nucleic Acids Res. 2018 Jan 4;46(D1):D221-D228. doi: 10.1093/nar/gkx1031. Nucleic Acids Res. 2018. PMID: 29126148 Free PMC article.
-
Tracking and coordinating an international curation effort for the CCDS Project.Database (Oxford). 2012 Mar 20;2012:bas008. doi: 10.1093/database/bas008. Print 2012. Database (Oxford). 2012. PMID: 22434842 Free PMC article.
-
Current status and new features of the Consensus Coding Sequence database.Nucleic Acids Res. 2014 Jan;42(Database issue):D865-72. doi: 10.1093/nar/gkt1059. Epub 2013 Nov 11. Nucleic Acids Res. 2014. PMID: 24217909 Free PMC article.
-
An Experimental Approach to Genome Annotation: This report is based on a colloquium sponsored by the American Academy of Microbiology held July 19-20, 2004, in Washington, DC.Washington (DC): American Society for Microbiology; 2004. Washington (DC): American Society for Microbiology; 2004. PMID: 33001599 Free Books & Documents. Review.
-
EGASP: the human ENCODE Genome Annotation Assessment Project.Genome Biol. 2006;7 Suppl 1(Suppl 1):S2.1-31. doi: 10.1186/gb-2006-7-s1-s2. Epub 2006 Aug 7. Genome Biol. 2006. PMID: 16925836 Free PMC article. Review.
Cited by
-
De novo gene disruptions in children on the autistic spectrum.Neuron. 2012 Apr 26;74(2):285-99. doi: 10.1016/j.neuron.2012.04.009. Neuron. 2012. PMID: 22542183 Free PMC article.
-
Transcriptional enhancers in protein-coding exons of vertebrate developmental genes.PLoS One. 2012;7(5):e35202. doi: 10.1371/journal.pone.0035202. Epub 2012 May 2. PLoS One. 2012. PMID: 22567096 Free PMC article.
-
Curating Clinically Relevant Transcripts for the Interpretation of Sequence Variants.J Mol Diagn. 2018 Nov;20(6):789-801. doi: 10.1016/j.jmoldx.2018.06.005. Epub 2018 Aug 8. J Mol Diagn. 2018. PMID: 30096381 Free PMC article.
-
Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples.BMC Bioinformatics. 2018 Nov 19;19(1):430. doi: 10.1186/s12859-018-2455-0. BMC Bioinformatics. 2018. PMID: 30453881 Free PMC article.
-
Phosphorylation mapping of Laminin β1-chain: Kinases in association with active sites.J Biosci. 2019 Jun;44(2):55. J Biosci. 2019. PMID: 31180068
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical