CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise
- PMID: 30486838
- PMCID: PMC6260756
- DOI: 10.1186/s13059-018-1590-2
CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise
Abstract
We assembled the sequences from deep RNA sequencing experiments by the Genotype-Tissue Expression (GTEx) project, to create a new catalog of human genes and transcripts, called CHESS. The new database contains 42,611 genes, of which 20,352 are potentially protein-coding and 22,259 are noncoding, and a total of 323,258 transcripts. These include 224 novel protein-coding genes and 116,156 novel transcripts. We detected over 30 million additional transcripts at more than 650,000 genomic loci, nearly all of which are likely nonfunctional, revealing a heretofore unappreciated amount of transcriptional noise in human cells. The CHESS database is available at http://ccb.jhu.edu/chess .
Keywords: GTEx; Human gene count; RNA sequencing; Transcriptome; Transcriptome assembly.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures





Comment in
-
We simply cannot go on being so vague about 'function'.Genome Biol. 2018 Dec 18;19(1):223. doi: 10.1186/s13059-018-1600-4. Genome Biol. 2018. PMID: 30563541 Free PMC article.
Similar articles
-
CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.Genome Biol. 2023 Oct 30;24(1):249. doi: 10.1186/s13059-023-03088-4. Genome Biol. 2023. PMID: 37904256 Free PMC article.
-
A transcriptional sketch of a primary human breast cancer by 454 deep sequencing.BMC Genomics. 2009 Apr 20;10:163. doi: 10.1186/1471-2164-10-163. BMC Genomics. 2009. PMID: 19379481 Free PMC article.
-
High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus.BMC Genomics. 2010 Oct 11;11:559. doi: 10.1186/1471-2164-11-559. BMC Genomics. 2010. PMID: 20937131 Free PMC article.
-
Processing and Analysis of RNA-seq Data from Public Resources.Methods Mol Biol. 2021;2243:81-94. doi: 10.1007/978-1-0716-1103-6_4. Methods Mol Biol. 2021. PMID: 33606253 Review.
-
Alternative mRNA transcription, processing, and translation: insights from RNA sequencing.Trends Genet. 2015 Mar;31(3):128-39. doi: 10.1016/j.tig.2015.01.001. Epub 2015 Jan 30. Trends Genet. 2015. PMID: 25648499 Review.
Cited by
-
Assessing the functional relevance of splice isoforms.NAR Genom Bioinform. 2021 May 22;3(2):lqab044. doi: 10.1093/nargab/lqab044. eCollection 2021 Jun. NAR Genom Bioinform. 2021. PMID: 34046593 Free PMC article.
-
Splicing and editing of ionotropic glutamate receptors: a comprehensive analysis based on human RNA-Seq data.Cell Mol Life Sci. 2021 Jul;78(14):5605-5630. doi: 10.1007/s00018-021-03865-z. Epub 2021 Jun 8. Cell Mol Life Sci. 2021. PMID: 34100982 Free PMC article.
-
Systematic characterization of cancer transcriptome at transcript resolution.Nat Commun. 2022 Nov 10;13(1):6803. doi: 10.1038/s41467-022-34568-z. Nat Commun. 2022. PMID: 36357395 Free PMC article.
-
Predicting the Structural Impact of Human Alternative Splicing.bioRxiv [Preprint]. 2023 Dec 24:2023.12.21.572928. doi: 10.1101/2023.12.21.572928. bioRxiv. 2023. PMID: 38187531 Free PMC article. Preprint.
-
Polar/apolar interfaces modulate the conformational behavior of cyclic peptides with impact on their passive membrane permeability.RSC Adv. 2022 Feb 16;12(10):5782-5796. doi: 10.1039/d1ra09025a. eCollection 2022 Feb 16. RSC Adv. 2022. PMID: 35424539 Free PMC article.
References
-
- Liang F, Holt I, Pertea G, Karamycheva S, Salzberg SL, Quackenbush J. Correction: gene index analysis of the human genome estimates approximately 120,000 genes. Nat Genet. 2000;26:501. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources