Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Mar 24:arXiv:2303.13996v1.

The status of the human gene catalogue

Affiliations

The status of the human gene catalogue

Paulo Amaral et al. ArXiv. .

Update in

  • The status of the human gene catalogue.
    Amaral P, Carbonell-Sala S, De La Vega FM, Faial T, Frankish A, Gingeras T, Guigo R, Harrow JL, Hatzigeorgiou AG, Johnson R, Murphy TD, Pertea M, Pruitt KD, Pujar S, Takahashi H, Ulitsky I, Varabyou A, Wells CA, Yandell M, Carninci P, Salzberg SL. Amaral P, et al. Nature. 2023 Oct;622(7981):41-47. doi: 10.1038/s41586-023-06490-x. Epub 2023 Oct 4. Nature. 2023. PMID: 37794265 Free PMC article. Review.

Abstract

Scientists have been trying to identify all of the genes in the human genome since the initial draft of the genome was published in 2001. Over the intervening years, much progress has been made in identifying protein-coding genes, and the estimated number has shrunk to fewer than 20,000, although the number of distinct protein-coding isoforms has expanded dramatically. The invention of high-throughput RNA sequencing and other technological breakthroughs have led to an explosion in the number of reported non-coding RNA genes, although most of them do not yet have any known function. A combination of recent advances offers a path forward to identifying these functions and towards eventually completing the human gene catalogue. However, much work remains to be done before we have a universal annotation standard that includes all medically significant genes, maintains their relationships with different reference genomes, and describes clinically relevant genetic variants.

PubMed Disclaimer

References

    1. Understanding our genetic inheritance: The US Human Genome Project, the first five years 1991–1995. (U.S. Department of Health and Human Services and U.S. Department of Energy, 1990).
    1. Nurk S. et al. The complete sequence of a human genome. Science 376, 44–53, doi:10.1126/science.abj6987 (2022). - DOI - PMC - PubMed
    1. The Encode Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74, doi:10.1038/nature11247 (2012). - DOI - PMC - PubMed
    1. Kawaji H., Kasukawa T., Forrest A., Carninci P. & Hayashizaki Y. The FANTOM5 collection, a data series underpinning mammalian transcriptome atlases in diverse cell types. Sci Data 4, 170113, doi:10.1038/sdata.2017.113 (2017). - DOI - PMC - PubMed
    1. Fields C., Adams M. D., White O. & Venter J. C. How many genes in the human genome? Nature Genetics 7, 345–346, doi:10.1038/ng0794-345 (1994). - DOI - PubMed

Publication types

LinkOut - more resources