Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Sep;3(3):344-60.
doi: 10.3390/genes3030344.

The human transcriptome: an unfinished story

Affiliations

The human transcriptome: an unfinished story

Mihaela Pertea. Genes (Basel). 2012 Sep.

Abstract

Despite recent technological advances, the study of the human transcriptome is still in its early stages. Here we provide an overview of the complex human transcriptomic landscape, present the bioinformatics challenges posed by the vast quantities of transcriptomic data, and discuss some of the studies that have tried to determine how much of the human genome is transcribed. Recent evidence has suggested that more than 90% of the human genome is transcribed into RNA. However, this view has been strongly contested by groups of scientists who argued that many of the observed transcripts are simply the result of transcriptional noise. In this review, we conclude that the full extent of transcription remains an open question that will not be fully addressed until we decipher the complete range and biological diversity of the transcribed genomic sequences.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Composition of the human transcriptome. (a) Venn diagram of the number of loci containing mRNA transcripts (green), long ncRNAs (blue), and small ncRNAs (red); (b) Base pair coverage of the transcriptome by the three categories of transcripts.
Figure 2
Figure 2
The size of the transcriptome, computed as the fraction of the total number of base pairs in the human genome covered by the assembled transcripts, for 16 normal human tissues included in the Illumina Body Map [98]. Each RNA-seq data set was mapped to the genome with TopHat [78] and assembled with Cufflinks [41]. Note that except for adrenal tissue, in which transcripts cover 5.3% of the human genome, all other reconstructed transcriptomes are smaller in size than the currently annotated transcriptome.

References

    1. Ohno S. So much “junk” DNA in our genome. Brookhaven Symp. Biol. 1972;23:366–370. - PubMed
    1. The International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. - DOI - PubMed
    1. Venter J.C., Adams M.D., Myers E.W., Li P.W., Mural R.J., Sutton G.G., Smith H.O., Yandell M., Evans C.A., Holt R.A., et al. The sequence of the human genome. Science. 2001;291:1304–1351. - PubMed
    1. Chen J., Sun M., Lee S., Zhou G., Rowley J.D., Wang S.M. Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags. Proc. Natl. Acad. Sci. USA. 2002;99:12257–12262. - PMC - PubMed
    1. Kapranov P., Cawley S.E., Drenkow J., Bekiranov S., Strausberg R.L., Fodor S.P., Gingeras T.R. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–919. doi: 10.1126/science.1068597. - DOI - PubMed

LinkOut - more resources