Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Apr 15;100(8):4702-5.
doi: 10.1073/pnas.0831040100. Epub 2003 Apr 1.

Comprehensive sampling of gene expression in human cell lines with massively parallel signature sequencing

Affiliations

Comprehensive sampling of gene expression in human cell lines with massively parallel signature sequencing

C Victor Jongeneel et al. Proc Natl Acad Sci U S A. .

Abstract

Whereas information is rapidly accumulating about the structure and position of genes encoded in the human genome, less is known about the complexity and relative abundance of their expression in individual human cells and tissues. Here, we describe the characteristics of the transcriptomes of two cultured cell lines, HB4a (normal breast epithelium) and HCT-116 (colon adenocarcinoma), using massively parallel signature sequencing (MPSS). We generated in excess of 10(7) short signature sequences per cell line, thus providing a comprehensive snapshot of gene expression, within the technical limitations of the method. The number of genes expressed at one copy per cell or more in either of the lines was estimated to be between 10,000 and 15,000. The vast majority of the transcripts found in these cells can be mapped to known genes and their polyadenylation variants. Among the genes that could be identified from their signature sequences, approximately 8,500 were expressed by both cell lines, whereas 6,000 showed cellular specificity. Taking into account sequence tags that map uniquely to the genome but not to known transcripts, overall the data are consistent with an upper limit of 17,000 for the total number of genes expressed at more than one copy per cell in one or both of the two cell lines examined.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Utilization of polyadenylation sites as a function of their position relative to the transcript 5′ end (site 1 is the 5′ most). Only transcripts with five polyadenylation sites or fewer were taken into account, because the numbers obtained from of those with six or more (244 genes total) are insufficient to yield significant results.

References

    1. Adams M D, Soares M B, Kerlavage A R, Fields C, Venter J C. Nat Genet. 1993;4:373–380. - PubMed
    1. Strausberg R L, Riggins G J. Proc Natl Acad Sci USA. 2001;98:11837–11838. - PMC - PubMed
    1. Camargo A A, Samaia H P B, Dias-Neto E, Simão D F, Migotto I A, Briones M R, Costa F F, Nagai M A, Verjovski-Almeida S, Zago M A, et al. Proc Natl Acad Sci USA. 2001;98:12103–12108. - PMC - PubMed
    1. Velculescu V E, Zhang L, Vogelstein B, Kinzler K W. Science. 1995;270:484–487. - PubMed
    1. Lal A, Lash A E, Altschul S F, Velculescu V, Zhang L, McLendon R E, Marra M A, Prange C, Morin P J, Polyak K, et al. Cancer Res. 1999;59:5403–5407. - PubMed

Publication types