Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(12):e29314.
doi: 10.1371/journal.pone.0029314. Epub 2011 Dec 22.

Preservation of ranking order in the expression of human Housekeeping genes

Affiliations

Preservation of ranking order in the expression of human Housekeeping genes

Grace T W Shaw et al. PLoS One. 2011.

Abstract

Housekeeping (HK) genes fulfill the basic needs for a cell to survive and function properly. Their ubiquitous expression, originally thought to be constant, can vary from tissue to tissue, but this variation remains largely uncharacterized and it could not be explained by previously identified properties of HK genes such as short gene length and high GC content. By analyzing microarray expression data for human genes, we uncovered a previously unnoted characteristic of HK gene expression, namely that the ranking order of their expression levels tends to be preserved from one tissue to another. Further analysis by tensor product decomposition and pathway stratification identified three main factors of the observed ranking preservation, namely that, compared to those of non-HK (NHK) genes, the expression levels of HK genes show a greater degree of dispersion (less overlap), stableness (a smaller variation in expression between tissues), and correlation of expression. Our results shed light on regulatory mechanisms of HK gene expression that are probably different for different HK genes or pathways, but are consistent and coordinated in different tissues.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. HK genes exhibited a significantly more preserved expression ranking order than MR or TS genes.
Frequency is the percentage of tissue pairs with the indicated Kendall's tau (formula image). Each distribution is a compilation of formula image630 Kendall's tau (formula image; Equation (3 )), where each formula image was for a pair of tissues sampled from a 36-tissue pool and the frequency of formula image at a particular value was computed on a histogram using a bin width of 0.1. The average value (formula image) of the formula image distribution for the HK, MR, and TS gene sets was 0.77, 0.59, and 0.41, respectively, and the standard errors were all small, mostly less than 0.01.
Figure 2
Figure 2. Ranking preservation of HK genes was not a random event.
(A) Kendall's tau computed for the manually partitioned HK and MR genes (circles) compared to those generated from two different random distributions: the triangles show the results when the GSE2361 expression data were randomly divided into two sets containing the same number of HK (Ng = 388) and MR (Ng = 11,953) genes, while the squares show the results when expression levels were ignored and rankings were randomly created (see Methods). Each data point is a combination of the two formula image computed for the two gene groups (Ng = 388 and Ng = 11,953) for a particular pair of tissues. (B) The expression of all the genes in any two human tissues (heart and thymus are shown as an example) always has an elliptical shape, resulting in a substantially preserved ranking order with a non-zero formula image (>0.4) even for randomly grouped genes (triangles in (A)). In contrast, randomly assigned rankings would yield a formula image value very close to zero (squares in (A)). Gene expression levels were log2 transformed and were denoted by xgt.
Figure 3
Figure 3. Schematic illustration of the three factors that contribute to preserve gene expression rankings.
The three factors represent three different types of expression pattern of a pair of genes in different tissues: (A) Co-expression (rgg′; Equation (5 )), (B) stableness (Sgg′; Equation (7 )), and (C) dispersion (Dgg′ ; Equation (8 )).
Figure 4
Figure 4. The contributions to the ranking preservation of the HK, MR, and TS genes.
The three contributing factors were dispersion (solid line), stableness (dashed line), and co-expression (dotted line). For each factor, its contribution (in parenthesis) was the sum of the individual contributions calculated using tensor product decomposing equations, such as Equation (11 )–(13). The contribution of factors other than these three is given in the black box in the upper right corner of each panel.
Figure 5
Figure 5. The contributions to the ranking preservation of the HK genes in seven HK-enriched KEGG pathways.
Shown is the percentage contribution of the three factors, stableness, co-expression, and dispersion to the expression ranking (Kendall's tau, formula image) computed for the HK genes found in each of the seven HK-enriched KEGG pathways. The black bars are the contributions of other unknown factors.
Figure 6
Figure 6. Range of expression levels of the HK genes in HK-enriched KEGG pathways.
Shown are boxplots of the tissue-wide expression profiles of the HK genes in each of the 7 HK-enriched KEGG pathways. The results for the whole HK set (388 genes) and for the NHK set (12,687 genes) are presented on the right for comparison. Each box is bounded by the 25th and 75th percentile of the data, with the solid line within the box marking the median and the dotted line the mean, while the two short horizontal bars indicate the 90th and 10th percentile of the data and dots beyond these two bars are outliers. Gene expression levels were log2 transformed and are denoted by xgt.
Figure 7
Figure 7. The ribosomal HK genes are expressed at different levels in tissues of different embryonic origins.
Shown are boxplots of expression levels on a log2 scale (xgt) for 79 ribosomal HK genes (94.1% of the 84 ribosomal genes, Table 2) in each human tissue. Each box is bounded by the 25th and 75th percentile of the data, the solid line within the box marking the median. The two short horizontal bars indicate the 90th and 10th percentile of the data, while the crosses mark the 95th and the 5th percentile of the data. The tissues are grouped according to their embryonic origin.
Figure 8
Figure 8. A less coherently regulated subcomplex (β) reduced the overall expression ranking preservation of proteasomal genes.
Shown are gene expression rankings (formula image) computed for HK genes coding for proteins involved in different subcomplexes of the proteasome. The formula image value is shown in parenthesis at the bottom of the box. * indicates that the value is significantly different from formula image computed from random sampling using the same number of genes.
Figure 9
Figure 9. The relationships of the contributing components of gene expression rankings.
The four-circle Venn diagram is used to illustrate the intertwined relationships between gene expression ranking (formula image) and the three contributing factors of co-expression (formula image, Equation (5 )), stableness (formula image, Equation (7 )), and dispersion (formula image, Equation (8 )). The rectangular box represents the universal set, its 16 components can be deduced from tensor computations (Equation (9 )), e.g., A8 = formula image, A10 = formula image, and A16 = formula image. The equations for computing these components are described in the Methods (Equation (11 13 )). Note that each of the four elements (formula image,formula image,formula image, and formula image) is a composite of eight components.

Similar articles

Cited by

References

    1. Watson JD, Hopkins NH, Roberts JW, Steitz JA, Weiner AM. The functioning of higher eucaryotic genes. Molecular biology of the gene. 1 ed. Menlo Park, , California: Benjamin/Cummings; 1965. 704
    1. Tu Z, Wang L, Xu M, Zhou X, Chen T, et al. Further understanding human disease genes by comparing with housekeeping genes and other genes. BMC Genomics. 2006;7:31. - PMC - PubMed
    1. Li B, Reese JC. Derepression of DNA damage-regulated genes requires yeast TAF(II)s. Embo J. 2000;19:4091–4100. - PMC - PubMed
    1. Giallongo A, Yon J, Fried M. Ribosomal protein L7a is encoded by a gene (Surf-3) within the tightly clustered mouse surfeit locus. Mol Cell Biol. 1989;9:224–231. - PMC - PubMed
    1. Yamada H, Chen D, Monstein HJ, Hakanson R. Effects of fasting on the expression of gastrin, cholecystokinin, and somatostatin genes and of various housekeeping genes in the pancreas and upper digestive tract of rats. Biochem Biophys Res Commun. 1997;231:835–838. - PubMed

Publication types