Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 13;18(7):e1010295.
doi: 10.1371/journal.pcbi.1010295. eCollection 2022 Jul.

What are housekeeping genes?

Affiliations

What are housekeeping genes?

Chintan J Joshi et al. PLoS Comput Biol. .

Abstract

The concept of "housekeeping gene" has been used for four decades but remains loosely defined. Housekeeping genes are commonly described as "essential for cellular existence regardless of their specific function in the tissue or organism", and "stably expressed irrespective of tissue type, developmental stage, cell cycle state, or external signal". However, experimental support for the tenet that gene essentiality is linked to stable expression across cell types, conditions, and organisms has been limited. Here we use genome-scale functional genomic screens together with bulk and single-cell sequencing technologies to test this link and optimize a quantitative and experimentally validated definition of housekeeping gene. Using the optimized definition, we identify, characterize, and provide as resources, housekeeping gene lists extracted from several human datasets, and 10 other animal species that include primates, chicken, and C. elegans. We find that stably expressed genes are not necessarily essential, and that the individual genes that are essential and stably expressed can considerably differ across organisms; yet the pathways enriched among these genes are conserved. Further, the level of conservation of housekeeping genes across the analyzed organisms captures their taxonomic groups, showing evolutionary relevance for our definition. Therefore, we present a quantitative and experimentally supported definition of housekeeping genes that can contribute to better understanding of their unique biological and evolutionary characteristics.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The relationship between the axiomatically assigned four properties of housekeeping genes remains unexplored.
Fig 2
Fig 2. Gini coefficient captures the first property of housekeeping genes: stable expression.
(A) Coverage of the 3688 Eisenberg and Levanon housekeeping genes [1] among the 3688 genes with lowest Gini coefficients within each dataset. (B) Gini coefficients of individual genes are highly correlated across human datasets regardless of sample type; however, spearman correlations within sample types are tighter. Dots represent unique genes. Red specifically depicts housekeeping genes identified by Eisenberg and Levanon [1].
Fig 3
Fig 3. Housekeeping genes are mostly expressed at relatively low levels.
(A-B) Comparison of median expression levels and Gini coefficients calculated for the (C) HPA and (B) GTEx datasets. The spearman correlations between Gini coefficients and median expression values were -0.6545 (p = 0) and -0.7244 (p = 0) for HPA and GTEx, respectively. Dots represent unique genes. Red specifically depicts housekeeping genes identified by Eisenberg and Levanon [1].
Fig 4
Fig 4. Gini genes are essential.
Gini coefficients of essential genes (black dots) compared to the housekeeping genes from Eisenberg and Levanon when using (A) CellMiner (Wilcoxon p-value = 8.373 x 10−38; KS p-value = 2.48 x 10−26) and (B) Klijn et al. [37] (Wilcoxon p-value = 5.96 x 10−18; KS p-value = 4.44 x 10−23) cancer datasets, and (C) CHO datasets. The 2800 genes essential in the 20 cancer cell lines were extracted from DepMap [40,41], and the 338 CHO essential genes were extracted from Xiong et al. [39] (D) GO term coverage of essential genes and that of Gini genes from CellMiner (blue, 0.8557), Klijn et al. [37] (yellow, 0.8907), and CHO (green, 0.7055) are correlated. The left-tailed Wilcoxon p-values when comparing essential genes with the overall datasets are 2.35 x 10−146 for CellMiner and 9.16 x 10−313 for Klijn et al. [37] This suggests that essential genes and housekeeping genes are coming from populations with equal medians and similar cumulative distribution functions. The slightly lower correlation in CHO cells is likely due to the fewer number of essential genes reported in the CHO screen. (E) High- and medium-confidence essential genes in healthy C. elegans have significantly lower GC than non-essential genes (p = 4.53 x 10−5).
Fig 5
Fig 5. Gini coefficients accurately capture organism-specific differences.
(A-B) Spearman correlations of Gini coefficients of Gini genes (bottom 20th percentile of GC in at least one organism) identified using organism-specific transcriptomes capture cluster containing primates. The number of Gini genes with 1:1 orthologs in all organisms is shown using the bar plot on the right of the dendrogram. Please see S10 Fig. for p-values. (C) Principal component 1 (PC1) also captures the cluster containing primates. Also shown are the top 20 primate Gini genes (pink), the top 20 shared Gini genes (green), and top 20 Gini genes in all non-primates (blue) using the principal component coefficients of the first principal components. (D) Correlation among Gini coefficients across different organisms reproduce cluster containing primates (left panel). The Gini coefficients of genes belonging to top 20, middle 20, and bottom 20 coefficients of PC1 are shown (right panel). Left 20 Gini genes are specific to non-primates, middle 20 Gini genes are shared, and right 20 Gini genes are specific to primates. (E) GO term coverage is highly correlated across different datasets, also shown are the GO term correlations with human datasets used in S6 Fig.

Similar articles

Cited by

References

    1. Eisenberg E, Levanon EY. Human housekeeping genes, revisited. Trends Genet. 2013;29: 569–574. doi: 10.1016/j.tig.2013.05.010 - DOI - PubMed
    1. Tu Z, Wang L, Xu M, Zhou X, Chen T, Sun F. Further understanding human disease genes by comparing with housekeeping genes and other genes. BMC Genomics. 2006;7: 31. doi: 10.1186/1471-2164-7-31 - DOI - PMC - PubMed
    1. Butte AJ, Dzau VJ, Glueck SB. Further defining housekeeping, or “maintenance,” genes Focus on “A compendium of gene expression in normal human tissues”. Physiol Genomics. 2001. doi: 10.1007/s10857-005-4766-0 - DOI - PubMed
    1. Zhu J, He F, Hu S, Yu J. On the nature of human housekeeping genes. Trends Genet. 2008;24: 481–484. doi: 10.1016/j.tig.2008.08.004 - DOI - PubMed
    1. Warrington JA, Nair A, Mahadevappa M, Tsyganskaya M. Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes. Physiol Genomics. 2000;2000: 143–147. doi: 10.1152/physiolgenomics.2000.2.3.143 - DOI - PubMed

Publication types