Sequence features that drive human promoter function and tissue specificity
- PMID: 20501695
- PMCID: PMC2892090
- DOI: 10.1101/gr.100370.109
Sequence features that drive human promoter function and tissue specificity
Abstract
Promoters are important regulatory elements that contain the necessary sequence features for cells to initiate transcription. To functionally characterize a large set of human promoters, we measured the transcriptional activities of 4575 putative promoters across eight cell lines using transient transfection reporter assays. In parallel, we measured gene expression in the same cell lines and observed a significant correlation between promoter activity and endogenous gene expression (r = 0.43). As transient transfection assays directly measure the promoting effect of a defined fragment of DNA sequence, decoupled from epigenetic, chromatin, or long-range regulatory effects, we sought to predict whether a promoter was active using sequence features alone. CG dinucleotide content was highly predictive of ubiquitous promoter activity, necessitating the separation of promoters into two groups: high CG promoters, mostly ubiquitously active, and low CG promoters, mostly cell line-specific. Computational models trained on the binding potential of transcriptional factor (TF) binding motifs could predict promoter activities in both high and low CG groups: average area under the receiver operating characteristic curve (AUC) of the models was 91% and exceeded the AUC of CG content by an average of 23%. Known relationships, for example, between HNF4A and hepatocytes, were recapitulated in the corresponding cell lines, in this case the liver-derived cell line HepG2. Half of the associations between tissue-specific TFs and cell line-specific promoters were new. Our study underscores the importance of collecting functional information from complementary assays and conditions to understand biology in a systematic framework.
Figures





Similar articles
-
Differentiation-specific transcriptional regulation of the hepatitis B virus large surface antigen gene in human hepatoma cell lines.J Virol. 1990 May;64(5):2360-8. doi: 10.1128/JVI.64.5.2360-2368.1990. J Virol. 1990. PMID: 2157890 Free PMC article.
-
Distal apolipoprotein C-III regulatory elements F to J act as a general modular enhancer for proximal promoters that contain hormone response elements. Synergism between hepatic nuclear factor-4 molecules bound to the proximal promoter and distal enhancer sites.Arterioscler Thromb Vasc Biol. 1997 Jan;17(1):222-32. doi: 10.1161/01.atv.17.1.222. Arterioscler Thromb Vasc Biol. 1997. PMID: 9012660
-
Differential promoter usage in prolactin receptor gene expression: hepatocyte nuclear factor 4 binds to and activates the promoter preferentially active in the liver.Mol Endocrinol. 1996 Jun;10(6):661-71. doi: 10.1210/mend.10.6.8776726. Mol Endocrinol. 1996. PMID: 8776726
-
Predicting cell-type-specific gene expression from regions of open chromatin.Genome Res. 2012 Sep;22(9):1711-22. doi: 10.1101/gr.135129.111. Genome Res. 2012. PMID: 22955983 Free PMC article.
-
Functional promoter polymorphisms direct the expression of cystathionine gamma-lyase gene in mouse models of essential hypertension.J Mol Cell Cardiol. 2017 Jan;102:61-73. doi: 10.1016/j.yjmcc.2016.11.005. Epub 2016 Nov 16. J Mol Cell Cardiol. 2017. PMID: 27865915
Cited by
-
Genetic and epigenetic features of promoters with ubiquitous chromatin accessibility support ubiquitous transcription of cell-essential genes.Nucleic Acids Res. 2021 Jun 4;49(10):5705-5725. doi: 10.1093/nar/gkab345. Nucleic Acids Res. 2021. PMID: 33978759 Free PMC article.
-
CD28 Gene Polymorphisms in the Promoter Region Are Associated with Transfusion Reactions: A Functional Study.J Clin Med. 2021 Feb 20;10(4):871. doi: 10.3390/jcm10040871. J Clin Med. 2021. PMID: 33672525 Free PMC article.
-
Massively parallel reporter perturbation assays uncover temporal regulatory architecture during neural differentiation.Nat Commun. 2022 Mar 21;13(1):1504. doi: 10.1038/s41467-022-28659-0. Nat Commun. 2022. PMID: 35315433 Free PMC article.
-
High resolution mapping of Twist to DNA in Drosophila embryos: Efficient functional analysis and evolutionary conservation.Genome Res. 2011 Apr;21(4):566-77. doi: 10.1101/gr.104018.109. Epub 2011 Mar 7. Genome Res. 2011. PMID: 21383317 Free PMC article.
-
A report on DNA sequence determinants in gene expression.Bioinformation. 2020 May 31;16(5):422-431. doi: 10.6026/97320630016422. eCollection 2020. Bioinformation. 2020. PMID: 32831525 Free PMC article.
References
-
- Berg OG, von Hippel PH 1987. Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J Mol Biol 193: 723–750 - PubMed
-
- Brown CD, Johnson DS, Sidow A 2007. Functional architecture and evolution of transcriptional elements that drive gene coexpression. Science 317: 1557–1560 - PubMed
-
- Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al. 2005. The transcriptional landscape of the mammalian genome. Science 309: 1559–1563 - PubMed
Publication types
MeSH terms
Substances
Associated data
- Actions
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Research Materials
Miscellaneous