Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 May 15;16(10):1188-200.
doi: 10.1093/hmg/ddm066. Epub 2007 Apr 4.

Identification of polymorphic antioxidant response elements in the human genome

Affiliations

Identification of polymorphic antioxidant response elements in the human genome

Xuting Wang et al. Hum Mol Genet. .

Erratum in

  • Hum Mol Genet. 2007 Nov 15;16(22):2780

Abstract

Single nucleotide polymorphisms (SNPs) in transcription factor binding sites (TFBSs) may affect the binding of transcription factors, lead to differences in gene expression and phenotypes and therefore affect susceptibility to environmental exposure. We developed an integrated computational system for discovering functional SNPs in TFBSs in the human genome and predicting their impact on the expression of target genes. In this system, we (i) construct a position weight matrix (PWM) from a collection of experimentally discovered TFBSs; (ii) predict TFBSs in SNP sequences using the PWM and map SNPs to the upstream regions of genes; (iii) examine the evolutionary conservation of putative TFBSs by phylogenetic footprinting; (iv) prioritize candidate SNPs based on microarray expression profiles from tissues in which the transcription factor of interest is either deleted or over-expressed and (v) finally, analyze association of SNP genotypes with gene expression phenotypes. The application of our system has been tested to identify functional polymorphisms in the antioxidant response element (ARE), a cis-acting enhancer sequence found in the promoter region of many genes that encode antioxidant and Phase II detoxification enzymes/proteins. In response to oxidative stress, the transcription factor NRF2 (nuclear factor erythroid-derived 2-like 2) binds to AREs, mediating transcriptional activation of its responsive genes and modulating in vivo defense mechanisms against oxidative damage. Using our novel computational tools, we have identified a set of polymorphic AREs with functional evidence, showing the utility of our system to direct further experimental validation of genomic sequence variations that could be useful for identifying high-risk individuals.

PubMed Disclaimer

Conflict of interest statement

CONFLICTS OF INTEREST STATEMENT: The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
The workflow for in silico identification of polymorphic antioxidant response elements (AREs). Cylinders represent databases used, and rectangles contain computational applications. First, a position weight matrix (PWM) model is constructed from functional AREs and used to search for novel AREs in the input SNP sequences; Next, SNPs selected by PWM scoring are mapped into the upstream regions of genes; Then, a phylogentic footpring approach is used to examine evolutionary conservation in order to reduce excess false-positives, and microarray expression profiles are used to prioritize candidate SNPs that map to oxidative stress induced genes; Last, the genotype-expression association is computed to test SNP's function in silico.
Figure 2
Figure 2
Modeling AREs (NRF2 binding sites) based on 57 functional ARE sequences discovered by experimental methods. These 21-base sequences were used as input to our system to obtain (2A) position frequency matrix, (2B) position weight matrix, used for direct scoring of DNA sequences, and (2C) sequence logo, a visualization showing contributions of individual positions and individual nucleotides to the overall ARE motif, generated by the WebLogo program (http://weblogo.berkeley.edu). In the logo, the letters are sorted so the most common one is on top, the height of each letter is made proportional to its frequency, and the height of the entire stack is adjusted to signify the ‘information content’ (measured in bits) of the sequences at that position. The logo displays both significant residues and subtle sequence patterns.
Figure 3
Figure 3
The ΔPWM (difference in PWM scores of an allele-pair) scores of 4967 putative ARE SNPs grouped by the position in ARE motif. The mean (circle and solid line) and the standard deviation (Y error bars) of ΔPWM scores at each position were plotted, as well as the number of ARE SNPs (square and dash line) at each position.
Figure 4
Figure 4
A phylogenetic footprinting approach used to detect conserved response elements in multiple species. (A) Survey of AREs in upstream 5-kb regions of all genes in human, mouse, and rat genomes using position weight matrix, and found 2388 orthologous genes have AREs in upstream; (B) Identification of conserved AREs by examining the multiple sequence alignment of upstream 5-kb regions of orthologous genes. An example of multiple sequence alignment shows a conserved, functional ARE, and three non-conserved putative AREs in thioredoxin (TXN) gene upstream 5-kb region.
Figure 5
Figure 5
An example of association of SNP genotype with gene expression phenotype. The first three tracks, obtained from the HapMap Project genome browser, display chromosomal region, human gene, and genotyped SNPs and their relative allelic frequencies (little pie charts). The bottom track shows the linear regression plots of gene expression level against SNP genotype. Among 4 association tests, the expression levels of hemoglobin epsilon 1 (HBE1) gene are associated with genotypes of the putative ARE SNP (rs7130110) in upstream (4.7 kb). The presence of the “C” allele correlated with higher basal levels of HBE1 gene expression.

References

    1. Tomso DJ, Bell DA. In: Cellular and Molecular Toxicology. Vanden Heuvel J, editor. Elsevier; 2002. pp. 239–248.
    1. De Gobbi M, Viprakasit V, Hughes JR, Fisher C, Buckle VJ, Ayyub H, Gibbons RJ, Vernimmen D, Yoshinaga Y, de Jong P, et al. A regulatory SNP causes a human genetic disease by creating a new transcriptional promoter. Science. 2006;312:1215–1217. - PubMed
    1. Koivisto UM, Palvimo JJ, Janne OA, Kontula K. A single-base substitution in the proximal Sp1 site of the human low density lipoprotein receptor promoter as a cause of heterozygous familial hypercholesterolemia. Proc Natl Acad Sci U S A. 1994;91:10526–10530. - PMC - PubMed
    1. Bosma PJ, Chowdhury JR, Bakker C, Gantla S, de Boer A, Oostra BA, Lindhout D, Tytgat GN, Jansen PL, Oude Elferink RP, et al. The genetic basis of the reduced expression of bilirubin UDP-glucuronosyltransferase 1 in Gilbert's syndrome. N Engl J Med. 1995;333:1171–1175. - PubMed
    1. Grant DJ, Hall IJ, Eastmond DA, Jones IM, Bell DA. Bilirubin UDP-glucuronosyltransferase 1A1 (UGT1A1) gene promoter polymorphisms and HPRT, glycophorin A, and micronuclei mutant frequencies in human blood. Mutat Res. 2004;560:1–10. - PubMed

Publication types