Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008;9(12):R168.
doi: 10.1186/gb-2008-9-12-r168. Epub 2008 Dec 2.

Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells

Affiliations

Assaying the regulatory potential of mammalian conserved non-coding sequences in human cells

Catia Attanasio et al. Genome Biol. 2008.

Abstract

Background: Conserved non-coding sequences in the human genome are approximately tenfold more abundant than known genes, and have been hypothesized to mark the locations of cis-regulatory elements. However, the global contribution of conserved non-coding sequences to the transcriptional regulation of human genes is currently unknown. Deeply conserved elements shared between humans and teleost fish predominantly flank genes active during morphogenesis and are enriched for positive transcriptional regulatory elements. However, such deeply conserved elements account for <1% of the conserved non-coding sequences in the human genome, which are predominantly mammalian.

Results: We explored the regulatory potential of a large sample of these 'common' conserved non-coding sequences using a variety of classic assays, including chromatin remodeling, and enhancer/repressor and promoter activity. When tested across diverse human model cell types, we find that the fraction of experimentally active conserved non-coding sequences within any given cell type is low (approximately 5%), and that this proportion increases only modestly when considered collectively across cell types.

Conclusions: The results suggest that classic assays of cis-regulatory potential are unlikely to expose the functional potential of the substantial majority of mammalian conserved non-coding sequences in the human genome.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Multi-tissue DNaseI hypersensitivity patterns of CNCSs. Shown are the locations of Chr21 CNCSs (top row, black vertical marks), 192 CNCSs tested for DHSs potential (second row, black vertical marks), and CNCSs encoding DHSs in one or more cell types (colored vertical marks). Absence of a colored vertical mark beneath a CNCSs from row 2 indicates lack of DHS potential in the tissue tested.
Figure 2
Figure 2
Unbiased mapping of DHSs and DHS CNCS overlaps. (a) Shown for a 1.7 Mb region of Chr21 are locations of CNCSs (top row, vertical red marks), locations of known genes and annotated transcripts, and maps of DNaseI hypersensitivity in intestinal (CACO2), lymphoid (GM06990), cervical (HeLa), and neural (SKnSH) cell types. A total of 416 distinct DHSs map to this region. (b) Results from 1,000 random trials of sample size 416 and corresponding overlap with CNCSs. The vertical arrow indicates actual result, which is within random expectation.
Figure 3
Figure 3
Chr21 CNCSs and control sequences. Shown are the mapping locations of the human chromosome 21 CNCSs and control non-genic non-transcribed sequences used in this study relative to known Chr21 genes: a) 2262 CNCSs described in Dermitzakis et al. [18]; b) 71 CNCSs randomly selected; c) 21 control single-copy sequences chosen randomly along Chr21; d) 23 CNCSs from Dermitzakis et al. coinciding with Sp1/Myc/p53 binding sites determined by Cawley et al. [30]; e) 44 putative promoter CNCSs.
Figure 4
Figure 4
Enhancer/repressor assay of CNCSs. (a, b) Boxplots showing the distribution of the luciferase activity for each subset of sequences in 293T (a) and Huh7 (b) cell lines. The proportion of putative regulatory elements of each subgroup is indicated at the bottom of both graphs. (c-f) Bar graphs showing the fold change of luciferase activity compared to the control sequence set for 71 selected CNCSs (c, d), 23 CNCS overlapping transcription factor binding sites (TFBSs) (e, f), in 293T and Huh7 cell lines, respectively. Red lines show ± 2-fold change threshold. Asterisks denote statistically significant change (one-sample t-test).
Figure 5
Figure 5
Assay of putative CNCS promoters. (a) Bar graph showing the normalized luciferase activity of putative promoter CNCSs in an episomal vector without minimal promoter. Bidirectionality was tested by cloning the sequences in the native or reverse orientation. Broken bars show values that are off scale. All CNCSs overlapping DHSs are included. (b) Pie chart showing the proportion of random CNCSs with enhancer, silencing, promoter or no activity.

References

    1. Miller W, Makova KD, Nekrutenko A, Hardison RC. Comparative genomics. Annu Rev Genomics Hum Genet. 2004;5:15–56. doi: 10.1146/annurev.genom.5.061903.180057. - DOI - PubMed
    1. Dermitzakis ET, Reymond A, Antonarakis SE. Conserved non-genic sequences - an unexpected feature of mammalian genomes. Nat Rev Genet. 2005;6:151–157. doi: 10.1038/nrg1527. - DOI - PubMed
    1. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Carninci P, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi: 10.1038/nature01262. - DOI - PubMed
    1. Boffelli D, Nobrega MA, Rubin EM. Comparative genomics at the vertebrate extremes. Nat Rev Genet. 2004;5:456–465. doi: 10.1038/nrg1350. - DOI - PubMed
    1. Pennacchio LA, Ahituv N, Moses AM, Prabhakar S, Nobrega MA, Shoukry M, Minovitsky S, Dubchak I, Holt A, Lewis KD, Plajzer-Frick I, Akiyama J, De Val S, Afzal V, Black BL, Couronne O, Eisen MB, Visel A, Rubin EM. In vivo enhancer analysis of human conserved non-coding sequences. Nature. 2006;444:499–502. doi: 10.1038/nature05295. - DOI - PubMed

Publication types

LinkOut - more resources