Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 Sep 22;33(17):5437-45.
doi: 10.1093/nar/gki853. Print 2005.

Human-zebrafish non-coding conserved elements act in vivo to regulate transcription

Affiliations
Comparative Study

Human-zebrafish non-coding conserved elements act in vivo to regulate transcription

Jordan T Shin et al. Nucleic Acids Res. .

Abstract

Whole genome comparisons of distantly related species effectively predict biologically important sequences--core genes and cis-acting regulatory elements (REs)--but require experimentation to verify biological activity. To examine the efficacy of comparative genomics in identification of active REs from anonymous, non-coding (NC) sequences, we generated a novel alignment of the human and draft zebrafish genomes, and contrasted this set to existing human and fugu datasets. We tested the transcriptional regulatory potential of candidate sequences using two in vivo assays. Strict selection of non-genic elements which are deeply conserved in vertebrate evolution identifies 1744 core vertebrate REs in human and two fish genomes. We tested 16 elements in vivo for cis-acting gene regulatory properties using zebrafish transient transgenesis and found that 10 (63%) strongly modulate tissue-specific expression of a green fluorescent protein reporter vector. We also report a novel quantitative enhancer assay with potential for increased throughput based on normalized luciferase activity in vivo. This complementary system identified 11 (69%; including 9 of 10 GFP-confirmed elements) with cis-acting function. Together, these data support the utility of comparative genomics of distantly related vertebrate species to identify REs and provide a scaleable, in vivo quantitative assay to define functional activity of candidate REs.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Expression vector constructs use for zebrafish transgenesis. The GFP construct is comprised of the zCMLC2 promoter (blue) driving expression of green fluorescent protein (green). Cloned sequences were inserted into the gateway recombination sequence (red) 5′ to the expression cassette and assessed for their ability to modulate transcriptional activity from this promoter. Similarly, the GL2 firefly luciferase construct consisted of an expression module containing the zCMLC2 promoter and the luciferase coding sequence (gray). The control pGM:RL plasmid, which contained the renilla luciferase coding sequence (purple), was used for normalization.
Figure 2
Figure 2
Bioinformatic characterization of HZ NC evolutionarily conserved regions and overlap with other genomic comparisons. (A) Breakdown and characterization of the 6.5 × 104 conserved regions shared between the human and zebrafish genomes. While most of the conserved sequences correspond to known genes and transcripts, 7% represent conserved, NC sequences. (B) A Venn diagram comparison of HZ, human:pufferfish and human:UC NC datasets. Where one species is indicated, the comparison is between human genomic sequence and that species. The UC set is that reported by Bejerano et al. (10). For each dataset, the number of total elements is indicated in parentheses while the percentage of total elements that are unique to that set is indicated in italics. The numbers within each segment of the Venn diagram represents the absolute number of elements.
Figure 3
Figure 3
Quantitative analysis of reporter expression demonstrates that HZ NCECR regulate transcription in transgenic zebrafish. Sixteen conserved NC elements (11 from human [black], 1 from zebrafish [blue] and 2 paralogous sequences from both species), as well as negative control (Neg Ctrl, n = 8) and positive control (PC, n = 3) sequences were cloned for analysis. The percentage of embryos exhibiting extra-cardiac GFP expression are plotted in the horizontal axis (with standard error bars); the red vertical line indicates an approximate threshold delineating positive results which are also marked by an asterisk (*). For GFP-positive sequences, tissues with significant fluorescence in non-cardiac tissues, those tissues/organs which reproducibly demonstrated ectopic fluorescence are denoted with colored boxes. In those embryos where ectopic fluorescence did not exceed the statistical threshold for significance, no tissues are highlighted (gray boxes). Quantiative assessment of luciferase activity is presented in the LR column. LR values shown in this figure are the mean values of at least three samples. Experimental LRs which exceeded the mean of the pooled negative control LR value by 2 SD are highlighted in red.
Figure 4
Figure 4
Spatial localization of a GFP expression in transgenic zebrafish. Panels A1–D1 show GFP-images only; panels A2–D2 are merged GFP and bright-field images to assist in the spatial localization of the fluorescent signal. (A) GFP expression construct containing conserved sequence HZ.H2 demonstrates ectopic fluorescence in the dermal cells (yellow arrows). (B) Skeletal myocytes (elongated cells) and skin fluorescence (round cells) are identified in a 3 dpf transgenic embryo injected with the expression construct containing the HZ.H7 conserved sequence. (C) Brain, retina and neural tube fluorescence following injection with HZ.H11:cMLC2:GFP construct. In each case the bright fluorescence between the yolk sac and head represents intrinsic, positive control GFP expression in cardiac myocytes (indicated by the red asterisk in panels A1–C1) directed by the cMLC2 promoter. (D) A higher power photomicrograph identifies axonal projections (magnified view shown in the inset, with yellow arrows) and central neuronal expression of GFP in the forebrain and hindbrain of day 3 developing embryos transgenic for the HZ.H6 conserved NC element. Panel E and F show early (<30 hpf) expression of reporter genes in the skeletal myocytes (panel E) and dermal cells (panel F) prior to hatching.

References

    1. Waterston R.H., Lindblad-Toh K., Birney E., Rogers J., Abril J.F., Agarwal P., Agarwala R., Ainscough R., Alexandersson M., An P., et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. - PubMed
    1. Ovcharenko I., Stubbs L., Loots G.G. Interpreting mammalian evolution using Fugu genome comparisons. Genomics. 2004;84:890–895. - PubMed
    1. Barton L.M., Gottgens B., Gering M., Gilbert J.G., Grafham D., Rogers J., Bentley D., Patient R., Green A.R. Regulation of the stem cell leukemia (SCL) gene: a tale of two fishes. Proc. Natl Acad. Sci. USA. 2001;98:6747–6752. - PMC - PubMed
    1. Loots G.G., Locksley R.M., Blankespoor C.M., Wang Z.E., Miller W., Rubin E.M., Frazer K.A. Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science. 2000;288:136–140. - PubMed
    1. Pennacchio L.A., Olivier M., Hubacek J.A., Cohen J.C., Cox D.R., Fruchart J.C., Krauss R.M., Rubin E.M. An apolipoprotein influencing triglycerides in humans and mice revealed by comparative sequencing. Science. 2001;294:169–173. - PubMed

Publication types