Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2004 Dec;14(12):2448-56.
doi: 10.1101/gr.2964504. Epub 2004 Nov 15.

Noncoding regulatory sequences of Ciona exhibit strong correspondence between evolutionary constraint and functional importance

Affiliations
Comparative Study

Noncoding regulatory sequences of Ciona exhibit strong correspondence between evolutionary constraint and functional importance

David S Johnson et al. Genome Res. 2004 Dec.

Abstract

We show that sequence comparisons at different levels of resolution can efficiently guide functional analyses of regulatory regions in the ascidians Ciona savignyi and Ciona intestinalis. Sequence alignments of several tissue-specific genes guided discovery of minimal regulatory regions that are active in whole-embryo reporter assays. Using the Troponin I (TnI) locus as a case study, we show that more refined local sequence analyses can then be used to reveal functional substructure within a regulatory region. A high-resolution saturation mutagenesis in conjunction with comparative sequence analyses defined essential sequence elements within the TnI regulatory region. Finally, we found a significant, quantitative relationship between function and sequence divergence of noncoding functional elements. This work demonstrates the power of comparative sequence analysis between the two Ciona species for guiding gene regulatory experiments.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Five prime regions of eight Ciona loci. The black plots represent sequence identity between C. intestinalis and C. savignyi across each region. C. savignyi is the reference sequence for ad, and C. intestinalis is the reference sequence for eh. Green vertical bars correspond to annotated C. intestinalis exons. Blue arrows indicate direction of transcription of each gene. The red arrows indicate the native sequence present in the promoter fusion construct. Each promoter fusion construct contains significant regions of similarity between the two species. Expression patterns for C. savignyi constructs (io). Troponin I is strongly expressed in the tail muscle (i), Synaptotagmin is expressed in various neural lineages, including epidermal neurons (EN) (j) and spinal cord (SC) (k), α-tubulin is expressed in various tissues, including central nervous system (CNS) (l) and spinal cord (SC) (m), and ectopic expression frequently occurs in the mesenchyme (MC) (m), Noto9 is expressed specifically in the notochord (n,o). All embryos were fixed at the mid–late tailbud stages. Electroporated embryos always exhibit mosaic staining patterns. A variety of typical images are archived at http://mendel.stanford.edu/supplementarydata/johnson.
Figure 2.
Figure 2.
Ciona Troponin I gene expression. (ah) Expression of Troponin I transcript as determined by in situ hybridization. Expression is weak as early as gastrulation(a), is very strong by initial tailbud (b), and continues in the muscle through mid-tailbud (c), late tailbud (d), tadpole (e), and juvenile stages (fh). Note the expression in the trunk ventral cells (TVC; adult heart precursors), during the late tailbud stage in e. (in) Expression of lacZ reporter construct tn.pro.1793 as determined by X-gal staining. Expression is strong in mid tailbud (i,j), and continues through late tailbud (k), early tadpole (l), and juvenile stages (m). Ectopic expression in the mesenchyme is common (arrowhead, i). Expression continues in the tail muscle from the late tailbud (k) to the early tadpole (l). Expression also occurs in the TVC (arrowhead, m) and juvenile heart (arrowhead, n).
Figure 3.
Figure 3.
Activity of Troponin I–lacZ promoter constructs bearing various deletions, in context with sequence identity and exon structure of the locus. (a) Plot of sequence identity between C. savignyi and C. intestinalis in the promoter construct pro.1793 (cf. Fig 1a). Green shading represents exons, the blue bar represents the MSRR, and the red bars indicate highly constrained regions (90% id; >20 bp). (b) Activity of deletion constructs, with tn.pro.1793; black lines denote sequence present in the constructs. The table lists activities of the constructs as explained below. Truncation of 329 bp from the 5′-end of the tn.pro.1793 construct completely eliminates expression in the tail muscle. Deletion of internal regions, including annotated exon 1, exons 1 and 2, and exons 1–3, has no effect. The gray line (tn.ci.pro.921) denotes the construct derived from C. intestinalis. (c) Activity of minimally sufficient regulatory region (MSRR) constructs. Fusion of the 5′-most 363 bp of tn.pro.1793 to a heterologous Forkhead basal promoter has strong activity. Deletion of the Forkhead basal promoter has no effect on lacZ expression. The reversed insert has no activity. (df) Relative activities of constructs. We classify constructs as (d) +++, representing 50%–100% animals staining, with many of the animals staining strongly in a majority of the tail muscle cells; (e) ++, representing 25%–50% of the animals staining in the tail muscle, and the minority of these staining a majority of the tail muscle cells; and (f) +, with <25% of the animals staining, and none of the animals staining the majority of the tail muscle cells; “-” indicates a construct that never showed staining. Constructs that were mostly “-” but stained weakly on a single occasion are denoted +/-.
Figure 4.
Figure 4.
Deletion analysis of the minimally sufficient regulatory region. (a) The CHAOS algorithm returns four major regions of local conservation between C. savignyi and C. intestinalis. Note that this level of conservation stands out over neutrally evolving regions. (b) Plot of sequence identity between the C. savignyi and C. intestinalis TnI enhancer regions. The yellow shaded boxes represent the four CHAOS matches. (c) Horizontal lines denote sequence present in the constructs, activity is noted as in Figure 3. CHAOS matches CH1, CH2, and either of CH3/CH4 are necessary for expression of lacZ reporter in larval tail muscle. Concatenation of the four CHAOS matches is sufficient for weak lacZ expression.
Figure 5.
Figure 5.
Conservation, predicted binding sites, and activity upon mutagenesis in the TnMSRR. (a) Plot of predicted binding sites for four common vertebrate muscle regulators for the TnI locus. The right y-axis shows the log-odds score of the site. The black line plots sequence identity (left y-axis) between C. savignyi and C. intestinalis. The location of the enhancer (tn.msrr.363) is shown. (b) Plot of conservation versus activity upon mutagenesis, with predicted binding sites. The left y-axis is sequence identity (black) and activity on mutagenesis (red), and the right y-axis is the log-odds score of predicted sites. The black line plots sequence identity between the C. savignyi and C. intestinalis enhancers. The red line represents the activity (number of embryos staining/total embryos) of the lacZ reporter when a 20-bp window spanning that position is randomized. The yellow shading represents CHAOS hits, the green line is a conserved TATA-box, the pink arrow is the predicted transcription start site, and the blue lines are the three myf predictions. (c) Scatterplot of activity upon mutagenesis versus distance. The distance within each 20-mer is calculated by Kimura's two-parameter model. Mutagenized windows with activity >50% are not included, nor are poorly aligned sequences. (d) Predicted myf sites in C. savignyi and C. intestinalis, aligned with the two relevant scrambled windows. A predicted site remains in scrambled window 1, which retains activity, and no sites are predicted in scrambled window 2, which disrupts activity. Reverting one of the predicted sites back to wild type restores function (reverted window 2). Sequence of C. savignyi (Cs) is shown on top, with capital letters indicating identity with C. intestinalis. Blue sites are predicted on the forward strand, red sites on the reverse. All images and sequences are archived for retrieval at http://mendel.stanford.edu/supplementarydata/johnson.

References

    1. Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne, J.D., Amanatides, P.G., Scherer, S.E., Li, P.W., Hoskins, R.A., Galle, R.F., et al. 2000. The genome sequence of Drosophila melanogaster. Science 287: 2185-2195. - PubMed
    1. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402. - PMC - PubMed
    1. Bertrand, V., Hudson, C., Caillol, D., Popovici, C., and Lemaire, P. 2003. Neural tissue in ascidian embryos is induced by FGF9/16/20, acting via a combination of maternal GATA and Ets transcription factors. Cell 115: 615-627. - PubMed
    1. Brudno, M., Chapman, M., Gottgens, B., Batzoglou, S., and Morgenstern, B. 2003a. Fast and sensitive multiple alignment of large genomic sequences. BMC Bioinformatics 4: 66. - PMC - PubMed
    1. Brudno, M., Do, C.B., Cooper, G.M., Kim, M.F., Davydov, E., Green, E.D., Sidow, A., and Batzoglou, S. NISC Comparative Sequencing Program. 2003b. LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 13: 721-731. - PMC - PubMed

Web site references

    1. http://genome.jgi-psf.org/ciona4/ciona4.home.html; C. intestinalis genome and annotation.
    1. http://ghost.zool.kyoto-u.ac.jp/indexr1.html; C. intestinalis in situ expression database.
    1. http://mendel.stanford.edu/supplementarydata/johnson; Source for Supplemental data.
    1. http://www.broad.mit.edu/annotation/ciona/; C. savignyi genome sequence.
    1. http://www.fruitfly.org/seq_tools/promoter.html; promoter prediction at Flybase.

Publication types

LinkOut - more resources