Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2004 Feb;14(2):313-8.
doi: 10.1101/gr.1759004. Epub 2004 Jan 12.

Analysis of multiple genomic sequence alignments: a web resource, online tools, and lessons learned from analysis of mammalian SCL loci

Affiliations
Comparative Study

Analysis of multiple genomic sequence alignments: a web resource, online tools, and lessons learned from analysis of mammalian SCL loci

Michael A Chapman et al. Genome Res. 2004 Feb.

Abstract

Comparative analysis of genomic sequences is becoming a standard technique for studying gene regulation. However, only a limited number of tools are currently available for the analysis of multiple genomic sequences. An extensive data set for the testing and training of such tools is provided by the SCL gene locus. Here we have expanded the data set to eight vertebrate species by sequencing the dog SCL locus and by annotating the dog and rat SCL loci. To provide a resource for the bioinformatics community, all SCL sequences and functional annotations, comprising a collation of the extensive experimental evidence pertaining to SCL regulation, have been made available via a Web server. A Web interface to new tools specifically designed for the display and analysis of multiple sequence alignments was also implemented. The unique SCL data set and new sequence comparison tools allowed us to perform a rigorous examination of the true benefits of multiple sequence comparisons. We demonstrate that multiple sequence alignments are, overall, superior to pairwise alignments for identification of mammalian regulatory regions. In the search for individual transcription factor binding sites, multiple alignments markedly increase the signal-to-noise ratio compared to pairwise alignments.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A long-range four-way mammalian alignment is capable of detecting all known SCL regulatory regions. A SynPlot representation of a human/dog/mouse/rat alignment of the SCL locus. Numbers on the horizontal axis do not refer to any one of the sequences in the alignment, but instead represent distance in nucleotides from the beginning of the aligned file (i.e., gaps in the alignment are counted). Numbers on the vertical axis represent an alignment score in a 100-bp window moved by 25-bp increments across the entire alignment. The horizontal black lines above the profile represent the human (H), dog (D), mouse (M), and rat (R) sequences and illustrate the position of gaps introduced to permit optimum alignment. Coding and noncoding exons are demonstrated by red and brown boxes, respectively, and repeats are illustrated by the smaller blue boxes. Regions of open chromatin and the +23 kb enhancer are shaded in green over the plot. Peaks of conserved sequence can be seen within each known regulatory region. Three well conserved peaks in regions of unknown function (-10 kb, +26 kb, and +30 kb) are also indicated. Pr1a, promoter 1a; Pr1b, promoter 1b.

References

    1. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410. - PubMed
    1. Barton, L.M., Göttgens, B., Gering, M., Gilbert, J.G., Grafham, D., Rogers, J., Bentley, D., Patient, R., and Green, A.R. 2001. Regulation of the stem cell leukemia (SCL) gene: A tale of two fishes. Proc. Natl. Acad. Sci. 98: 6747-6752. - PMC - PubMed
    1. Begley, C.G. and Green, A.R. 1999. The SCL gene: From case report to critical hematopoietic regulator. Blood 93: 2760-2770. - PubMed
    1. Boffelli, D., McAuliffe, J., Ovcharenko, D., Lewis, K.D., Ovcharenko, I., Pachter, L., and Rubin, E.M. 2003. Phylogenetic shadowing of primate sequences to find functional regions of the human genome. Science 299: 1391-1394. - PubMed
    1. Brudno, M., Do, C.B., Cooper, G.M., Kim, M.F., Davydov, E., Green, E.D., Sidow, A., and Batzoglou, S. 2003. LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 13: 721-731. - PMC - PubMed

WEB SITE REFERENCES

    1. http://bio.cse.psu.edu/pipmaker; PipMaker home page.
    1. http://lagan.stanford.edu; LAGAN home page.
    1. http://pipeline.lbl.gov/rat/; Berkeley Genome Pipeline.
    1. http://repeatmasker.genome.washington.edu/cgi-bin/RepeatMasker; RepeatMasker server.
    1. http://www-gsd.lbl.gov/vista/; Vista home page.

Publication types

Associated data