Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007:395:237-54.

Mulan: multiple-sequence alignment to predict functional elements in genomic sequences

Affiliations

Mulan: multiple-sequence alignment to predict functional elements in genomic sequences

Gabriela G Loots et al. Methods Mol Biol. 2007.

Abstract

Multiple sequence alignment analysis is a powerful approach for translating the evolutionary selective power into phylogenetic relationships to localize functional coding and noncoding genomic elements. The tool Mulan (http://mulan.dcode.org/) has been designed to effectively perform multiple comparisons of genomic sequences necessary to facilitate bioinformatic-driven biological discoveries. The Mulan network server is capable of comparing both closely and distantly related genomes to identify conserved elements over a broad range of evolutionary time. Several novel algorithms are brought together in this tool: the tba multisequence aligner program used to rapidly identify local sequence conservation and the multiTF program to detect evolutionarily conserved transcription factor binding sites in alignments. Mulan is integrated with the ERC Browser, the UCSC Genome Browser for quick uploads of available sequences and supports two-way communication with the GALA database to overlay GALA functional genome annotation with sequence conservation profiles. Local multiple alignments computed by Mulan ensure reliable representation of short- and large-scale genomic rearrangements in distant organisms. Recently, we have also introduced the ability to handle duplications to permit the reliable reconstruction of evolutionary events that underlie the genome sequence data. Here, we describe the main features of the Mulan tool that include the interactive modification of critical conservation parameters, visualization options, and dynamic access to sequence data from visual graphs for flexible and easy-to-perform analysis of differentially evolving genomic regions.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Accessing the Mulan tool from the homepage at http://mulan.dcode.org (A) or from the ECR Browser “Synteny/Alignments” link (B).
Fig. 2
Fig. 2
Each sequence can be pasted in, in FASTA format, uploaded as a FASTA file, or entered as an accession number along with the available annotation (A). Alternatively, sequences can be fetched from the UCSC Genome Browser individually using the “Upload” function (A), or in groups (Batch Upload System) Browser (B). Once sequences have been uploaded, the program acknowledges the receipt (C).
Fig. 3
Fig. 3
Mulan defines a guiding phylogenetic tree before proceeding with the detailed sequence alignment. The user has the option to submit modifications to this tree.
Fig. 4
Fig. 4
A completed alignment request results in a “summary page” that provides links to the interactive visualization platform, pairwise dynamic plots, dot plots, annotation files, sequence files, and a portal to the transcription factor binding site analysis software, MultiTF.
Fig. 5
Fig. 5
Mulan interactive alignment customization options (A) and graphical display of alignments (B).
Fig. 6
Fig. 6
Contig ordering based on homology to the reference sequence. The top layer of shaded lines indicates the location of contigs from a second sequence aligned to the base sequence where right-turned triangles specify forward strand alignments, and left-turned triangles correspond to reverse strand alignments.
Fig. 7
Fig. 7
Mulan alignment analysis options: color density by interspecies conservation (A) and summary of conservation display (B).
Fig. 8
Fig. 8
MultiTF portal available from Mulan “summary page.” First menu allows users to define the types of transcription factor binding site matrices to be used in the analysis, along with similarity thresholds (A). Next, the user selects either all or a subset of available transcription factors from the TRANSFAC library (B). Results are provided on an interactive summary page.
Fig. 9
Fig. 9
Transcription factor binding sites can be juxtaposed on the Mulan multiple alignment graphics and several clustering and visualization options are provided for customized analysis (A). Similarly, the binding sites can be visualized within the textual alignment (B).

Similar articles

Cited by

References

    1. Pennacchio LA, Olivier M, Hubacek JA, et al. An apolipoprotein influencing triglycerides in humans and mice revealed by comparative sequencing. Science. 2001;294:169–173. - PubMed
    1. Gilligan P, Brenner S, Venkatesh B. Fugu and human sequence comparison identifies novel human genes and conserved non-coding sequences. Gene. 2002;294:35–44. - PubMed
    1. Elnitski L, Li J, Noguchi CT, Miller W, Hardison R. A negative cis-element regulates the level of enhancement by hypersensitive site 2 of the beta-globin locus control region. J. Biol. Chem. 2001;276:6289–6298. - PubMed
    1. Loots GG, Locksley RM, Blankespoor CM, et al. Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science. 2000;288:136–140. - PubMed
    1. Mayor C, Brudno M, Schwartz JR, et al. VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics. 2000;16:1046–1047. - PubMed

LinkOut - more resources