BMC Bioinformatics. 2009 Dec 18;10:432.
doi: 10.1186/1471-2105-10-432.

Algorithms for locating extremely conserved elements in multiple sequence alignments

Huei-Hun E Tseng et al. BMC Bioinformatics.

Abstract

Background: In 2004, Bejerano et al. announced the startling discovery of hundreds of "ultraconserved elements", long genomic sequences perfectly conserved across human, mouse, and rat. Their announcement stimulated a flurry of subsequent research.

Results: We generalize the notion of ultraconserved element in a natural way from extraordinary human-rodent conservation to extraordinary conservation over an arbitrary set of species. We call these "Extremely Conserved Elements". There is a linear time algorithm to find all such Extremely Conserved Elements in any multiple sequence alignment, provided that the conservation is required to be across all the aligned species. For the general case of conservation across an arbitrary subset of the aligned species, we show that the question of whether there exists an Extremely Conserved Element is NP-complete. We illustrate the linear time algorithm by cataloguing all 177 Extremely Conserved Elements in the currently available 44-vertebrate whole-genome alignment, and point out some of the characteristics of these elements.
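The all-species case described above can be illustrated with a single left-to-right pass over the alignment columns. The following is a minimal sketch under simplifying assumptions, not the authors' algorithm: it requires perfect identity in every row (the paper's definition appears to take parameters, as in EC(40, 100, 0.8), whose meaning is not given in this abstract), it treats a gap character `-` as non-conserved, and the function name `conserved_runs` and the parameter `min_len` are hypothetical.

```python
from typing import List, Tuple


def conserved_runs(alignment: List[str], min_len: int) -> List[Tuple[int, int]]:
    """Return half-open (start, end) column intervals in which every row of
    the alignment carries the same non-gap character, keeping only maximal
    runs of at least min_len columns.

    Simplifying assumption: perfect identity across all rows. Runs in time
    linear in the total alignment size (rows x columns).
    """
    if not alignment:
        return []
    width = len(alignment[0])
    if any(len(row) != width for row in alignment):
        raise ValueError("all alignment rows must have the same length")

    runs: List[Tuple[int, int]] = []
    run_start = None  # first column of the current conserved run, if any
    for col in range(width):
        first = alignment[0][col]
        conserved = first != "-" and all(row[col] == first for row in alignment)
        if conserved:
            if run_start is None:
                run_start = col
        else:
            # The current run (if any) ends just before this column.
            if run_start is not None and col - run_start >= min_len:
                runs.append((run_start, col))
            run_start = None
    # Close a run that extends to the last column.
    if run_start is not None and width - run_start >= min_len:
        runs.append((run_start, width))
    return runs


if __name__ == "__main__":
    toy = ["ACGTACGT-A", "ACGTTCGT-A", "ACGTACGTCA"]  # made-up toy alignment
    print(conserved_runs(toy, 3))
```

Each column is inspected once, so the cost grows linearly with the size of the alignment. Relaxing the identity requirement or restricting the check to a subset of rows would need a different procedure, and the abstract notes that the arbitrary-subset case is NP-complete.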

Conclusions: The NP-completeness in the case of conservation across an arbitrary subset of the aligned species implies that it is unlikely an efficient algorithm exists for this general case. Despite this fact, for the interesting case of conservation across all or most of the aligned species, our algorithm is efficient enough to be practical. The 177 Extremely Conserved Elements that we catalog demonstrate many of the characteristics of the original ultraconserved elements of Bejerano et al.

Figures

Figure 1. Distribution of 177 EC(40, 100, 0.8) elements by human chromosome and location with respect to human genes. See text for the explanation of location labels.

References

    1. Bejerano G, Pheasant M, Makunin I, Stephen S, Kent WJ, Mattick JS, Haussler D. Ultraconserved elements in the human genome. Science. 2004;304:1321–1325. doi: 10.1126/science.1098119.
    2. Derti A, Roth FP, Church GM, Wu C-t. Mammalian ultraconserved elements are strongly depleted among segmental duplications and copy number variants. Nature Genetics. 2006;38:1216–1220. doi: 10.1038/ng1888.
    3. Sakuraba Y, Kimura T, Masuya H, Noguchi H, Sezutsu H, Takahasi KR, Toyoda A, Fukumura R, Murata T, Sakaki Y, Yamamura M, Wakana S, Noda T, Shiroishi T, Gondo Y. Identification and characterization of new long conserved noncoding sequences in vertebrates. Mammalian Genome. 2008;19:703–712. doi: 10.1007/s00335-008-9152-7.
    4. Visel A, Prabhakar S, Akiyama JA, Shoukry M, Lewis KD, Holt A, Plajzer-Frick I, Afzal V, Rubin EM, Pennacchio LA. Ultraconservation identifies a small subset of extremely constrained developmental enhancers. Nature Genetics. 2008;40:158–160. doi: 10.1038/ng.2007.55.
    5. Prabhakar S, Poulin F, Shoukry M, Afzal V, Rubin EM, Couronne O, Pennacchio LA. Close sequence comparisons are sufficient to identify human cis-regulatory elements. Genome Research. 2006;16:855–863. doi: 10.1101/gr.4717506.