Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005;6(8):R72.
doi: 10.1186/gb-2005-6-8-r72. Epub 2005 Aug 1.

Genome-wide promoter extraction and analysis in human, mouse, and rat

Affiliations

Genome-wide promoter extraction and analysis in human, mouse, and rat

Zhenyu Xuan et al. Genome Biol. 2005.

Abstract

Large-scale and high-throughput genomics research needs reliable and comprehensive genome-wide promoter annotation resources. We have conducted a systematic investigation on how to improve mammalian promoter prediction by incorporating both transcript and conservation information. This enabled us to build a better multispecies promoter annotation pipeline and hence to create CSHLmpd (Cold Spring Harbor Laboratory Mammalian Promoter Database) for the biomedical research community, which can act as a starting reference system for more refined functional annotations.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of conservation scores in promoter alignments. (a) Pairwise promoter alignments of human-rodent and mouse-rat non-orthologous genes (control set II) with different promoter GC content. (b) Pairwise promoter alignments of most conserved promoter pairs and randomly selected 1 kb sequence pairs (control set I). (c) Alignments of mouse-rat and human-rodent homologous promoter pairs. (d) Three-way promoter alignments of homologous promoter triplets and sequence triplets from control set II.
Figure 2
Figure 2
Flowchart of the pipeline to construct the promoter database. Ovals indicate data and rectangles the method. The ovals shaded gray represent the data stored in CSHLmpd.
Figure 3
Figure 3
Sensitivity and specificity of promoter prediction for CpG-island related and non-CpG-island related promoters in different gene sets. (a) 5,893 human genes with homologous rodent promoters. (b) All 8,949 human genes in the test set. The definition of different methods is described in the text and in Materials and methods.
Figure 4
Figure 4
Screen shots of the CSHLmpd user interface. (a) Gbrowse for genome-wide gene and promoter display. (b) Homologous promoter search and analysis.

References

    1. Cavin PR, Junier T, Bucher P. The Eukaryotic Promoter Database EPD. Nucleic Acids Res. 1998;26:353–357. doi: 10.1093/nar/26.1.353. - DOI - PMC - PubMed
    1. Suzuki Y, Yamashita R, Nakai K, Sugano S. DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs. Nucleic Acids Res. 2002;30:328–331. doi: 10.1093/nar/30.1.328. - DOI - PMC - PubMed
    1. Bajic VB, Tan SL, Suzuki Y, Sugano S. Promoter prediction analysis on the whole human genome. Nat Biotechnol. 2004;22:1467–1473. doi: 10.1038/nbt1032. - DOI - PubMed
    1. Scherf M, Klingenhoff A, Frech K, Quandt K, Schneider R, Grote K, Frisch M, Gailus-Durner V, Seidel A, Brack-Werner R, Werner T. First pass annotation of promoters on human chromosome 22. Genome Res. 2001;11:333–340. doi: 10.1101/gr.154601. - DOI - PMC - PubMed
    1. Diehn M, Sherlock G, Binkley G, Jin H, Matese JC, Hernandez-Boussard T, Rees CA, Cherry JM, Botstein D, Brown PO, Alizadeh AA. SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic Acids Res. 2003;31:219–223. doi: 10.1093/nar/gkg014. - DOI - PMC - PubMed

Publication types

LinkOut - more resources