Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2001 Mar;11(3):333-40.
doi: 10.1101/gr.154601.

First pass annotation of promoters on human chromosome 22

Affiliations
Comparative Study

First pass annotation of promoters on human chromosome 22

M Scherf et al. Genome Res. 2001 Mar.

Abstract

The publication of the first almost complete sequence of a human chromosome (chromosome 22) is a major milestone in human genomics. Together with the sequence, an excellent annotation of genes was published which certainly will serve as an information resource for numerous future projects. We noted that the annotation did not cover regulatory regions; in particular, no promoter annotation has been provided. Here we present an analysis of the complete published chromosome 22 sequence for promoters. A recent breakthrough in specific in silico prediction of promoter regions enabled us to attempt large-scale prediction of promoter regions on chromosome 22. Scanning of sequence databases revealed only 20 experimentally verified promoters, of which 10 were correctly predicted by our approach. Nearly 40% of our 465 predicted promoter regions are supported by the currently available gene annotation. Promoter finding also provides a biologically meaningful method for "chromosomal scaffolding", by which long genomic sequences can be divided into segments starting with a gene. As one example, the combination of promoter region prediction with exon/intron structure predictions greatly enhances the specificity of de novo gene finding. The present study demonstrates that it is possible to identify promoters in silico on the chromosomal level with sufficient reliability for experimental planning and indicates that a wealth of information about regulatory regions can be extracted from current large-scale (megabase) sequencing projects. Results are available on-line at http://genomatix.gsf.de/chr22/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Correlation analysis of PromoterInspector promoter regions with annotated gene starts on chromosome 22 (+ strand shown). The y-axis indicates the total number of matches found in relative distance to the annotated gene start. Values on the x-axis with a negative sign mark distances to promoter regions which are located upstream of an annotated gene start, while positive values mark distances to promoter regions which are located downstream from an annotated gene start. The column at distance value 0 marks the number of promoter regions which overlap with an annotated gene start. The range accepted as tolerance is highlighted in black. (A), known and related genes as defined by Dunham et al. (1999). (B), predicted genes as defined by Dunham et al. (1999).
Figure 1
Figure 1
Correlation analysis of PromoterInspector promoter regions with annotated gene starts on chromosome 22 (+ strand shown). The y-axis indicates the total number of matches found in relative distance to the annotated gene start. Values on the x-axis with a negative sign mark distances to promoter regions which are located upstream of an annotated gene start, while positive values mark distances to promoter regions which are located downstream from an annotated gene start. The column at distance value 0 marks the number of promoter regions which overlap with an annotated gene start. The range accepted as tolerance is highlighted in black. (A), known and related genes as defined by Dunham et al. (1999). (B), predicted genes as defined by Dunham et al. (1999).

References

    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. - PubMed
    1. Burge C, Karlin S. Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997;268:78–94. - PubMed
    1. Cross CH, Bird AP. CpG islands and genes. Curr Opin Genet Dev. 1995;5:309–314. - PubMed
    1. Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt A, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, et al. The DNA sequence of human chromosome 22. Nature. 1999;402:489–495. - PubMed
    1. Fickett JW, Hatzigeorgiou AC. Eukaryotic promoter recognition. Genome Res. 1997;7:861–878. - PubMed

Publication types

LinkOut - more resources