Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jan 1;34(Database issue):D338-43.
doi: 10.1093/nar/gkj060.

ICDS database: interrupted CoDing sequences in prokaryotic genomes

Affiliations

ICDS database: interrupted CoDing sequences in prokaryotic genomes

Emmanuel Perrodou et al. Nucleic Acids Res. .

Abstract

Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, from in silico analysis to high-throughput proteomic projects. Here, we describe the Interrupted CoDing Sequence database containing ICDS detected by a similarity-based approach in 80 complete prokaryotic genomes. ICDS can be retrieved by species browsing or similarity searches via a web interface (http://www-bio3d-igbmc.u-strasbg.fr/ICDS/). The definition of each interrupted gene is provided as well as the ICDS genomic localization with the surrounding sequence. Furthermore, to facilitate the experimental characterization of ICDS, we propose optimized primers for re-sequencing purposes. The database will be regularly updated with additional data from ongoing sequenced genomes. Our strategy has been validated by three independent tests: (i) ICDS prediction on a benchmark of artificially created frameshifts, (ii) comparison of predicted ICDS and results obtained from the comparison of the two genomic sequences of Bacillus licheniformis strain ATCC 14580 and (iii) re-sequencing of 25 predicted ICDS of the recently sequenced genome of Mycobacterium smegmatis. This allows us to estimate the specificity and sensitivity (95 and 82%, respectively) of our program and the efficiency of primer determination.

PubMed Disclaimer

Figures

Figure 1
Figure 1
General principle of ICDS detection. (a) Pairs of proteins exhibiting at least one common homologue are retained. Such a pair can correspond to ICDS or to paralogous adjacent genes. (b) No significant similarity (E < 10−3) is detected between the components of a couple, those genes are considered as ICDS. (c) If a significant similarity exists, genes are considered as paralogues and discarded from the analysis.
Figure 2
Figure 2
Example of an ICDS corresponding to an authentic event of M.smegmatis. (a) Representation of the genomic region showing the best protein. (b) blastP alignment of the two predicted proteins. (c) Research for homology between the two predicted proteins defining an ICDS. (d) Limits extraction and prediction of primers. The region has been re-sequenced and results are shown. The sequence in black represents the region where the frameshift occurs. The underlined sequences represent the primers.

Similar articles

Cited by

References

    1. Bianchetti L., Thompson J.D., Lecompte O., Plewniak F., Poch O. vALId: validation of protein sequence quality based on multiple alignment data. J. Bioinform. Comput. Biol. 2005;3:1–19. - PubMed
    1. Weinstock G.M. Genomics and bacterial pathogenesis. Emerg. Infect. Dis. 2000;6:496–504. - PMC - PubMed
    1. Farabaugh P.J. Programmed translational frameshifting. Annu. Rev. Genet. 1996;30:507–528. - PubMed
    1. Baranov P.V., Gesteland R.F., Atkins J.F. Recoding: translational bifurcations in gene expression. Gene. 2002;286:187–201. - PubMed
    1. Baranov P.V., Gurvich O.L., Fayet O., Prere M.F., Miller W.A., Gesteland R.F., Atkins J.F., Giddings M.C. RECODE: a database of frameshifting, bypassing and codon redefinition utilized for gene expression. Nucleic Acids Res. 2001;29:264–267. - PMC - PubMed

Publication types

Substances