ICDS database: interrupted CoDing sequences in prokaryotic genomes
- PMID: 16381882
- PMCID: PMC1347423
- DOI: 10.1093/nar/gkj060
ICDS database: interrupted CoDing sequences in prokaryotic genomes
Abstract
Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, from in silico analysis to high-throughput proteomic projects. Here, we describe the Interrupted CoDing Sequence database containing ICDS detected by a similarity-based approach in 80 complete prokaryotic genomes. ICDS can be retrieved by species browsing or similarity searches via a web interface (http://www-bio3d-igbmc.u-strasbg.fr/ICDS/). The definition of each interrupted gene is provided as well as the ICDS genomic localization with the surrounding sequence. Furthermore, to facilitate the experimental characterization of ICDS, we propose optimized primers for re-sequencing purposes. The database will be regularly updated with additional data from ongoing sequenced genomes. Our strategy has been validated by three independent tests: (i) ICDS prediction on a benchmark of artificially created frameshifts, (ii) comparison of predicted ICDS and results obtained from the comparison of the two genomic sequences of Bacillus licheniformis strain ATCC 14580 and (iii) re-sequencing of 25 predicted ICDS of the recently sequenced genome of Mycobacterium smegmatis. This allows us to estimate the specificity and sensitivity (95 and 82%, respectively) of our program and the efficiency of primer determination.
Figures


Similar articles
-
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].Yi Chuan Xue Bao. 2004 May;31(5):431-43. Yi Chuan Xue Bao. 2004. PMID: 15478601 Chinese.
-
Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors?Genome Biol. 2007;8(2):R20. doi: 10.1186/gb-2007-8-2-r20. Genome Biol. 2007. PMID: 17295914 Free PMC article.
-
MICheck: a web tool for fast checking of syntactic annotations of bacterial genomes.Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W471-9. doi: 10.1093/nar/gki498. Nucleic Acids Res. 2005. PMID: 15980515 Free PMC article.
-
GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences.Nucleic Acids Res. 2013 Jan;41(Database issue):D152-6. doi: 10.1093/nar/gks1062. Epub 2012 Nov 17. Nucleic Acids Res. 2013. PMID: 23161689 Free PMC article.
-
MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups.Nucleic Acids Res. 2007 Jan;35(Database issue):D343-6. doi: 10.1093/nar/gkl978. Epub 2006 Nov 29. Nucleic Acids Res. 2007. PMID: 17135196 Free PMC article.
Cited by
-
Comparative geno-plasticity analysis of Mycoplasma bovis HB0801 (Chinese isolate).PLoS One. 2012;7(5):e38239. doi: 10.1371/journal.pone.0038239. Epub 2012 May 31. PLoS One. 2012. PMID: 22693604 Free PMC article.
-
High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using Mycobacterium tuberculosis as an example.BMC Genomics. 2008 Jul 2;9:316. doi: 10.1186/1471-2164-9-316. BMC Genomics. 2008. PMID: 18597682 Free PMC article.
-
Translational recoding in archaea.Extremophiles. 2012 Nov;16(6):793-803. doi: 10.1007/s00792-012-0482-8. Epub 2012 Sep 27. Extremophiles. 2012. PMID: 23015064 Review.
-
Design and evaluation of Actichip, a thematic microarray for the study of the actin cytoskeleton.BMC Genomics. 2007 Aug 29;8:294. doi: 10.1186/1471-2164-8-294. BMC Genomics. 2007. PMID: 17727702 Free PMC article.
-
Detecting the molecular scars of evolution in the Mycobacterium tuberculosis complex by analyzing interrupted coding sequences.BMC Evol Biol. 2008 Mar 6;8:78. doi: 10.1186/1471-2148-8-78. BMC Evol Biol. 2008. PMID: 18325090 Free PMC article.
References
-
- Bianchetti L., Thompson J.D., Lecompte O., Plewniak F., Poch O. vALId: validation of protein sequence quality based on multiple alignment data. J. Bioinform. Comput. Biol. 2005;3:1–19. - PubMed
-
- Farabaugh P.J. Programmed translational frameshifting. Annu. Rev. Genet. 1996;30:507–528. - PubMed
-
- Baranov P.V., Gesteland R.F., Atkins J.F. Recoding: translational bifurcations in gene expression. Gene. 2002;286:187–201. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases