Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011:1:53.
doi: 10.1038/srep00053. Epub 2011 Aug 3.

Putative essential and core-essential genes in Mycoplasma genomes

Affiliations

Putative essential and core-essential genes in Mycoplasma genomes

Yan Lin et al. Sci Rep. 2011.

Abstract

Mycoplasma, which was used to create the first "synthetic life", has been an important species in the emerging field, synthetic biology. However, essential genes, an important concept of synthetic biology, for both M. mycoides and M. capricolum, as well as 14 other Mycoplasma with available genomes, are still unknown. We have developed a gene essentiality prediction algorithm that incorporates information of biased gene strand distribution, homologous search and codon adaptation index. The algorithm, which achieved an accuracy of 80.8% and 78.9% in self-consistence and cross-validation tests, respectively, predicted 5880 essential genes in the 16 Mycoplasma genomes. The intersection set of essential genes in available Mycoplasma genomes consists of 153 core essential genes. The predicted essential genes (available from pDEG, tubic.tju.edu.cn/pdeg) and the proposed algorithm can be helpful for studying minimal Mycoplasma genomes as well as essential genes in other genomes.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The flow chart of the proposed algorithm in training and prediction phases.
Figure 2
Figure 2. Accuracy indices and the ROC curve for the current algorithm.
(A) Sensitivity, specificity and positive prediction rate in relation to the parameter s defined in eq. (8). The value of s (ss0) was chosen such that the sensitivity Sn is roughly equal to the specificity Sp. (B) The ROC curve (blue) and AUC (Area Under Curve). The red line denotes an extrapolation of the ROC curve to the point where 1 − Sp = 1. The AUC value is found to be 0.812.
Figure 3
Figure 3. The phylogenetic tree of the 18 Mycoplasma genomes based on the 16S rRNA.
The intersection set of (A) genes and (B) essential genes in the 18 Mycoplasma genomes. The numbers on the left indicate gene numbers in intersection sets between genomes, whereas those on the right denote total gene number in a genome. The intersection set of the 5880 predicted essential genes and those experimentally identified in M. genitalium and M. pulmonis genomes consists of 153 core essential genes for the Mycoplasma family.
Figure 4
Figure 4. Functional classification of genes in the M. genitalium genome based on COG.
(A) COG classification of core-essential, non-core-essential and non-essential genes in M. genitalium. (B) Distribution of COG classification of the 153 core-essential genes.

Similar articles

Cited by

References

    1. Gibson D. G. et al.. Creation of a bacterial cell controlled by a chemically synthesized genome. Science 329, 52–6 (2010). - PubMed
    1. Pennisi E. Synthetic genome brings new life to bacterium. Science 328, 958–9 (2010). - PubMed
    1. Itaya M. An estimation of minimal genome size required for life. FEBS Lett 362, 257–60 (1995). - PubMed
    1. Koonin E. V. How many genes can make a cell: the minimal-gene-set concept. Annu Rev Genomics Hum Genet 1, 99–116 (2000). - PMC - PubMed
    1. Editorial. . Unbottling the genes. Nat Biotechnol 27, 1059 (2009). - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources