Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 6:7:41923.
doi: 10.1038/srep41923.

A Noise Trimming and Positional Significance of Transposon Insertion System to Identify Essential Genes in Yersinia pestis

Affiliations

A Noise Trimming and Positional Significance of Transposon Insertion System to Identify Essential Genes in Yersinia pestis

Zheng Rong Yang et al. Sci Rep. .

Abstract

Massively parallel sequencing technology coupled with saturation mutagenesis has provided new and global insights into gene functions and roles. At a simplistic level, the frequency of mutations within genes can indicate the degree of essentiality. However, this approach neglects to take account of the positional significance of mutations - the function of a gene is less likely to be disrupted by a mutation close to the distal ends. Therefore, a systematic bioinformatics approach to improve the reliability of essential gene identification is desirable. We report here a parametric model which introduces a novel mutation feature together with a noise trimming approach to predict the biological significance of Tn5 mutations. We show improved performance of essential gene prediction in the bacterium Yersinia pestis, the causative agent of plague. This method would have broad applicability to other organisms and to the identification of genes which are essential for competitiveness or survival under a broad range of stresses.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Noise trimming of the input1 dataset.
The horizontal axis represents the log of the number of transposon insertions per gene. The vertical axis stands for the frequency of the log of transposon insertion number per gene. The vertical dotted line indicates the threshold corresponding to a critical p value 0.05. Genes whose insertion counts were below this threshold were treated as Type II essential genes.
Figure 2
Figure 2. Prediction of essential genes for input1 using DEM.
The curve shows the relationship between log MF values and the corresponding false discovery rates (q values). The triangle indicates the boundary of separation between essential genes and non-essential genes. Bars in blue represent the density of the log MF values. The horizontal axis stands for log value of MF and the vertical axis stands for the frequency and q values.
Figure 3
Figure 3. A Venn diagram of all essential genes predicted by our system for the three samples.
Essential genes predicted by our system for the three samples. They include all three types of essential genes.
Figure 4
Figure 4. Locations of 548 essential genes identified in the Y. pestis chromosome.
Moving out from the centre the layers show; MF values; transposon insertion sites per gene for all genes; insertion counts per gene for all genes; transposon insertion counts per base pair genome-wise. Brown bars indicate Type I essential genes, red bars represent Type II essential genes and blue bars represent Type III essential genes.

References

    1. van Opijnen T. & Camilli A. Transposon insertion sequencing: a new tool for systems-level analysis of microorganisms. Nature Rev Microbiol. 11, 435–42 (2013). - PMC - PubMed
    1. Langridge G. et al.. Simultaneous assay of every Salmonella Typhi gene using one million transposon mutants. Genome Res. 19, 2308–16 (2009). - PMC - PubMed
    1. Barquist L. et al.. The TraDIS toolkit: sequencing and analysis for dense transposon mutant libraries. Bioinformatics. 32, 1109–11 (2016). - PMC - PubMed
    1. Akerley B. et al.. Systematic identification of essential genes by in vitro mariner mutagenesis. Proc Natl Acad Sci USA 95, 8927–32 (1998). - PMC - PubMed
    1. Gawronski J. et al.. Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lung. Proc Natl Acad Sci USA 106, 16422–7 (2009). - PMC - PubMed

Publication types

Substances