Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Nov;4(11):e000228.
doi: 10.1099/mgen.0.000228. Epub 2018 Oct 23.

PhasomeIt: an 'omics' approach to cataloguing the potential breadth of phase variation in the genus Campylobacter

Affiliations

PhasomeIt: an 'omics' approach to cataloguing the potential breadth of phase variation in the genus Campylobacter

Jack Aidley et al. Microb Genom. 2018 Nov.

Abstract

Hypermutable simple sequence repeats (SSRs) are drivers of phase variation (PV) whose stochastic, high-frequency, reversible switches in gene expression are a common feature of several pathogenic bacterial species, including the human pathogen Campylobacter jejuni. Here we examine the distribution and conservation of known and putative SSR-driven phase variable genes - the phasome - in the genus Campylobacter. PhasomeIt, a new program, was specifically designed for rapid identification of SSR-mediated PV. This program detects the location, type and repeat number of every SSR. Each SSR is linked to a specific gene and its putative expression state. Other outputs include conservation of SSR-driven phase-variable genes and the 'core phasome' - the minimal set of PV genes in a phylogenetic grouping. Analysis of 77 complete Campylobacter genome sequences detected a 'core phasome' of conserved PV genes in each species and a large number of rare PV genes with few, or no, homologues in other genome sequences. Analysis of a set of partial genome sequences, with food-chain-associated metadata, detected evidence of a weak link between phasome and source host for disease-causing isolates of sequence type (ST)-828 but not the ST-21 or ST-45 complexes. Investigation of the phasomes in the genus Campylobacter provided evidence of overlapping but distinctive mechanisms of PV-mediated adaptation to specific niches. This suggests that the phasome could be involved in host adaptation and spread of campylobacters. Finally, this tool is malleable and will have utility for studying the distribution and genic effects of other repetitive elements in diverse bacterial species.

Keywords: Campylobacter; PhasomeIt; phase variation; simple sequence repeat.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
Overview of the processes and outputs of PhasomeIt. This flow diagram depicts the four major processes (steps 1–4) applied by PhasomeIt to detection of PV genes in genome sequences and the major outputs of this program (step 5). Step 1, input genome sequences are subject to a search for SSRs using Bossref that allows for user-defined inputs of repeat numbers for each repeat unit type and outputs the genome position and type of SSR. Bossref converts the DNA sequence into a binary code (2 bits per base), shifts the sequence to the left by twice the repeat unit length, performs an eXclusive OR (xor) with the original sequence, and identifies the repeats as a run of 0’s equal to twice the repeat length minus one. Step 2, PhasomeIt utilizes these SSR data to link the SSRs to genes, now defined as a phase variation (PV) gene, and to determine if the SSR mediates translational or transcriptional switching of gene expression. For transcriptional PV wherein an SSR is close to two genes, a bias (indicated by the magnifying-glass) is introduced to link the SSR to the 5′ end of a gene and hence in the putative promoter region. Step 3, PhasomeIt performs homology searches to identify known or putative functions of this gene followed by Step 4 wherein homology searches are performed against all genes in all submitted genome sequences in order to identify homology groups of related genes. Step 5, a range of output datasets (indicated in boxes) and visual outputs are generated providing both summary data and enabling interactive exploration of specific genes and specific amino acid sequences.
Fig. 2.
Fig. 2.
Example gene group graphic with C. coli and C. fetus. This HTML output of PhasomeIt illustrates the homology of PV genes between isolates with each numbered row representing a different homology group. Coloured blocks show the presence or absence of PV genes: green is a tract located within the ORF, orange is a tract located nearby and grey is a non-PV homologue. Coloured bars at the start indicate the repeat unit on PV tracts. Faint beige bars are present to help readability and indicate absence of any homologue in that isolate. Numbered points shown in red indicate the following: 1, PV gene conserved in all C. coli strains but absent in C. fetus; 2, PV gene conserved in all C. coli and C. fetus strains; 3, PV gene present in only one isolate; 4, PV homology group present in all C. coli and C. fetus strains but only phase-variable in one isolate; 5, block of PV genes present in the majority of C. fetus isolates; 6, intragenic SSR present in only one isolate.
Fig. 3.
Fig. 3.
Number of PV genes per genome sequence in each species. This graph shows the number of putatively PV genes identified for each species. For species with five or more isolates, box and whiskers are shown, while for those with fewer each point is shown. These indicate the median and interquartile range (IQR) from 25 to 75 %. The whiskers stretch to the further point within 1.5 IQRs from the ends of the box, and outlying points beyond this range are individually marked. The labelled point is for C. jejuni subsp. doylei strain 269.97.
Fig. 4.
Fig. 4.
Neighbour-joining tree of phasome similarity for Campylobacter species. The tree depicts the phasome similarities created by applying a Manhattan distance metric to binary lists of presence and absence of homology groups. The tree is coloured by species. Numbers indicate strains, a full list is given in Table S4, and selected strains are: 1, C. jejuni NCTC 11168; 2, C. jejuni 81-176; 12, C. jejuni subsp. doylei 269.97; 25, C. jejuni NCTC 11351.
Fig. 5.
Fig. 5.
Tree based on homology groups shows clear separation by ST-complex. A neighbour-joining tree was derived using a Manhattan distance between isolates based on the presence or absence of homology groups. The tree is coloured to indicate the ST-complex for each isolate. A group of genomes with poor coverage across the highly conserved PV genes is indicated in red. Numbers are isolate numbers and carry no biological significance (see Table S5).
Fig. 6.
Fig. 6.
Host association in the C. jejuni STs. A neighbour-joining tree (derived as for Fig. 5) is shown for C. jejuni isolates from the ST-21 (a), ST-45 (b) and ST-823 complexes (c). Tree branches are coloured according to the source for each isolates. ST-828 isolates from cattle show significant grouping within the tree (P<0.05, tree-based scan statistic with conditional Poisson model). All other differences were non-significant.

References

    1. van der Woude MW, Bäumler AJ. Phase and antigenic variation in bacteria. Clin Microbiol Rev. 2004;17:581–611. doi: 10.1128/CMR.17.3.581-611.2004. - DOI - PMC - PubMed
    1. Bayliss CD, Palmer ME. Evolution of simple sequence repeat-mediated phase variation in bacterial genomes. Ann N Y Acad Sci. 2012;1267:39–44. doi: 10.1111/j.1749-6632.2012.06584.x. - DOI - PubMed
    1. Cooper KK, Cooper MA, Zuccolo A, Joens LA. Re-sequencing of a virulent strain of Campylobacter jejuni NCTC11168 reveals potential virulence factors. Res Microbiol. 2013;164:6–11. doi: 10.1016/j.resmic.2012.10.002. - DOI - PubMed
    1. Oldfield NJ, Matar S, Bidmos FA, Alamro M, Neal KR, et al. Prevalence and phase variable expression status of two autotransporters, NalP and MspA, in carriage and disease isolates of Neisseria meningitidis. PLoS One. 2013;8:e69746. doi: 10.1371/journal.pone.0069746. - DOI - PMC - PubMed
    1. Aidley J. Genomics and Population Dynamics of Phase Variable Genes In campylobacter. University of Leicester; 2017.

Publication types

LinkOut - more resources