Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Feb;42(4):2415-32.
doi: 10.1093/nar/gkt1201. Epub 2013 Dec 2.

The complex methylome of the human gastric pathogen Helicobacter pylori

Affiliations

The complex methylome of the human gastric pathogen Helicobacter pylori

Juliane Krebes et al. Nucleic Acids Res. 2014 Feb.

Abstract

The genome of Helicobacter pylori is remarkable for its large number of restriction-modification (R-M) systems, and strain-specific diversity in R-M systems has been suggested to limit natural transformation, the major driving force of genetic diversification in H. pylori. We have determined the comprehensive methylomes of two H. pylori strains at single base resolution, using Single Molecule Real-Time (SMRT®) sequencing. For strains 26695 and J99-R3, 17 and 22 methylated sequence motifs were identified, respectively. For most motifs, almost all sites occurring in the genome were detected as methylated. Twelve novel methylation patterns corresponding to nine recognition sequences were detected (26695, 3; J99-R3, 6). Functional inactivation, correction of frameshifts as well as cloning and expression of candidate methyltransferases (MTases) permitted not only the functional characterization of multiple, yet undescribed, MTases, but also revealed novel features of both Type I and Type II R-M systems, including frameshift-mediated changes of sequence specificity and the interaction of one MTase with two alternative specificity subunits resulting in different methylation patterns. The methylomes of these well-characterized H. pylori strains will provide a valuable resource for future studies investigating the role of H. pylori R-M systems in limiting transformation as well as in gene regulation and host interaction.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Circos plots displaying the distribution of methylated bases in the genomes of H. pylori 26695 (A) and J99-R3 (B). The outermost and innermost tracks represent the interpulse duration ratios. The colored tracks in between represent the location of methylation of the different motifs. Each motif is displayed by one track except for the asymmetric Type I recognition sites for which both motifs were combined. For m6A and m4C methylation, the cutoff for calling a site modified was Qmod >100 (kinetic score); for the TET-treated m5C methylation a kinetic score threshold >50 was used. The position and type of methylation for each motif is depicted in the legend. Novel motifs are highlighted in red. The genome tick marks display the genomic positions (kbp). The locations of the cag pathogenicity island (cagPAI) and PZ are indicated. Please note that the PZ of strain 26695 is split into two regions (21).
Figure 2.
Figure 2.
Graphical representation of the silenced JHP0414/JHP0415 Type I MTase of J99-R3 (A), the putative phase-variable Type III MTases (B), BcgI-like R-M systems (C) and Type ISP MTases (D). (A) The gene sequences of HsdM (JHP0415) and HsdS (JHP0414) are displayed as gray bars and the protein sequences as yellow bars. SMRT sequencing of the J99-R3 genome did not reveal methylation by the Type I MTase complex (dashed lines), while expression of the enzyme complex on a plasmid in E. coli 2796 and subsequent SMRT analyses demonstrate methylation of the recognition site AAAN6TGG. (B–D) The gene sequences (gray arrows) contain one or two putative or authentic frameshifts that would prevent translation of full-length proteins (colored bars). Each frameshift was repaired through site-directed mutagenesis (addition and deletion of an indicated number of nucleotides, +15 Nt—TAAGGTTAATATATG) and correction was verified by targeted Sanger sequencing. The activity of the MTase proteins was pretested with a methylation activity assay. If an MTase showed activity in the assay, the gDNA of the corresponding E. coli host strain ER2796 was subjected to SMRT sequencing and analyzed for methylation. ‘mut’ marks alleles with frameshift corrections, ‘trunc’ labels not fully translated alleles, ‘n.d.’ points out that no SMRT sequencing was performed and dashed lines indicate that no methylation could be detected through SMRT sequencing. Filled inverted triangles highlight the position of the frameshift mutations. JHP—ORFs of J99, HP—ORFs of 26695. Homologous systems are displayed in the same color.
Figure 3.
Figure 3.
Functional characterization of JHP0785/JHP0786 and JHP0726 through insertion mutagenesis (A) and expression of the R-M genes in E. coli 2796 (B) confirmed that the Type I MTase JHP0786 interacts with the two S subunits JHP0785 or JHP0726 to achieve methylation of two different recognition sites. The methylation patterns of the J99-R3 parental and mutant genomes and the E. coli ER2796 host genomes were analyzed by SMRT sequencing. Motifs displayed in gray are not modified. Methylation of a recognition site is indicated by coloring and boldface type. The methylated nucleotide within a site is highlighted by the addition of a methyl group (CH3) to the sequence. ‘M’ is the abbreviation for methyltransferase, which acts as a homodimer. The two S subunits are composed of either one or two TRDs and interaction of TRDs is achieved by helical connector regions (H) adjacent to each TRD. (A) Expression of the three R-M components in J99-R3 and functional inactivation of either component (labeled by a black cross) resulted in different methylation patterns of the two distinct recognition sites. (B) Analysis of enzyme activity through plasmid-based expression of either JHP0786/JHP0785 (pRRS) alone or together with JHP0726 (pACYC184) in E. coli.
Figure 4.
Figure 4.
Graphical representation of the unique ability of the Type I MTase complex JHP1422/JHP1423 (HsdS/HsdM) to recognize and methylate two distinct recognition sites. (A) Schematic representation of the domain structure of the JHP1422 specificity subunit (HsdS) deduced from the J99 genome sequence (21). The proposed protein harbors three TRDs of which TRD1 is unique and TRD2 and 3 are duplicated as indicated by the coloring. Helical connector regions (H), known as conserved regions (CR), flank each TRD. (B) Inactivation of JHP1423 (HsdM) by insertion mutagenesis and subsequent SMRT analysis resulted in a lack of methylation of two different recognition sites as indicated by the gray color of the two sites. (C) Joint expression of JHP1423 (M) and several allele variants of JHP1422 led to different methylation patterns in the E. coli host genome. Cloning of JHP1422/JHP1423 yielded two naturally occurring allelic variants of JHP1422 (left panel: allele variant 1 and 2, JHP1423 sequence not shown) that differ in composition and protein length. In contrast to the proposed sequence, the H2 region following TRD2 is replaced by the H1 for allele variant 1. Allele variant 2 shows the same exchange but, in addition, has a second TRD2 flanked by the H2 region. Both variants have a different impact on methylation of the two recognition sites (right panel). Allele variants 3–6 were obtained by mutagenesis to reveal the specificity of the TRDs. Coloring and boldface type highlight methylation of the recognition sites, and the methyl group (CH3) indicates the modified nucleotide. Recognition sequences displayed in gray are not modified. The methylation patterns were analyzed by SMRT sequencing of the J99-R3jhp1423 mutant genome (B) and E. coli ER2796 genomes (C).
Figure 5.
Figure 5.
Graphical representation of the specificity switching phenomenon of the two homologous Type IIG MTases JHP1272 and HP1353-HP1354. (A) The gene sequences (gray arrows) contain two putative or authentic frameshifts that would prevent translation of full-length proteins (blue bars). Each frameshift was repaired through site-directed mutagenesis [addition or deletion of C nucleotide(s) is indicated] and correction was verified by targeted Sanger sequencing. Plasmid DNA (Hpy99XIV) or gDNA (remainder) of the E. coli host strains ER2796 was subjected to SMRT sequencing and analyzed for methylation. ‘mut’ marks alleles with frameshift corrections, ‘n.d.’ points out that no SMRT sequencing was performed and dashed lines indicate that no methylation could be detected. Filled inverted triangles highlight the position of the frameshift mutations. JHP—ORF of J99, HP—ORF of 26695. (B–D) Portions of the amino-acid multiple sequence alignments including secondary structure predictions (raw output of PROMALS) of the MTases JHP1272 and HP1353-HP1354. The numbers preceding each sequence display the position within the amino acid sequence. An asterisk at the end of a sequence indicates a stop codon. CTRD—C-terminal additional recognition domain, TRD—target recognition domain. (B) Secondary structure-guided alignment of JHP1272 and HP1353 CTRD and TRD with two MmeI family enzymes. The alignment shows the similarity of both MTases to the secondary structure of MmeI and MaqI in the MmeI region known to specify recognition of the −2 and −1 bases (relative to the methylated base within a recognition site). For these positions a changing specificity in the two active variants of JHP1272 and HP1353-HP1354 was observed. The red color text indicates predicted alpha helices and blue indicates predicted beta strands. The arrows indicate the four beta strands forming the structural motif in MmeI family enzymes that present −2 and −1 contact amino acids. The underlined PPPP region represents the first homopolymeric repeat of C nucleotides. (C) Putative recognition region for −1 and −2 base positions within the regular TRD and CTRD of the two MTases. Amino acid identities are depicted in bold black, similarities in bold green. (D) Alignment of HP1353-HP1354 and JHP1272 CTRD. Both regions show 87% amino acid identity (82 of 94), which most likely accounts for the identical specificity switch for −1 and −2 base positions observed for both protein variants containing the CTRD.

Similar articles

Cited by

References

    1. Suerbaum S, Michetti P. Helicobacter pylori infection. N. Engl. J. Med. 2002;347:1175–1186. - PubMed
    1. Falush D, Wirth T, Linz B, Pritchard JK, Stephens M, Kidd M, Blaser MJ, Graham DY, Vacher S, Perez-Perez GI, et al. Traces of human migrations in Helicobacter pylori populations. Science. 2003;299:1582–1585. - PubMed
    1. Suerbaum S, Josenhans C. Helicobacter pylori evolution and phenotypic diversification in a changing host. Nat. Rev. Microbiol. 2007;5:441–452. - PubMed
    1. Linz B, Balloux F, Moodley Y, Manica A, Liu H, Roumagnac P, Falush D, Stamer C, Prugnolle F, van der Merwe SW, et al. An African origin for the intimate association between humans and Helicobacter pylori. Nature. 2007;445:915–918. - PMC - PubMed
    1. Bjorkholm B, Sjolund M, Falk PG, Berg OG, Engstrand L, Andersson DI. Mutation frequency and biological cost of antibiotic resistance in Helicobacter pylori. Proc. Natl Acad. Sci. USA. 2001;98:14607–14612. - PMC - PubMed

Publication types

Substances