Evaluation of core genome and whole genome multilocus sequence typing schemes for Campylobacter jejuni and Campylobacter coli outbreak detection in the USA
- PMID: 37133905
- PMCID: PMC10272873
- DOI: 10.1099/mgen.0.001012
Evaluation of core genome and whole genome multilocus sequence typing schemes for Campylobacter jejuni and Campylobacter coli outbreak detection in the USA
Abstract
Campylobacter is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Pulsed-field gene electrophoresis (PFGE) and 7-gene multilocus sequence typing (MLST) have been historically used to differentiate sporadic from outbreak Campylobacter isolates. Whole genome sequencing (WGS) has been shown to provide superior resolution and concordance with epidemiological data when compared with PFGE and 7-gene MLST during outbreak investigations. In this study, we evaluated epidemiological concordance for high-quality SNP (hqSNP), core genome (cg)MLST and whole genome (wg)MLST to cluster or differentiate outbreak-associated and sporadic Campylobacter jejuni and Campylobacter coli isolates. Phylogenetic hqSNP, cgMLST and wgMLST analyses were also compared using Baker's gamma index (BGI) and cophenetic correlation coefficients. Pairwise distances comparing all three analysis methods were compared using linear regression models. Our results showed that 68/73 sporadic C. jejuni and C. coli isolates were differentiated from outbreak-associated isolates using all three methods. There was a high correlation between cgMLST and wgMLST analyses of the isolates; the BGI, cophenetic correlation coefficient, linear regression model R 2 and Pearson correlation coefficients were >0.90. The correlation was sometimes lower comparing hqSNP analysis to the MLST-based methods; the linear regression model R 2 and Pearson correlation coefficients were between 0.60 and 0.86, and the BGI and cophenetic correlation coefficient were between 0.63 and 0.86 for some outbreak isolates. We demonstrated that C. jejuni and C. coli isolates clustered in concordance with epidemiological data using WGS-based analysis methods. Discrepancies between allele and SNP-based approaches may reflect the differences between how genomic variation (SNPs and indels) are captured between the two methods. Since cgMLST examines allele differences in genes that are common in most isolates being compared, it is well suited to surveillance: searching large genomic databases for similar isolates is easily and efficiently done using allelic profiles. On the other hand, use of an hqSNP approach is much more computer intensive and not scalable to large sets of genomes. If further resolution between potential outbreak isolates is needed, wgMLST or hqSNP analysis can be used.
Keywords: Campylobacter; cgMLST; hqSNP; outbreak; wgMLST.
Conflict of interest statement
The authors declare that there are no conflicts of interest.
Figures







Similar articles
-
Evaluation of whole and core genome multilocus sequence typing allele schemes for Salmonella enterica outbreak detection in a national surveillance network, PulseNet USA.Front Microbiol. 2023 Sep 21;14:1254777. doi: 10.3389/fmicb.2023.1254777. eCollection 2023. Front Microbiol. 2023. PMID: 37808298 Free PMC article.
-
Comparison of Molecular Subtyping and Antimicrobial Resistance Detection Methods Used in a Large Multistate Outbreak of Extensively Drug-Resistant Campylobacter jejuni Infections Linked to Pet Store Puppies.J Clin Microbiol. 2020 Sep 22;58(10):e00771-20. doi: 10.1128/JCM.00771-20. Print 2020 Sep 22. J Clin Microbiol. 2020. PMID: 32719029 Free PMC article.
-
Validation of Core and Whole-Genome Multi-Locus Sequence Typing Schemes for Shiga-Toxin-Producing E. coli (STEC) Outbreak Detection in a National Surveillance Network, PulseNet 2.0, USA.Microorganisms. 2025 Jun 4;13(6):1310. doi: 10.3390/microorganisms13061310. Microorganisms. 2025. PMID: 40572198 Free PMC article.
-
Campylobacter sequence typing databases: applications and future prospects.Microbiology (Reading). 2012 Nov;158(Pt 11):2695-2709. doi: 10.1099/mic.0.062000-0. Epub 2012 Sep 17. Microbiology (Reading). 2012. PMID: 22986295 Review.
-
Whole Genome Sequencing Based Surveillance of L. monocytogenes for Early Detection and Investigations of Listeriosis Outbreaks.Front Public Health. 2019 Jun 4;7:139. doi: 10.3389/fpubh.2019.00139. eCollection 2019. Front Public Health. 2019. PMID: 31214559 Free PMC article. Review.
Cited by
-
Evaluation of whole and core genome multilocus sequence typing allele schemes for Salmonella enterica outbreak detection in a national surveillance network, PulseNet USA.Front Microbiol. 2023 Sep 21;14:1254777. doi: 10.3389/fmicb.2023.1254777. eCollection 2023. Front Microbiol. 2023. PMID: 37808298 Free PMC article.
-
CampyTube: Seamless Integration of a Molecular Test and Lateral Flow Detection of Campylobacter in a Single Vial.Biosensors (Basel). 2025 Aug 1;15(8):497. doi: 10.3390/bios15080497. Biosensors (Basel). 2025. PMID: 40862958 Free PMC article.
-
The integrated genomic surveillance system of Andalusia (SIEGA) provides a One Health regional resource connected with the clinic.Sci Rep. 2024 Aug 19;14(1):19200. doi: 10.1038/s41598-024-70107-0. Sci Rep. 2024. PMID: 39160186 Free PMC article.
-
dTOURS: Dense-region tagging for outbreak detection using ratio statistics.PLoS One. 2025 May 13;20(5):e0322663. doi: 10.1371/journal.pone.0322663. eCollection 2025. PLoS One. 2025. PMID: 40359413 Free PMC article.
-
Comparison of gene-by-gene and genome-wide short nucleotide sequence-based approaches to define the global population structure of Streptococcus pneumoniae.Microb Genom. 2024 Aug;10(8):001278. doi: 10.1099/mgen.0.001278. Microb Genom. 2024. PMID: 39196267 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources