Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 17;26(1):181.
doi: 10.1186/s12859-025-06200-6.

Automated web-based typing of Clostridioides difficile ribotypes via MALDI-TOF MS

Affiliations

Automated web-based typing of Clostridioides difficile ribotypes via MALDI-TOF MS

Mario Blázquez-Sánchez et al. BMC Bioinformatics. .

Abstract

Background: Clostridioides difficile is a major cause of hospital-acquired diarrhea and a driver of nosocomial outbreaks, yet rapid, accurate ribotype identification remains challenging. We sought to develop a MALDI-TOF MS-based workflow coupled with machine learning to distinguish epidemic toxigenic ribotypes (RT027 and RT181) from other strains in real time.

Results: We analyzed MALDI-TOF spectra from 379 clinical isolates collected across ten Spanish hospitals and identified seven discriminant biomarker peaks. Two peaks (2463 and 4993 m/z) were uniquely associated with RT027, while combinations of five additional peaks reliably identified RT181. Our classifiers-implemented both in the commercial Clover MSDAS platform and the open-access AutoCdiff web tool-achieved up to 100% balanced accuracy in ribotype assignment and proved robust in real-time outbreak simulations.

Conclusions: This study demonstrates that MALDI-TOF MS combined with tailored machine learning can deliver rapid, high-precision ribotype identification for C. difficile. The freely available AutoCdiff models ( https://bacteria.id ) offer an immediately deployable solution for clinical laboratories, with the potential to enhance outbreak surveillance and control.

Keywords: Biomarker peaks; Classification; Clostridioides difficile; Clostridium difficile; MALDI-TOF MS; Machine learning; Neural network; Outbreak; Random forest; Ribotyping.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study was reviewed and approved by the HGUGM Ethics Committee (CEIm) under study number MICRO.HGUGM.2021-025. Because only anonymized microbiological isolates were used and no human specimens or personal data were involved, the Committee waived the requirement for informed consent. Consent for publication: Not applicable Competing interests: MJA and LM are employees of Clover Bioanalytical Software, S.L.

Figures

Fig. 1
Fig. 1
DBLR-FS graphical model. The training of this algorithm is divided into two phases: (1) Kernelization: Both the MALDI-TOF training spectra formula image and a data subset (relevance vectors) formula image are combined into a linear kernel formula image to define the dual space with the same dimensions as number of samples. (2) Classification: The linear kernel formula image along with the dual weights formula image define a linear regression over outputs formula image. Moreover, the prior distribution of formula image (dependent on formula image, see Supplementary Material 1) is initialized with higher values at peaks known to be relevant based on expert knowledge from microbiologists. Finally, output formula image is introduced on a Bayesian LR to determine the most likely ribotype
Fig. 2
Fig. 2
FA-VAE training and prediction phases. Training phase (left): Paired MALDI-TOF spectra and known ribotypes are fed into two modules. The VAE encoder–decoder compresses each spectrum (formula image) and reconstructs the original signal, while the Bayesian LR module embeds the ribotype label (formula image). The two vectors are then merged into a single global encoding (G) that captures both spectral and ribotype information for each isolate. Prediction phase (right): A new spectrum is passed through the VAE encoder to produce formula image, which is projected into the global space G. From G, the model extracts the ribotype vector formula image and uses the Bayesian classifier to output the most likely ribotype (for example, RT181 with 80% probability)
Fig. 3
Fig. 3
Screenshot of the upload page web interface
Fig. 4
Fig. 4
Screenshot of the results page web interface
Fig. 5
Fig. 5
Peaks used for creation of predictive models for Clover MSDAS. a Peak 2463 m/z. b Intensities of 2463 m/z according to model categories. c Peak 3353 m/z. d Intensities of 3353 m/z. e Peaks 4933 and 4993 m/z. f Intensities of 4933 m/z. g Intensities of 4993 m/z. h Peak 6187 m/z. i Intensities of 6187 m/z. j Peaks 6651 and 6710 m/z. k Intensities of 6651 m/z. l Intensities of 6710 m/z
Fig. 6
Fig. 6
Proposed laboratory workflow with the implementation of C. difficile typing by MALDI-TOF MS methodology presented in this paper. The workflow shows an Xpert®-dependent option and a direct-culture option for laboratories without a Xpert® system available. PFGE: Pulsed-Field Gel Electrophoresis. WGS: Whole-Genome Sequencing

Similar articles

References

    1. Czepiel J, Dróżdż M, Pituch H, Kuijper EJ, Perucki W, Mielimonka A, et al. Clostridium difficile infection. Eur J Clin Microbiol Infect Dis. 2019;38:1211–21. - PMC - PubMed
    1. Markovska R, Dimitrov G, Gergova R, Boyanova L. Clostridioides difficile, a New “Superbug’’. Microorganisms. 2023;11(4):845. - PMC - PubMed
    1. He M, Miyajima F, Roberts P, Ellison L, Pickard DJ, Martin MJ, et al. Emergence and global spread of epidemic healthcare-associated Clostridium difficile. Nat Genet. 2013;45(1):109–13. - PMC - PubMed
    1. Viprey VF, Davis GL, Benson AD, Ewin D, Spittal W, Vernon JJ, et al. A point-prevalence study on community and inpatient Clostridioides difficile infections (CDI): results from combatting bacterial resistance in Europe CDI (COMBACTE-CDI), July to November 2018. Eurosurveillance. 2022;27(26):2100704. - PMC - PubMed
    1. Kachrimanidou M, Metallidis S, Tsachouridou O, Harmanus C, Lola V, Protonotariou E, et al. Predominance of Clostridioides difficile PCR ribotype 181 in northern Greece, 2016–2019. Anaerobe. 2022;76: 102601. - PubMed

MeSH terms

LinkOut - more resources