This is a preprint.
Advancing sarcoma diagnostics with expanded DNA methylation-based classification
- PMID: 40630595
- PMCID: PMC12236892
- DOI: 10.1101/2025.06.30.25330543
Advancing sarcoma diagnostics with expanded DNA methylation-based classification
Abstract
Purpose: Sarcomas pose a severe diagnostic challenge. A wide variety of these distinct entities need to be distinguished from each other and from less aggressive types of mesenchymal tumors, to ensure correct clinical management. A machine learning based classifier for sarcomas utilizing DNA methylation data from 1077 tumors recognizing 62 sarcoma types has already been developed and termed the sarcoma classifier, which we published in 2021. Here we present a major advancement of the scale and precision of the sarcoma classifier.
Methods: DNA methylation profiles and histologic data from an unprecedented multi-institutional cohort of mesenchymal tumors were collected and analyzed. Utilizing a machine learning approach, the classifier was rigorously validated through five-fold nested cross-validation, achieving a 98% class-level accuracy and a Brier score of 0.017, indicative of well-calibrated probability estimates.
Results: The sarcoma classifier v13.1 was developed based on a training set of 4377 methylation profiles from sarcomas and less aggressive mesenchymal tumors comprising 116 tumor sub-classes and 4 control groups forming 93 distinct methylation classes. Performance was validated using four independent cohorts, comprising a total of 1547 mesenchymal tumors. A methylation-based classifier prediction was obtained in 73% of cases in the validation sets, of which 91% matched the original histopathology diagnosis, thereby increasing diagnostic confidence. The classifier enabled a definitive molecular diagnosis or tumor reclassification in 6% of cases with inconclusive or ambiguous histological findings.
Conclusion: Adding new sarcoma types and expanding tumor sample numbers in each methylation class in the new sarcoma classifier decisively increased the number of diagnostic predictions and improved match with histologic evaluation. This substantial advancement will promote clinical implementation of the tool for the diagnosis of mesenchymal tumor lesions.
Similar articles
-
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21. Clin Orthop Relat Res. 2025. PMID: 38905450
-
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23. Clin Orthop Relat Res. 2024. PMID: 39051924
-
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22. Clin Orthop Relat Res. 2024. PMID: 38517402
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
-
Rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection.Cochrane Database Syst Rev. 2022 Jul 22;7(7):CD013705. doi: 10.1002/14651858.CD013705.pub3. Cochrane Database Syst Rev. 2022. PMID: 35866452 Free PMC article.
Publication types
LinkOut - more resources
Full Text Sources