A multi-constraint representation learning model for identification of ovarian cancer with missing laboratory indicators
- PMID: 39819725
- PMCID: PMC11744287
- DOI: 10.12122/j.issn.1673-4254.2025.01.20
A multi-constraint representation learning model for identification of ovarian cancer with missing laboratory indicators
Abstract
Objectives: To evaluate the performance of a multi-constraint representation learning classification model for identifying ovarian cancer with missing laboratory indicators.
Methods: Tabular data with missing laboratory indicators were collected from 393 patients with ovarian cancer and 1951 control patients. The missing ovarian cancer laboratory indicator features were projected to the latent space to obtain a classification model using the representational learning classification model based on discriminative learning and mutual information coupled with feature projection significance score consistency and missing location estimation. The proposed constraint term was ablated experimentally to assess the feasibility and validity of the constraint term by accuracy, area under the ROC curve (AUC), sensitivity, and specificity. Cross-validation methods and accuracy, AUC, sensitivity and specificity were also used to evaluate the discriminative performance of this classification model in comparison with other interpolation methods for processing of the missing data.
Results: The results of the ablation experiments showed good compatibility among the constraints, and each constraint had good robustness. The cross-validation experiment showed that for identification of ovarian cancer with missing laboratory indicators, the AUC, accuracy, sensitivity and specificity of the proposed multi-constraints representation-based learning classification model was 0.915, 0.888, 0.774, and 0.910, respectively, and its AUC and sensitivity were superior to those of other interpolation methods.
Conclusions: The proposed model has excellent discriminatory ability with better performance than other missing data interpolation methods for identification of ovarian cancer with missing laboratory indicators.
目的: 探索基于多约束表征学习分类模型在面对缺失实验室指标的情况下鉴别卵巢癌的鉴别能力和应用价值。方法: 收集了2344例患者(393例卵巢癌和1951例对照)的缺失实验室指标表格型数据,使用本研究提出的基于判别学习和互信息以及特征投影重要性得分一致性及缺失位置估算的表征学习分类模型对缺失的卵巢癌实验室指标特征进行投影到潜在空间得到分类模型。对提出的约束项进行消融实验,通过准确率、ROC曲线下面积(AUC)、敏感度、特异性说明约束项的可行性和有效项。采用交叉验证方法和准确率、AUC、敏感度、特异性评价该分类模型的鉴别性能。将本研究与其他用于缺失数据的插补方法进行对缺失数据处理后鉴别分类能力的对比。结果: 消融实验结果显示约束项之间有很好的相容性,每项约束项都有较好的鲁棒性。交叉验证结果显示,本研究提出的基于多约束表征学习分类模型在面对缺失实验室指标的情况下对卵巢癌的鉴别中的AUC、准确率、敏感度、特异性分别为0.915、0.888、0.774、0.910,其中AUC和敏感度优于其它缺失数据插补方法。结论: 基于多约束表征学习模型在缺失实验室指标鉴别卵巢癌的应用中具有优秀的鉴别能力和较高的应用价值。与其他缺失插补方法相比,本研究提出的多约束表征学习模型在针对卵巢癌缺失实验室指标的鉴别分类任务中具有较大的优势。.
Keywords: discriminant analysis; feature importance score consistency; missing data; missing position estimation; mutual information; ovarian cancer; shared representation learning.
Figures
Similar articles
-
[An MRI multi-sequence feature imputation and fusion mutual-aid model based on sequence deletion for differentiation of high-grade from low-grade glioma].Nan Fang Yi Ke Da Xue Xue Bao. 2024 Aug 20;44(8):1561-1570. doi: 10.12122/j.issn.1673-4254.2024.08.15. Nan Fang Yi Ke Da Xue Xue Bao. 2024. PMID: 39276052 Free PMC article. Chinese.
-
[A multi-modal feature fusion classification model based on distance matching and discriminative representation learning for differentiation of high-grade glioma from solitary brain metastasis].Nan Fang Yi Ke Da Xue Xue Bao. 2024 Jan 20;44(1):138-145. doi: 10.12122/j.issn.1673-4254.2024.01.16. Nan Fang Yi Ke Da Xue Xue Bao. 2024. PMID: 38293985 Free PMC article. Chinese.
-
An integrated machine learning-based model for joint diagnosis of ovarian cancer with multiple test indicators.J Ovarian Res. 2024 Feb 20;17(1):45. doi: 10.1186/s13048-024-01365-9. J Ovarian Res. 2024. PMID: 38378582 Free PMC article.
-
Pattern Classification for Ovarian Tumors by Integration of Radiomics and Deep Learning Features.Curr Med Imaging. 2022;18(14):1486-1502. doi: 10.2174/1573405618666220516122145. Curr Med Imaging. 2022. PMID: 35578861
-
Machine learning for epithelial ovarian cancer platinum resistance recurrence identification using routine clinical data.Front Oncol. 2024 Nov 8;14:1457294. doi: 10.3389/fonc.2024.1457294. eCollection 2024. Front Oncol. 2024. PMID: 39582538 Free PMC article.
References
-
- National Cancer Institute . Cancer stat facts: ovarian cancer 2024[EB/OL]. [2020-08-10]. https://seer.cancer.gov/statfacts/html/ovary.html.
-
- Zeng HM, Zheng RS, Guo YM, et al. . Cancer survival in China, 2003-2005: a population-based study[J]. Int J Cancer, 2015, 136(8): 1921-30. - PubMed
-
- Sundar S, Neal RD, Kehoe S. Diagnosis of ovarian cancer[J]. BMJ, 2015, 351: h4443. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical