Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 28;14(1):30733.
doi: 10.1038/s41598-024-80175-x.

Development of an accurate breast cancer detection classifier based on platelet RNA

Affiliations

Development of an accurate breast cancer detection classifier based on platelet RNA

Wenlong Xie et al. Sci Rep. .

Abstract

Platelets possess cancer-induced reprogramming properties, thereby contributing to RNA profile alterations and further cancer progression, while the former is considered a promising biosource for cancer detection. Hence, tumor-educated platelets (TEP) are considered a prospective novel method for early breast cancer (BC) screening. Our study integrated the data from 276 patients with untreated BC, 95 with benign disease controls, 214 healthy controls, and 2 who underwent mastectomy in Chinese and European cohorts to develop a 10-biomarker diagnostic model. The model demonstrated high diagnostic performance for BC in an independent test set (n = 177) with an area under the curve of 0.957. The sensitivity for BC diagnosis was 89.2%, with 100% specificity in asymptomatic controls, while that for the symptomatic group, including benign tumors and inflammatory diseases, was 62.1%. The model demonstrated substantial accuracy for stages 0-III BC (80% for stage 0 [n = 5], 83.3% for stage I [n = 12], 94.6% for stage II [n = 37], and 88.9% for stage III [n = 9]) and precisely helped determine residual cancer in two patients who underwent mastectomy. Moreover, our developed classifiers distinguish different BC subtypes properly. In summary, we created and tested a new TEP-RNA-based BC diagnostic model that was confirmed valid and demonstrated high efficiency in detecting early-stage BC and heterogeneous subtypes, including recurrent tumors. However, these results warrant more validation in larger population-based prospective studies before clinical implementation.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests. Study approval: This study was approved by the Ethics Committees of the First Affiliated Hospital of Xiamen University (XMYY-2021KYSB005). This study does not include any experiments with animals performed by any of the authors.

Figures

Fig. 1
Fig. 1
Study design and patient enrollment. A total of 221 samples were disqualified, and Agilent 2100 Bioanalyzer was used to detect genomic contamination and severe degradation along with 3 samples that failed quality control. Quality control keeps samples that have a total detected RNA count of > 1500 per sample and a correlation of > 0.5 between samples. BC breast cancer.
Fig. 2
Fig. 2
Platelet differences in patients with BC and healthy donors. (a) Volcano plot illustrating the 2388 differentially expressed RNAs (P < 0.001, log CPM ≥ 3). A total of 1181 demonstrated significant upregulation (red), and 1207 exhibited downregulation (blue) in the BC group. (b) Heatmap of unsupervised clustering with BC and healthy individuals in the training cohort. We revealed that BC or normal samples could be distinguished through platelet RNA expression patterns.
Fig. 3
Fig. 3
Model development and optimization workflow. We identified 1761 differentially expressed genes using the training dataset with patients with BC and non-BC controls (P < 0.001, log CPM ≥ 3). Subsequently, we performed LASSO and single-factor logistic regression (with each marker AUC > 0.8) for feature reduction, leading to the selection of the top four genes. The comparison between patients with BC and symptomatic controls identified 120 differentially expressed genes (P < 0.05, log CPM ≥ 3). After dimensionality reduction (each marker AUC > 0.65), we received the top seven genes. Two separate panels were modeled for SVM. The 4 markers were employed for the initial data classification, and positive predictions (BC) were further refined using the 7 markers to obtain the final diagnostic outcome.
Fig. 4
Fig. 4
Performance of TEP-RNA panel in training and validation cohorts for BC diagnostics. ROC curves (a) for BC diagnostics using the 4-BC-TEP-RNA panel in training (black line) and validation (red line) cohorts along with the confusion matrix (b) tailored for the validation cohort. Furthermore, using the 10-BC-TEP-RNA panel to produce ROC curves (c) in both the training and validation cohorts, and presenting the confusion matrix (d), specifically for the latter.
Fig. 5
Fig. 5
Performance of TEP-RNA panel in an independent test cohort for BC diagnostics. (a) ROC curves for BC diagnostics using 4-BC-TEP-RNA and 10-BC-TEP-RNA panels on the independent test cohort. We observe a significantly higher AUC value of the 10-BC-TEP-RNA panel than that of the former (p = 1.578e–05). (b) Confusion matrix for BC diagnostics using the 4-BC-TEP-RNA panel in the test cohort. (c) Bar plot of detection accuracy for various stages of BC and control samples. High detection accuracy is observed for all BC stages. Stage na denotes samples without recorded tumor staging information.
Fig. 6
Fig. 6
SVM Model for breast cancer companion diagnosis with TEP-derived RNA. Extracting feature genes based on HER2 amplification or HR positivity using 60% of samples for model training and allocating 40% for validation. (a, b) Unsupervised clustering heatmap and classifier ROC curve based on HER2 receptor status. The AUC value in the validation set is 0.882 (95% CI 79.58%–96.92%), indicating a significant improvement over the random classifier (p = 2.238e–05). (c, d) Unsupervised clustering heatmap and classifier ROC curve based on HR receptor status. The AUC value in the validation set is 0.861 (95% CI 76.71%–95.54%), revealing the significant enhancement over the random classifier (p = 0.0063). Selected genes: the number of RNAs included in the training dataset of the SVM algorithm.

Similar articles

  • Detection and localization of early- and late-stage cancers using platelet RNA.
    In 't Veld SGJG, Arkani M, Post E, Antunes-Ferreira M, D'Ambrosi S, Vessies DCL, Vermunt L, Vancura A, Muller M, Niemeijer AN, Tannous J, Meijer LL, Le Large TYS, Mantini G, Wondergem NE, Heinhuis KM, van Wilpe S, Smits AJ, Drees EEE, Roos E, Leurs CE, Tjon Kon Fat LA, van der Lelij EJ, Dwarshuis G, Kamphuis MJ, Visser LE, Harting R, Gregory A, Schweiger MW, Wedekind LE, Ramaker J, Zwaan K, Verschueren H, Bahce I, de Langen AJ, Smit EF, van den Heuvel MM, Hartemink KJ, Kuijpers MJE, Oude Egbrink MGA, Griffioen AW, Rossel R, Hiltermann TJN, Lee-Lewandrowski E, Lewandrowski KB, De Witt Hamer PC, Kouwenhoven M, Reijneveld JC, Leenders WPJ, Hoeben A, Verdonck-de Leeuw IM, Leemans CR, Baatenburg de Jong RJ, Terhaard CHJ, Takes RP, Langendijk JA, de Jager SC, Kraaijeveld AO, Pasterkamp G, Smits M, Schalken JA, Łapińska-Szumczyk S, Łojkowska A, Żaczek AJ, Lokhorst H, van de Donk NWCJ, Nijhof I, Prins HJ, Zijlstra JM, Idema S, Baayen JC, Teunissen CE, Killestein J, Besselink MG, Brammen L, Bachleitner-Hofmann T, Mateen F, Plukker JTM, Heger M, de Mast Q, Lisman T, Pegtel DM, Bogaard HJ, Jassem J, Supernat A, Mehra N, Gerritsen W, de Kroon CD, Lok CAR, Piek JMJ, Steeghs N, van Houdt WJ, Br… See abstract for full author list ➔ In 't Veld SGJG, et al. Cancer Cell. 2022 Sep 12;40(9):999-1009.e6. doi: 10.1016/j.ccell.2022.08.006. Epub 2022 Sep 1. Cancer Cell. 2022. PMID: 36055228
  • Evaluation of the diagnostic value of circulating tumor cells with CytoSorter® CTC capture system in patients with breast cancer.
    Jin L, Zhao W, Zhang J, Chen W, Xie T, Wang L, Fan W, Xie S, Shen J, Zheng H, Hu W, Wei Q, Dong M, Wang Q, Shen J, Liu Y. Jin L, et al. Cancer Med. 2020 Mar;9(5):1638-1647. doi: 10.1002/cam4.2825. Epub 2020 Jan 6. Cancer Med. 2020. PMID: 31908156 Free PMC article.
  • 8-Hydroxy-2'-deoxyguanosine as a Discriminatory Biomarker for Early Detection of Breast Cancer.
    Nour Eldin EEM, El-Readi MZ, Nour Eldein MM, Alfalki AA, Althubiti MA, Mohamed Kamel HF, Eid SY, Al-Amodi HS, Mirza AA. Nour Eldin EEM, et al. Clin Breast Cancer. 2019 Apr;19(2):e385-e393. doi: 10.1016/j.clbc.2018.12.013. Epub 2018 Dec 21. Clin Breast Cancer. 2019. PMID: 30683611
  • Discovery and function exploration of microRNA-155 as a molecular biomarker for early detection of breast cancer.
    Liu X, Chang Q, Wang H, Qian H, Jiang Y. Liu X, et al. Breast Cancer. 2021 Jul;28(4):806-821. doi: 10.1007/s12282-021-01215-2. Epub 2021 Jan 21. Breast Cancer. 2021. PMID: 33475963 Free PMC article.
  • The future of blood-based biomarkers for the early detection of breast cancer.
    Loke SY, Lee ASG. Loke SY, et al. Eur J Cancer. 2018 Mar;92:54-68. doi: 10.1016/j.ejca.2017.12.025. Epub 2018 Feb 6. Eur J Cancer. 2018. PMID: 29413690 Review.

References

    1. Siegel, R. L., Miller, K. D., Fuchs, H. E. & Jemal, A. Cancer statistics, 2022. CA Cancer J. Clin.72, 7–33 (2022). - PubMed
    1. Haemmerle, M., Stone, R. L., Menter, D. G., Afshar-Kharghan, V. & Sood, A. K. The platelet lifeline to cancer: challenges and opportunities. Cancer Cell33, 965–983 (2018). - PMC - PubMed
    1. McAllister, S. S. & Weinberg, R. A. The tumour-induced systemic environment as a critical regulator of cancer progression and metastasis. Nat. Cell Biol.16, 717–727 (2014). - PMC - PubMed
    1. In’t Veld, S. & Wurdinger, T. Tumor-educated platelets. Blood133, 2359–2364 (2019). - PubMed
    1. Stone, R. L. et al. Paraneoplastic thrombocytosis in ovarian cancer. N. Engl. J. Med.366, 610–618 (2012). - PMC - PubMed

LinkOut - more resources