Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr 11;9(2):175-186.
doi: 10.1007/s13167-018-0131-0. eCollection 2018 Jun.

Premenopausal breast cancer: potential clinical utility of a multi-omics based machine learning approach for patient stratification

Affiliations

Premenopausal breast cancer: potential clinical utility of a multi-omics based machine learning approach for patient stratification

Holger Fröhlich et al. EPMA J. .

Abstract

Background: The breast cancer (BC) epidemic is a multifactorial disease attributed to the early twenty-first century: about two million of new cases and half a million deaths are registered annually worldwide. New trends are emerging now: on the one hand, with respect to the geographical BC prevalence and, on the other hand, with respect to the age distribution. Recent statistics demonstrate that young populations are getting more and more affected by BC in both Eastern and Western countries. Therefore, the old rule "the older the age, the higher the BC risk" is getting relativised now. Accumulated evidence shows that young premenopausal women deal with particularly unpredictable subtypes of BC such as triple-negative BC, have lower survival rates and respond less to conventional chemotherapy compared to the majority of postmenopausal BC.

Working hypothesis: Here we hypothesised that a multi-level diagnostic approach may lead to the identification of a molecular signature highly specific for the premenopausal BC. A multi-omic approach using machine learning was considered as a potent tool for stratifying patients with benign breast alterations into well-defined risk groups, namely individuals at high versus low risk for breast cancer development.

Results and conclusions: The study resulted in identifying multi-omic signature specific for the premenopausal BC that can be used for stratifying patients with benign breast alterations. Our predictive model is capable of discriminating individually between high and low BC-risk with high confidence (>90%) and considered of potential clinical utility. Novel risk assessment approaches and advanced screening programmes-as the long-term target of this project-are of particular importance for predictive, preventive and personalised medicine as the medicine of the future, due to the expected health benefits for young subpopulations and the healthcare system as a whole.

Keywords: Bioinformatics; Biomarker panel; Breast cancer; Laboratory medicine; Machine learning; Menopause; Multi-level diagnostics; Patient stratification; Predictive preventive personalised medicine.

PubMed Disclaimer

Conflict of interest statement

Compliance with ethical standardsThe authors declare that they have no competing interests.Not applicable.All the patient investigations conformed to the principles outlined in the Declaration of Helsinki and have been performed with the permission (Nr. 148/05) released by the responsible Ethic’s Committee of the Medical Faculty, Rheinische Friedrich-Wilhelms-University of Bonn. Human rights have been obligatory protected during the entire duration of the project according to the European standards. All the patients were informed about the purposes of the study and have signed their “consent of the patient”. This article does not contain any studies with animals performed by any of the authors.

Figures

Fig. 1
Fig. 1
Western blot imaging of the expression rates for a actin and b catalase as demonstrated for the samples/patients numbered 1–7 which correspond to the patients 1 and 2 diagnosed with benign breast alterations, 3–7 breast cancer patients, whereby 3–6 are premenopausal BC and 7 is postmenopausal BC
Fig. 2
Fig. 2
Cophenetic correlation (left) and silhouette index (right) as a function of the number of NMF clusters. Both plots clearly favour a solution with two patient groups. The solution of a consensus clustering over 100 NMF runs (purple) is contrasted with the silhouette indices for clustering the markers (red) and the patients (green)
Fig. 3
Fig. 3
Left: silhouette plot of two patient subgroups with 37 and 24 patients, respectively. The x-axis shows the cluster silhouette for each patient on the y-axis. The cluster silhouette is a measure of how similar each patient is compared to patients in its own cluster and the closest patient from other clusters. The silhouette measure ranges from 0 to 1, where 1 indicates a perfect agreement of the assumed cluster assignment with patient distances. The average cluster silhouette for all patients in cluster 1 and 2 is shown. The overall average silhouette index was 1. Right: consensus matrix with super-imposed dendrogram of hierarchical clustering. The consensus matrix depicts the relative frequency of two patients falling into the same cluster across repeated NMF runs. A clear separation of two patient subgroups (red blocks) can be seen. Patients in these groups frequently fall into the same cluster.
Fig. 4
Fig. 4
Prediction of correct patient cluster assignment: The boxplot shows the distribution of 10 AUC values resulting from 10 repeats of a tenfold cross-validation procedure. Within each cross-validation loop, a GBM classifier was trained on 9/10 of the available patient data and tested on the hold out rest. A 50% AUC indicates chance level and a AUC of 100% a perfect prediction performance
Fig. 5
Fig. 5
Principal component plot depicting both identified patient subgroup and 14 breast cancer patients. Shown is the projection of patients (indicated by numbers) on the first two principal components of the biomarker signature space. The first two principal components explain 19.6 and 9.2% of the total variance
Fig. 6
Fig. 6
Boxplots depicting the distribution of individual variables in low risk (cluster 1) and high risk (cluster 2) patient subgroups. All markers show a statistically significant difference between high- and low-risk groups after multiple testing corrections
Fig. 7
Fig. 7
Heat map of patients—the x-axis displays individualised patient profiles (patients involved are listed on the y-axis); characteristic patterns of following biomarker are presented: CA IV, CA IV/I + II + III, CA II, CA III, actin expression, catalase expression, CA I, and hybridome Hcy/CA(I–IV). The colour code displays the row-wise normalised magnitude (e.g. expression levels for proteins) of each marker (z-score): green = high, red = low. Example: indicated with the red arrow patient is at high BC risk; in this case, low levels of CA IV, CA IV/I + II + III, CA II and CA III, but high levels of actin, catalase, CA I, and Hcy / CA (I–IV) have been demonstrated that is characteristic for the profile of the “high risk” cluster. The patient at low risk for BC indicated with the green arrow shows exactly opposite patterns

Similar articles

Cited by

References

    1. Golubnitschaja O, Debald M, Yeghiazaryan K, Kuhn W, Pešta M, Costigliola V, Grech G. Breast cancer epidemic in the early 21st century: evaluation of risk factors, cumulative questionnaires and recommendations for preventive measures. Tumor Biol. 2016;37(10):12941–12957. doi: 10.1007/s13277-016-5168-x. - DOI - PubMed
    1. Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2011;61(2):69–90. doi: 10.3322/caac.20107. - DOI - PubMed
    1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2016. CA Cancer J Clin. 2016;66(1):7–30. doi: 10.3322/caac.21332. - DOI - PubMed
    1. Smokovski I, Risteski M, Polivka J, Jr, Zubor P, Konieczka K, Costigliola V, Golubnitschaja O. Postmenopausal breast cancer: European challenge and innovative concepts. EPMA J. 2017;8(2):159–169. doi: 10.1007/s13167-017-0094-6. - DOI - PMC - PubMed
    1. American Cancer Society . Global cancer facts & figures. 2. Atlanta: American Cancer Society; 2011.