Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 26;17(5):e0268967.
doi: 10.1371/journal.pone.0268967. eCollection 2022.

Integrated bioinformatics and statistical approaches to explore molecular biomarkers for breast cancer diagnosis, prognosis and therapies

Affiliations

Integrated bioinformatics and statistical approaches to explore molecular biomarkers for breast cancer diagnosis, prognosis and therapies

Md Shahin Alam et al. PLoS One. .

Abstract

Integrated bioinformatics and statistical approaches are now playing the vital role in identifying potential molecular biomarkers more accurately in presence of huge number of alternatives for disease diagnosis, prognosis and therapies by reducing time and cost compared to the wet-lab based experimental procedures. Breast cancer (BC) is one of the leading causes of cancer related deaths for women worldwide. Several dry-lab and wet-lab based studies have identified different sets of molecular biomarkers for BC. But they did not compare their results to each other so much either computationally or experimentally. In this study, an attempt was made to propose a set of molecular biomarkers that might be more effective for BC diagnosis, prognosis and therapies, by using the integrated bioinformatics and statistical approaches. At first, we identified 190 differentially expressed genes (DEGs) between BC and control samples by using the statistical LIMMA approach. Then we identified 13 DEGs (AKR1C1, IRF9, OAS1, OAS3, SLCO2A1, NT5E, NQO1, ANGPT1, FN1, ATF6B, HPGD, BCL11A, and TP53INP1) as the key genes (KGs) by protein-protein interaction (PPI) network analysis. Then we investigated the pathogenetic processes of DEGs highlighting KGs by GO terms and KEGG pathway enrichment analysis. Moreover, we disclosed the transcriptional and post-transcriptional regulatory factors of KGs by their interaction network analysis with the transcription factors (TFs) and micro-RNAs. Both supervised and unsupervised learning's including multivariate survival analysis results confirmed the strong prognostic power of the proposed KGs. Finally, we suggested KGs-guided computationally more effective seven candidate drugs (NVP-BHG712, Nilotinib, GSK2126458, YM201636, TG-02, CX-5461, AP-24534) compared to other published drugs by cross-validation with the state-of-the-art alternatives top-ranked independent receptor proteins. Thus, our findings might be played a vital role in breast cancer diagnosis, prognosis and therapies.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The pipeline of this study.
Fig 2
Fig 2
(A) Volcano plot of–log10(P-value) against log2FC values to display significantly upregulated and downregulated DEGs. (B) Heatmap of significantly upregulated and downregulated DEGs to observe the clustering performance of tumor and control groups by hierarchical clustering approach.
Fig 3
Fig 3. Protein-protein interaction (PPI) network of DEGs to select the key genes (KGs).
Blue color indicates downregulated and pink color indicates upregulated DEGs, big size and octagon shape indicate the KGs.
Fig 4
Fig 4
KGs regulatory network analysis results (A) KGs-TFs interaction network to identify key transcriptional regulators of KGs, (B) KGs-miRNAs interaction network to identify key post-transcriptional regulators of KGs. Here pink color octagon indicates the KGs in both A and B, blue color bigger size ellipse indicates key TFs in A and key miRNAs in B.
Fig 5
Fig 5
The prognostic powers of KGs were displayed by (A) Heatmap of hierarchical clustering (B) Multivariate survival curves with KGs and (C) ROC curves of prediction models with KGs.
Fig 6
Fig 6. Molecular docking simulation results for exploring candidate drugs against BC.
(A) Image of binding affinity scores of proposed ordered receptor proteins with the top 97 ordered meta-drug agents, (B) Image of binding affinity scores of proposed ordered receptor proteins with the proposed ordered candidate drugs only (C) Image of binding affinity scores of ordered proposed and already published candidate drugs against the top-ranked independent receptors published by others.
Fig 7
Fig 7. The 3D views of the selected strong binding interactions between drug targets and agents were displayed.
The key interacting amino acids and their binding types with potential targets were also shown.
Fig 8
Fig 8
Validation of the proposed KGs (receptors) and candidate drugs in favor of BC by the literature review (A) Validation of the proposed KGs: circles with blue color indicate downregulated KGs and pink color indicates upregulated KGs, and each connected network with a circle indicates the reference in which the KG is associated with BC, (B) Validation of the proposed candidate drugs: circles with red color indicate FDA approved and investigational drugs, green color indicate investigational drugs and ash color indicate unapproved drugs, and each connected network with a circle indicates the references in which our suggested drugs might be effective against BC treatment.

Similar articles

Cited by

References

    1. Lei S, Zheng R, Zhang S, Wang S, Chen R, Sun K, et al.. Global patterns of breast cancer incidence and mortality: A population-based cancer registry data analysis from 2000 to 2020. Cancer Commun (Lond). 2021; 41(11):1183–94. doi: 10.1002/cac2.12207 . - DOI - PMC - PubMed
    1. Koo MM, von Wagner C, Abel GA, McPhail S, Rubin GP, Lyratzopoulos G. Typical and atypical presenting symptoms of breast cancer and their associations with diagnostic intervals: Evidence from a national audit of cancer diagnosis. Cancer Epidemiol. 2017; 48:140–6. doi: 10.1016/j.canep.2017.04.010 . - DOI - PMC - PubMed
    1. Cancer.Net. Breast Cancer: Statistics 2021. Available from: https://www.cancer.net/cancer-types/breast-cancer/statistics.
    1. Dong H, Zhang S, Wei Y, Liu C, Wang N, Zhang P, et al.. Bioinformatic analysis of differential expression and core GENEs in breast cancer. Int J Clin Exp Pathol. 2018; 11(3):1146–56 . - PMC - PubMed
    1. Mosharaf MP, Reza MS, Kibria MK, Ahmed FF, Kabir MH, Hasan S, et al.. Computational identification of host genomic biomarkers highlighting their functions, pathways and regulators that influence SARS-CoV-2 infections and drug repurposing. Sci Rep. 2022; 12(1):4279. doi: 10.1038/s41598-022-08073-8 . - DOI - PMC - PubMed

Publication types

MeSH terms