SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
- PMID: 34607568
- PMCID: PMC8491382
- DOI: 10.1186/s12859-021-04397-w
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
Abstract
Background: Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scattered and not accessible in a straightforward way to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the DDAs retrieved were compared to the DisGeNET resource for qualitative and quantitative comparison.
Results: We obtained the DDAs via co-mention using our SicknessMiner or gene- or variant-disease similarity on DisGeNET. SicknessMiner was able to retrieve around 92% of the DisGeNET results and nearly 15% of the SicknessMiner results were specific to our pipeline.
Conclusions: SicknessMiner is a valuable tool to extract disease-disease relationship from RAW input corpus.
Keywords: Biomedical text-mining; Blood cancers; Deep learning; Disease-disease associations; Natural language processing.
© 2021. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.
Figures




Similar articles
-
A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience.Neuroinformatics. 2019 Jul;17(3):391-406. doi: 10.1007/s12021-018-9404-y. Neuroinformatics. 2019. PMID: 30443819 Free PMC article.
-
Biomedical named entity recognition using deep neural networks with contextual information.BMC Bioinformatics. 2019 Dec 27;20(1):735. doi: 10.1186/s12859-019-3321-4. BMC Bioinformatics. 2019. PMID: 31881938 Free PMC article.
-
DTranNER: biomedical named entity recognition with deep learning-based label-label transition model.BMC Bioinformatics. 2020 Feb 11;21(1):53. doi: 10.1186/s12859-020-3393-1. BMC Bioinformatics. 2020. PMID: 32046638 Free PMC article.
-
A knowledge-driven approach to extract disease-related biomarkers from the literature.Biomed Res Int. 2014;2014:253128. doi: 10.1155/2014/253128. Epub 2014 Apr 16. Biomed Res Int. 2014. PMID: 24839601 Free PMC article. Review.
-
Text Mining for Drug Discovery.Methods Mol Biol. 2019;1939:231-252. doi: 10.1007/978-1-4939-9089-4_13. Methods Mol Biol. 2019. PMID: 30848465 Review.
Cited by
-
A study on large-scale disease causality discovery from biomedical literature.BMC Med Inform Decis Mak. 2025 Mar 18;25(1):136. doi: 10.1186/s12911-025-02893-0. BMC Med Inform Decis Mak. 2025. PMID: 40102814 Free PMC article.
-
Exploring novel disease-disease associations based on multi-view fusion network.Comput Struct Biotechnol J. 2023 Feb 24;21:1807-1819. doi: 10.1016/j.csbj.2023.02.038. eCollection 2023. Comput Struct Biotechnol J. 2023. PMID: 36923471 Free PMC article.
References
-
- Batool Z, Usman M, Saleem K, Abdullah-Al-Wadud M, Fazal-e-Amin A-E. Disease–disease association using network modeling: challenges and opportunities. J Med Imaging Health Inform. 2018;8(4):627–638. doi: 10.1166/jmihi.2018.2342. - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources