A comparative evaluation of biomedical similar article recommendation

Li Zhang¹, Wei Lu², Haihua Chen³, Yong Huang⁴, Qikai Cheng⁵

Affiliations

¹ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: zlahu@foxmail.com.
² School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: weilu@whu.edu.cn.
³ Department of Information Science, University of North Texas, Denton 76203, TX, USA. Electronic address: haihua.chen@unt.edu.
⁴ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: yonghuang1991@whu.edu.cn.
⁵ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: chengqikai0806@163.com.

PMID: 35661818
DOI: 10.1016/j.jbi.2022.104106

Free article

Review

A comparative evaluation of biomedical similar article recommendation

Li Zhang et al. J Biomed Inform. 2022 Jul.

Free article

. 2022 Jul:131:104106.

doi: 10.1016/j.jbi.2022.104106. Epub 2022 Jun 2.

Authors

Li Zhang¹, Wei Lu², Haihua Chen³, Yong Huang⁴, Qikai Cheng⁵

Affiliations

¹ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: zlahu@foxmail.com.
² School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: weilu@whu.edu.cn.
³ Department of Information Science, University of North Texas, Denton 76203, TX, USA. Electronic address: haihua.chen@unt.edu.
⁴ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: yonghuang1991@whu.edu.cn.
⁵ School of Information Management, Wuhan University, Wuhan 430074, Hubei Province, China. Electronic address: chengqikai0806@163.com.

PMID: 35661818
DOI: 10.1016/j.jbi.2022.104106

Abstract

Background: Biomedical sciences, with their focus on human health and disease, have attracted unprecedented attention in the 21st century. The proliferation of biomedical sciences has also led to a large number of scientific articles being produced, which makes it difficult for biomedical researchers to find relevant articles and hinders the dissemination of valuable discoveries. To bridge this gap, the research community has initiated the article recommendation task, with the aim of recommending articles to biomedical researchers automatically based on their research interests. Over the past two decades, many recommendation methods have been developed. However, an algorithm-level comparison and rigorous evaluation of the most important methods on a shared dataset is still lacking.

Method: In this study, we first investigate 15 methods for automated article recommendation in the biomedical domain. We then conduct an empirical evaluation of the 15 methods, including six term-based methods, two word embedding methods, three sentence embedding methods, two document embedding methods, and two BERT-based methods. These methods are evaluated in two scenarios: article-oriented recommenders and user-oriented recommenders, with two publicly available datasets: TREC 2005 Genomics and RELISH, respectively.

Results: Our experimental results show that the text representation models BERT and BioSenVec outperform many existing recommendation methods (e.g., BM25, PMRA, XPRC) and web-based recommendation systems (e.g., MScanner, MedlineRanker, BioReader) on both datasets regarding most of the evaluation metrics, and fine-tuning can improve the performance of the BERT-based methods.

Conclusions: Our comparison study is useful for researchers and practitioners in selecting the best modeling strategies for building article recommendation systems in the biomedical domain. The code and datasets are publicly available.

Keywords: BERT; Biomedical article recommendation; Methodological comparison; Model evaluation; Modeling strategy; Text representation.

PubMed Disclaimer

Cited by

A hybrid algorithm for clinical decision support in precision medicine based on machine learning.
Zhang Z, Lin X, Wu S. Zhang Z, et al. BMC Bioinformatics. 2023 Jan 3;24(1):3. doi: 10.1186/s12859-022-05116-9. BMC Bioinformatics. 2023. PMID: 36597033 Free PMC article.
Study on Muscle Fatigue Classification for Manual Lifting by Fusing sEMG and MMG Signals.
Wang Z, Guan X, Li D, Jiang C, Bai Y, Yang D, He L. Wang Z, et al. Sensors (Basel). 2025 Aug 13;25(16):5023. doi: 10.3390/s25165023. Sensors (Basel). 2025. PMID: 40871887 Free PMC article.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A comparative evaluation of biomedical similar article recommendation

Affiliations

A comparative evaluation of biomedical similar article recommendation

Authors

Affiliations

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Miscellaneous