Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Jun 3:arXiv:2506.03321v1.

Enhancing Automatic PT Tagging for MEDLINE Citations Using Transformer-Based Models

Affiliations

Enhancing Automatic PT Tagging for MEDLINE Citations Using Transformer-Based Models

Victor H Cid et al. ArXiv. .

Abstract

We investigated the feasibility of predicting Medical Subject Headings (MeSH) Publication Types (PTs) from MEDLINE citation metadata using pre-trained Transformer-based models BERT and DistilBERT. This study addresses limitations in the current automated indexing process, which relies on legacy NLP algorithms. We evaluated monolithic multi-label classifiers and binary classifier ensembles to enhance the retrieval of biomedical literature. Results demonstrate the potential of Transformer models to significantly improve PT tagging accuracy, paving the way for scalable, efficient biomedical indexing.

Keywords: MEDLINE; Machine Learning; MeSH Publication Types; Natural Language Processing; Pre-trained Foundation Models.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
MDELINE Publication Type tag correlations.
Figure 2:
Figure 2:
Monolithic multilabel classifier architecture.
Figure 3:
Figure 3:
Ensemble of binary classifiers architecture, with example PT class recall scores.

Similar articles

References

    1. NLM, “Publication Characteristics (Publication Types) with Scope Notes,” 22 December 2023. [Online]. Available: https://www.nlm.nih.gov/mesh/pubtypes.html. [Accessed 3 June 2024].
    1. NLM, “MEDLINE/PubMed Data Element (Field) Descriptions.,” [Online]. Available: https://www.nlm.nih.gov/bsd/mms/medlineelements.html. [Accessed 3 June 2024].
    1. Schneider J., Hoang L., Y. K. and C. A.M., “Evaluation of publication type tagging as a strategy to screen randomized controlled trial articles in preparing systematic reviews,” JAMIA Open, vol. 5, no. 1, p. ooac015, 30 March 2022. - PMC - PubMed
    1. Proescholdt R., Hsiao T. K., Schneider J., Cohen A. M., McDonagh M. S. and Smalheiser N. R., “Testing a filtering strategy for systematic reviews: evaluating work savings and recall.,” AMIA Annual Symposium Proceedings, pp. 406–413, 2022. - PMC - PubMed
    1. Barnes J., Abbot N. C., F. H. E. and Ernst E., “Articles on Complementary Medicine in the Mainstream Medical Literature: An Investigation of MEDLINE, 1966 through 1996.,” Arch Intern Med, vol. 159, no. 15, pp. 1721–1725, 1999. - PubMed

Publication types

LinkOut - more resources