One Hundred Years of Hypertension Research: Topic Modeling Study
- PMID: 35583933
- PMCID: PMC9161044
- DOI: 10.2196/31292
One Hundred Years of Hypertension Research: Topic Modeling Study
Abstract
Background: Due to scientific and technical advancements in the field, published hypertension research has developed substantially during the last decade. Given the amount of scientific material published in this field, identifying the relevant information is difficult. We used topic modeling, which is a strong approach for extracting useful information from enormous amounts of unstructured text.
Objective: This study aims to use a machine learning algorithm to uncover hidden topics and subtopics from 100 years of peer-reviewed hypertension publications and identify temporal trends.
Methods: The titles and abstracts of hypertension papers indexed in PubMed were examined. We used the latent Dirichlet allocation model to select 20 primary subjects and then ran a trend analysis to see how popular they were over time.
Results: We gathered 581,750 hypertension-related research articles from 1900 to 2018 and divided them into 20 topics. These topics were broadly categorized as preclinical, epidemiology, complications, and therapy studies. Topic 2 (evidence review) and topic 19 (major cardiovascular events) are the key (hot topics). Most of the cardiopulmonary disease subtopics show little variation over time, and only make a small contribution in terms of proportions. The majority of the articles (414,206/581,750; 71.2%) had a negative valency, followed by positive (119, 841/581,750; 20.6%) and neutral valency (47,704/581,750; 8.2%). Between 1980 and 2000, negative sentiment articles fell somewhat, while positive and neutral sentiment articles climbed substantially.
Conclusions: The number of publications has been increasing exponentially over the period. Most of the uncovered topics can be grouped into four categories (ie, preclinical, epidemiology, complications, and treatment-related studies).
Keywords: LDA; cardiovascular; high blood pressure; hypertension; latent Dirichlet allocation; machine learning; research trends; topic modeling.
©Mustapha Abba, Chidozie Nduka, Seun Anjorin, Shukri Mohamed, Emmanuel Agogo, Olalekan Uthman. Originally published in JMIR Formative Research (https://formative.jmir.org), 18.05.2022.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
References
-
- Devos P, Menard J. Bibliometric analysis of research relating to hypertension reported over the period 1997-2016. J Hypertens. 2019 Nov;37(11):2116–2122. doi: 10.1097/HJH.0000000000002143. http://europepmc.org/abstract/MED/31136459 - DOI - PMC - PubMed
-
- Mills KT, Stefanescu A, He J. The global epidemiology of hypertension. Nat Rev Nephrol. 2020 Apr;16(4):223–237. doi: 10.1038/s41581-019-0244-2. http://europepmc.org/abstract/MED/32024986 10.1038/s41581-019-0244-2 - DOI - PMC - PubMed
-
- A global brief on hypertension: Silent killer, global public health crisis. World Health Organisation. 2013. [2020-12-20]. https://www.who.int/publications/i/item/a-global-brief-on-hypertension-s... .
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous