Text mining in mosquito-borne disease: A systematic review
- PMID: 35430265
- PMCID: PMC9663275
- DOI: 10.1016/j.actatropica.2022.106447
Abstract
Mosquito-borne diseases are emerging and re-emerging across the globe, especially since the COVID-19 pandemic. Recent advances in text mining for infectious diseases have the potential to provide timely access to explicit and implicit associations among pieces of information in text. In the past few years, the availability of online text data in unstructured or semi-structured form, rich in information from this domain, has enabled many studies to provide solutions in this area, e.g., disease-related knowledge discovery, disease surveillance, and early detection systems. However, to the best of our knowledge, no recent review of text mining in the domain of mosquito-borne disease was available. In this review, we survey recent work on the text mining techniques used in combating mosquito-borne diseases. We highlight the corpus sources, technologies, applications, and challenges faced by the studies, followed by possible future directions for this domain. We present a bibliometric analysis of the 294 scientific articles indexed in Scopus and PubMed in the domain of text mining in mosquito-borne diseases from 2016 to 2021. The papers were further filtered and reviewed based on the techniques used to analyze text related to mosquito-borne diseases. From the corpus of 158 selected articles, we found that 27 were relevant and used text mining in mosquito-borne diseases. These articles mostly covered Zika (38.70%), dengue (32.26%), and malaria (29.03%), with very few or no articles on other crucial mosquito-borne diseases such as chikungunya, yellow fever, and West Nile fever. Twitter was the dominant corpus source for text mining in mosquito-borne diseases, followed by the PubMed and LexisNexis databases.
Sentiment analysis was the most popular text mining technique for understanding the discourse around a disease, followed by information extraction, which used dependency-relation and co-occurrence-based approaches to extract relations and events. Surveillance was the main application in most of the reviewed studies, followed by treatment, which focused on drug-disease or symptom-disease associations. Advances in text mining could improve the management of mosquito-borne diseases. However, the techniques and applications pose many limitations and challenges, including biases related to user authentication and language, and difficulties of real-world implementation. We discuss future directions that could help expand this area. This review contributes mainly as a library of text mining work in mosquito-borne diseases, and the approach could be extended to other neglected diseases.
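As an illustration of the co-occurrence-based approach to symptom-disease association mentioned above, the following is a minimal sketch in Python. It is not taken from any of the reviewed studies: the term lists, the sample sentences, and the function name `cooccurrence_counts` are hypothetical and chosen only for demonstration. It assumes that a disease and a symptom are "associated" whenever both terms appear in the same sentence.

```python
import re
from collections import Counter
from itertools import product

# Hypothetical term lists, for illustration only.
DISEASES = {"dengue", "malaria", "zika"}
SYMPTOMS = {"fever", "rash", "headache"}


def cooccurrence_counts(sentences):
    """Count how often a disease term and a symptom term share a sentence."""
    counts = Counter()
    for sentence in sentences:
        # Lowercase and keep alphabetic tokens only; a set avoids
        # counting a repeated term twice within one sentence.
        tokens = set(re.findall(r"[a-z]+", sentence.lower()))
        for disease, symptom in product(tokens & DISEASES, tokens & SYMPTOMS):
            counts[(disease, symptom)] += 1
    return counts


# Hypothetical sample texts standing in for tweets or abstracts.
sample = [
    "Dengue often presents with fever and rash.",
    "High fever again, worried it is dengue.",
    "Zika rash reported in the neighbourhood.",
    "Mosquito nets help prevent malaria.",
]

pairs = cooccurrence_counts(sample)
for (disease, symptom), n in sorted(pairs.items()):
    print(f"{disease} - {symptom}: {n}")
```

Raw counts like these are only a starting point. A real pipeline would likely normalize terms against a controlled vocabulary and weight pairs by a statistical association measure before treating them as candidate relations.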
Keywords: Dengue; Malaria; Mosquito-borne; Text analysis; Vector-borne; Zika.
Copyright © 2022. Published by Elsevier B.V.
Conflict of interest statement
The authors declare no competing interests.