Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul:231:106447.
doi: 10.1016/j.actatropica.2022.106447. Epub 2022 Apr 14.

Text mining in mosquito-borne disease: A systematic review

Affiliations

Text mining in mosquito-borne disease: A systematic review

Song-Quan Ong et al. Acta Trop. 2022 Jul.

Abstract

Mosquito-borne diseases are emerging and re-emerging across the globe, especially after the COVID19 pandemic. The recent advances in text mining in infectious diseases hold the potential of providing timely access to explicit and implicit associations among information in the text. In the past few years, the availability of online text data in the form of unstructured or semi-structured text with rich content of information from this domain enables many studies to provide solutions in this area, e.g., disease-related knowledge discovery, disease surveillance, early detection system, etc. However, a recent review of text mining in the domain of mosquito-borne disease was not available to the best of our knowledge. In this review, we survey the recent works in the text mining techniques used in combating mosquito-borne diseases. We highlight the corpus sources, technologies, applications, and the challenges faced by the studies, followed by the possible future directions that can be taken further in this domain. We present a bibliometric analysis of the 294 scientific articles that have been published in Scopus and PubMed in the domain of text mining in mosquito-borne diseases, from the year 2016 to 2021. The papers were further filtered and reviewed based on the techniques used to analyze the text related to mosquito-borne diseases. Based on the corpus of 158 selected articles, we found 27 of the articles were relevant and used text mining in mosquito-borne diseases. These articles covered the majority of Zika (38.70%), Dengue (32.26%), and Malaria (29.03%), with extremely low numbers or none of the other crucial mosquito-borne diseases like chikungunya, yellow fever, West Nile fever. Twitter was the dominant corpus resource to perform text mining in mosquito-borne diseases, followed by PubMed and LexisNexis databases. Sentiment analysis was the most popular technique of text mining to understand the discourse of the disease and followed by information extraction, which dependency relation and co-occurrence-based approach to extract relations and events. Surveillance was the main usage of most of the reviewed studies and followed by treatment, which focused on the drug-disease or symptom-disease association. The advance in text mining could improve the management of mosquito-borne diseases. However, the technique and application posed many limitations and challenges, including biases like user authentication and language, real-world implementation, etc. We discussed the future direction which can be useful to expand this area and domain. This review paper contributes mainly as a library for text mining in mosquito-borne diseases and could further explore the system for other neglected diseases.

Keywords: Dengue; Malaria; Mosquito-borne; Text analysis; Vector-borne; Zika.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig 1
Fig. 1
Search methodology for the selection of relevant articles.
Fig 2
Fig. 2
Trends and articles relevant percentage for the databases used in this review for the articles that using text mining in mosquito-borne disease from 2016 to 2021.

Similar articles

Cited by

References

    1. Abulaish M., Parwez M.D.A., Jahiruddin DiseaSE: a biomedical text analytics system for disease symptom extraction and characterization. J. Biomed. Inform. 2019;100 doi: 10.1016/j.jbi.2019.103324. Dec. - DOI - PubMed
    1. Aggarwal U., Aggarwal G. Sentiment analysis: a survey. Int. J. Comput. Sci. Eng. 2017;5(5) Open Access Surv. Pap.
    1. Boit J., El-Gayar O. Topical mining of malaria using social media. A text mining approach. Fac. Res. Publ. 2020 https://scholar.dsu.edu/bispapers/222/ Jan.Accessed: Dec. 10, 2021. [Online]. Available:
    1. Carlos M.A., Nogueira M., Machado RJ. Proceedings of the 4th International Conference on Systems and Informatics (ICSAI) IEEE; 2017. Analysis of dengue outbreaks using big data analytics and social networks; pp. 1592–1597. Nov 11.
    1. Cohen K.B., Hunter L. Getting started in text mining. PLoS Comput. Biol. 2008;4(1):e20. doi: 10.1371/journal.pcbi.0040020. - DOI - PMC - PubMed

Publication types