LASSO Regression Modeling on Prediction of Medical Terms among Seafarers' Health Documents Using Tidy Text Mining
- PMID: 35324813
- PMCID: PMC8945331
- DOI: 10.3390/bioengineering9030124
LASSO Regression Modeling on Prediction of Medical Terms among Seafarers' Health Documents Using Tidy Text Mining
Abstract
Generally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through telemedicine by receiving better symptomatic and clinical details in the health abstracts of seafarers. The adoption of text mining techniques can assist in extracting diagnostic information from clinical texts. We applied lexicon sentimental analysis to explore the automatic labeling of positive and negative healthcare terms to seafarers' text healthcare documents. This was due to the lack of experimental evaluations using computational techniques. In order to classify diseases and their associated symptoms, the LASSO regression algorithm is applied to analyze these text documents. A visualization of symptomatic data frequency for each disease can be achieved by analyzing TF-IDF values. The proposed approach allows for the classification of text documents with 93.8% accuracy by using a machine learning model called LASSO regression. It is possible to classify text documents effectively with tidy text mining libraries. In addition to delivering health assistance, this method can be used to classify diseases and establish health observatories. Knowledge developed in the present work will be applied to establish an Epidemiological Observatory of Seafarers' Pathologies and Injuries. This Observatory will be a collaborative initiative of the Italian Ministry of Health, University of Camerino, and International Radio Medical Centre (C.I.R.M.), the Italian TMAS.
Keywords: correlations; disease mapping; lasso regression; seafarers; text mining.
Conflict of interest statement
No author has any conflict during the preparation and publication of the manuscript.
Figures








Similar articles
-
The impact of the COVID-19 pandemic on seafarers' mental health and chronic fatigue: Beneficial effects of onboard peer support, external support and Internet access.Mar Policy. 2022 Mar;137:104942. doi: 10.1016/j.marpol.2021.104942. Epub 2022 Jan 6. Mar Policy. 2022. PMID: 35013636 Free PMC article.
-
Mental health of Filipino seafarers and its implications for seafarers' education.Int Marit Health. 2021;72(3):183-192. doi: 10.5603/IMH.2021.0035. Int Marit Health. 2021. PMID: 34604987 Review.
-
The anti-therapeutic effects of workers' compensation in China: The case of seafarers.Int J Law Psychiatry. 2018 May-Jun;58:97-104. doi: 10.1016/j.ijlp.2018.02.011. Epub 2018 Apr 13. Int J Law Psychiatry. 2018. PMID: 29853019
-
Women seafarers' health and welfare survey.Int Marit Health. 2015;66(3):123-38. doi: 10.5603/IMH.2015.0027. Int Marit Health. 2015. PMID: 26394312
-
The Use of Radio and Telemedicine by TMAS Centers in Provision of Medical Care to Seafarers: A Systematic Review.J Pers Med. 2023 Jul 22;13(7):1171. doi: 10.3390/jpm13071171. J Pers Med. 2023. PMID: 37511784 Free PMC article. Review.
Cited by
-
Analyzing Community Care Research Trends Using Text Mining.J Multidiscip Healthc. 2022 Jul 15;15:1493-1510. doi: 10.2147/JMDH.S366726. eCollection 2022. J Multidiscip Healthc. 2022. PMID: 35873091 Free PMC article.
-
Integrated bioinformatics analysis of noncoding RNAs with tumor immune microenvironment in gastric cancer.Sci Rep. 2023 Sep 11;13(1):15006. doi: 10.1038/s41598-023-41444-3. Sci Rep. 2023. PMID: 37696973 Free PMC article.
-
Using automated text classification to explore uncertainty in NICE appraisals for drugs for rare diseases.Int J Technol Assess Health Care. 2024 Jan 5;40(1):e5. doi: 10.1017/S0266462323002805. Int J Technol Assess Health Care. 2024. PMID: 38178720 Free PMC article.
-
A Multi-omics approach to identify and validate shared genetic architecture in rheumatoid arthritis, multiple sclerosis, and type 1 diabetes: integrating GWAS, GEO, MSigDB, and scRNA-seq data.Funct Integr Genomics. 2025 Apr 21;25(1):91. doi: 10.1007/s10142-025-01598-x. Funct Integr Genomics. 2025. PMID: 40254686 Free PMC article.
-
Plasma amino acid profiles of dogs with the hepatocutaneous syndrome and dogs with other chronic liver diseases.J Vet Intern Med. 2025 Jan-Feb;39(1):e17285. doi: 10.1111/jvim.17285. J Vet Intern Med. 2025. PMID: 39831315 Free PMC article.
References
-
- Caruso G. Do seafarers have sunshine; Proceedings of the 8th International Symposium on Maritime Health (ISMH) Book of Abstracts; Rijeka, Croatia. 8–13 May 2005.
LinkOut - more resources
Full Text Sources
Miscellaneous