A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

doi:10.1155/2022/6561622

. 2022 Sep 14:2022:6561622.

doi: 10.1155/2022/6561622. eCollection 2022.

A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

Affiliations

¹ School of Software, Tsinghua University, Beijing, China.
² Government Degree College for Women, Bosan Road, Multan, Pakistan.
³ Department of Creative Technologies, Faculty of Computing and Artificial Intelligence, Air University, Islamabad, Pakistan.
⁴ EIAS-Data Science and Blockchain Laboratory, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia.
⁵ Department of Computer Sciences, Bahria University Karachi Campus, Karachi, Pakistan.
⁶ Applied College, Abqaiq Branch, King Faisal University, P.O. Box 4000, Al-Ahsa 31982, Hofuf, Saudi Arabia.
⁷ School of Software Engineering, South China University of Technology, Guangzhou, China.
⁸ Department of Software Engineering, University of Science and Technology Bannu, Bannu, Pakistan.
⁹ Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Islamabad, Pakistan.

PMID: 36156967
PMCID: PMC9492356
DOI: 10.1155/2022/6561622

A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

Rubia Fatima et al. Comput Intell Neurosci. 2022.

. 2022 Sep 14:2022:6561622.

doi: 10.1155/2022/6561622. eCollection 2022.

Authors

Affiliations

¹ School of Software, Tsinghua University, Beijing, China.
² Government Degree College for Women, Bosan Road, Multan, Pakistan.
³ Department of Creative Technologies, Faculty of Computing and Artificial Intelligence, Air University, Islamabad, Pakistan.
⁴ EIAS-Data Science and Blockchain Laboratory, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia.
⁵ Department of Computer Sciences, Bahria University Karachi Campus, Karachi, Pakistan.
⁶ Applied College, Abqaiq Branch, King Faisal University, P.O. Box 4000, Al-Ahsa 31982, Hofuf, Saudi Arabia.
⁷ School of Software Engineering, South China University of Technology, Guangzhou, China.
⁸ Department of Software Engineering, University of Science and Technology Bannu, Bannu, Pakistan.
⁹ Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Islamabad, Pakistan.

PMID: 36156967
PMCID: PMC9492356
DOI: 10.1155/2022/6561622

Abstract

Context and Background: Since December 2019, the coronavirus (COVID-19) epidemic has sparked considerable alarm among the general community and significantly affected societal attitudes and perceptions. Apart from the disease itself, many people suffer from anxiety and depression due to the disease and the present threat of an outbreak. Due to the fast propagation of the virus and misleading/fake information, the issues of public discourse alter, resulting in significant confusion in certain places. Rumours are unproven facts or stories that propagate and promote sentiments of prejudice, hatred, and fear. Objective. The study's objective is to propose a novel solution to detect fake news using state-of-the-art machines and deep learning models. Furthermore, to analyse which models outperformed in detecting the fake news. Method. In the research study, we adapted a COVID-19 rumours dataset, which incorporates rumours from news websites and tweets, together with information about the rumours. It is important to analyse data utilizing Natural Language Processing (NLP) and Deep Learning (DL) approaches. Based on the accuracy, precision, recall, and the f1 score, we can assess the effectiveness of the ML and DL algorithms. Results. The data adopted from the source (mentioned in the paper) have collected 9200 comments from Google and 34,779 Twitter postings filtered for phrases connected with COVID-19-related fake news. Experiment 1. The dataset was assessed using the following three criteria: veracity, stance, and sentiment. In these terms, we have different labels, and we have applied the DL algorithms separately to each term. We have used different models in the experiment such as (i) LSTM and (ii) Temporal Convolution Networks (TCN). The TCN model has more performance on each measurement parameter in the evaluated results. So, we have used the TCN model for the practical implication for better findings. Experiment 2. In the second experiment, we have used different state-of-the-art deep learning models and algorithms such as (i) Simple RNN; (ii) LSTM + Word Embedding; (iii) Bidirectional + Word Embedding; (iv) LSTM + CNN-1D; and (v) BERT. Furthermore, we have evaluated the performance of these models on all three datasets, e.g., veracity, stance, and sentiment. Based on our second experimental evaluation, the BERT has a superior performance over the other models compared.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflicts of interest.

Figures

**Figure 1**
Examples of data structure adopted from [24].

**Figure 2**
COVID-19 rumour identification using AI techniques.

**Figure 3**
Flowchart depicting the collecting, labelling, and postprocessing of datasets.

**Figure 4**
Labels in the dataset are highlighted—adopted from [24].

**Figure 5**
Schematic of the long short-term memory cell—adopted from [47].

**Figure 6**
Schematic of the Temporal Convolutional Networks—adopted from [50].

**Figure 7**
LSTM—the graphical representation of evaluating results for sentiment dataset.

**Figure 8**
TCN—the graphical representation of evaluating results for sentiment dataset.

**Figure 9**
LSTM—the graphical representation of evaluating results for veracity dataset.

**Figure 10**
TCN—the graphical representation of evaluating results for veracity dataset.

**Figure 11**
LSTM—the graphical representation of evaluating results for stance dataset.

**Figure 12**
TCN—the graphical representation of evaluating results for stance dataset.

**Figure 13**
Summary of simple RNN model.

**Figure 14**
Summary of LSTM + Word Embedding model.

**Figure 15**
Summary of Bidirectional + Word Embedding model.

**Figure 16**
Summary of LSTM + CNN-1D model.

See this image and copyright information in PMC

Cited by

Humans are still better than ChatGPT: Case of the IEEEXtreme competition.
Koubaa A, Qureshi B, Ammar A, Khan Z, Boulila W, Ghouti L. Koubaa A, et al. Heliyon. 2023 Oct 29;9(11):e21624. doi: 10.1016/j.heliyon.2023.e21624. eCollection 2023 Nov. Heliyon. 2023. PMID: 37954270 Free PMC article.
A Robust End-to-End Deep Learning-Based Approach for Effective and Reliable BTD Using MR Images.
Ullah N, Khan MS, Khan JA, Choi A, Anwar MS. Ullah N, et al. Sensors (Basel). 2022 Oct 6;22(19):7575. doi: 10.3390/s22197575. Sensors (Basel). 2022. PMID: 36236674 Free PMC article.
TumorDetNet: A unified deep learning model for brain tumor detection and classification.
Ullah N, Javed A, Alhazmi A, Hasnain SM, Tahir A, Ashraf R. Ullah N, et al. PLoS One. 2023 Sep 27;18(9):e0291200. doi: 10.1371/journal.pone.0291200. eCollection 2023. PLoS One. 2023. PMID: 37756305 Free PMC article.

References

1. Roosa K., Lee Y., Luo R., et al. Real-time forecasts of the COVID-19 epidemic in China from february 5th to february 24th, 2020. Infectious Disease Modelling . 2020;5:256–263. doi: 10.1016/j.idm.2020.02.002. - DOI - PMC - PubMed
1. Yan L., Zhang H. T., Xiao Y., et al. Prediction of Criticality in Patients with Severe Covid-19 Infection Using Three Clinical Features: A Machine Learning-Based Prognostic Model with Clinical Data in Wuhan . New Haven, CT, USA: medRxiv; 2020.
1. Bernard Stoecklin S., Rolland P., Silue Y., et al. First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020. Euro Surveillance . 2020;25(6) doi: 10.2807/1560-7917.es.2020.25.6.2000094.2000094 - DOI - PMC - PubMed
1. Huang C., Wang Y., Li X., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet . 2020;395(10223):497–506. doi: 10.1016/s0140-6736(20)30183-5. - DOI - PMC - PubMed
1. Mahase E. Coronavirus: Covid-19 Has Killed More People than SARS and MERS Combined. despite lower case fatality rate . 2020;368 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

[1] Roosa K., Lee Y., Luo R., et al. Real-time forecasts of the COVID-19 epidemic in China from february 5th to february 24th, 2020. Infectious Disease Modelling . 2020;5:256–263. doi: 10.1016/j.idm.2020.02.002. - DOI - PMC - PubMed

[2] Roosa K., Lee Y., Luo R., et al. Real-time forecasts of the COVID-19 epidemic in China from february 5th to february 24th, 2020. Infectious Disease Modelling . 2020;5:256–263. doi: 10.1016/j.idm.2020.02.002. - DOI - PMC - PubMed

[3] Yan L., Zhang H. T., Xiao Y., et al. Prediction of Criticality in Patients with Severe Covid-19 Infection Using Three Clinical Features: A Machine Learning-Based Prognostic Model with Clinical Data in Wuhan . New Haven, CT, USA: medRxiv; 2020.

[4] Yan L., Zhang H. T., Xiao Y., et al. Prediction of Criticality in Patients with Severe Covid-19 Infection Using Three Clinical Features: A Machine Learning-Based Prognostic Model with Clinical Data in Wuhan . New Haven, CT, USA: medRxiv; 2020.

[5] Bernard Stoecklin S., Rolland P., Silue Y., et al. First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020. Euro Surveillance . 2020;25(6) doi: 10.2807/1560-7917.es.2020.25.6.2000094.2000094 - DOI - PMC - PubMed

[6] Bernard Stoecklin S., Rolland P., Silue Y., et al. First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020. Euro Surveillance . 2020;25(6) doi: 10.2807/1560-7917.es.2020.25.6.2000094.2000094 - DOI - PMC - PubMed

[7] Huang C., Wang Y., Li X., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet . 2020;395(10223):497–506. doi: 10.1016/s0140-6736(20)30183-5. - DOI - PMC - PubMed

[8] Huang C., Wang Y., Li X., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet . 2020;395(10223):497–506. doi: 10.1016/s0140-6736(20)30183-5. - DOI - PMC - PubMed

[9] Mahase E. Coronavirus: Covid-19 Has Killed More People than SARS and MERS Combined. despite lower case fatality rate . 2020;368 - PubMed

[10] Mahase E. Coronavirus: Covid-19 Has Killed More People than SARS and MERS Combined. despite lower case fatality rate . 2020;368 - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

Affiliations

A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

LinkOut - more resources

Full Text Sources

Medical