Review

. 2022;12(1):129.

doi: 10.1007/s13278-022-00951-3. Epub 2022 Sep 5.

Detection and moderation of detrimental content on social media platforms: current status and future directions

Vaishali U Gongane¹, Mousami V Munot¹, Alwin D Anuse²

Affiliations

¹ E&TC Department, SCTR's Pune Institute of Computer Technology, SPPU, Pune, 411046 India.
² School of ECE, Dr Vishwanath Karad MIT_WPU, SPPU, Pune, 411038 India.

PMID: 36090695
PMCID: PMC9444091
DOI: 10.1007/s13278-022-00951-3

Review

Detection and moderation of detrimental content on social media platforms: current status and future directions

Vaishali U Gongane et al. Soc Netw Anal Min. 2022.

. 2022;12(1):129.

doi: 10.1007/s13278-022-00951-3. Epub 2022 Sep 5.

Authors

Vaishali U Gongane¹, Mousami V Munot¹, Alwin D Anuse²

Affiliations

¹ E&TC Department, SCTR's Pune Institute of Computer Technology, SPPU, Pune, 411046 India.
² School of ECE, Dr Vishwanath Karad MIT_WPU, SPPU, Pune, 411038 India.

PMID: 36090695
PMCID: PMC9444091
DOI: 10.1007/s13278-022-00951-3

Abstract

Social Media has become a vital component of every individual's life in society opening a preferred spectrum of virtual communication which provides an individual with a freedom to express their views and thoughts. While virtual communication through social media platforms is highly desirable and has become an inevitable component, the dark side of social media is observed in form of detrimental/objectionable content. The reported detrimental contents are fake news, rumors, hate speech, aggressive, and cyberbullying which raise up as a major concern in the society. Such detrimental content is affecting person's mental health and also resulted in loss which cannot be always recovered. So, detecting and moderating such content is a prime need of time. All social media platforms including Facebook, Twitter, and YouTube have made huge investments and also framed policies to detect and moderate such detrimental content. It is of paramount importance in the first place to detect such content. After successful detection, it should be moderated. With an overflowing increase in detrimental content on social media platforms, the current manual method to identify such content will never be enough. Manual and semi-automated moderation methods have reported limited success. A fully automated detection and moderation is a need of time to come up with the alarming detrimental content on social media. Artificial Intelligence (AI) has reached across all sectors and provided solutions to almost all problems, social media content detection and moderation is not an exception. So, AI-based methods like Natural Language Processing (NLP) with Machine Learning (ML) algorithms and Deep Neural Networks is rigorously deployed for detection and moderation of detrimental content on social media platforms. While detection of such content has been receiving good attention in the research community, moderation has received less attention. This research study spans into three parts wherein the first part emphasizes on the methods to detect the detrimental components using NLP. The second section describes about methods to moderate such content. The third part summarizes all observations to provide identified research gaps, unreported problems and provide research directions.

Keywords: Artificial intelligence (AI); Detection and moderation; Natural language processing (NLP); Social media (SM) platforms.

© The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

PubMed Disclaimer

Figures

**Fig.1**
Statistics of monthly active users on various social media platforms (https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/)

**Fig. 2**
Various forms of Detriment content published on SM

**Fig. 3**
Online abusive behavior experienced by teenagers

**Fig. 4**
Detection and moderation of UGC on SM platforms

**Fig. 5**
Flow chart of selection of articles for review

**Fig. 6**
AI-based techniques for detection of detrimental content on SM platforms

**Fig. 7**
A generic block diagram of automated SM content detection using NLP, ML and DL

**Fig. 8**
Process of detection and classification of a SM content using ML algorithms

**Fig. 9**
Statistics of ML algorithms for SM content Detection

**Fig. 10**
Example images in fake news articles

**Fig. 11**
Architecture of EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection (Wang et al. 2018)

**Fig. 12**
Architecture of SpotFake (Singhal et al. 2019)

**Fig. 13**
Neural network model architecture for multimodal hate speech classification Kumar et al. (2021)

**Fig. 14**
Manual Content moderation on SM Platforms

See this image and copyright information in PMC

Cited by

From triple-mode network to triple-layered model - novel insights in social cognition.
Dubey S, Sengupta S, Chatterjee S, Ghosh R, Dewasi S, Das S, Pandit A, Dubey MJ. Dubey S, et al. Indian J Psychiatry. 2025 Jul;67(7):710-720. doi: 10.4103/indianjpsychiatry_446_25. Epub 2025 Jul 15. Indian J Psychiatry. 2025. PMID: 40786216 Free PMC article.
The Impact of Social Media & Technology on Child and Adolescent Mental Health.
Masri-Zada T, Martirosyan S, Abdou A, Barbar R, Kades S, Makki H, Haley G, Agrawal DK. Masri-Zada T, et al. J Psychiatry Psychiatr Disord. 2025;9(2):111-130. Epub 2025 Apr 16. J Psychiatry Psychiatr Disord. 2025. PMID: 40520349 Free PMC article.
The Quality of MitraClip™ Content on YouTube.
Nus BM, Sledge T, Wu K, Saunders CS, Khalife W. Nus BM, et al. Cureus. 2023 Aug 21;15(8):e43881. doi: 10.7759/cureus.43881. eCollection 2023 Aug. Cureus. 2023. PMID: 37614823 Free PMC article.
#WhatIEatinaDay: The Quality, Accuracy, and Engagement of Nutrition Content on TikTok.
Zeng M, Grgurevic J, Diyab R, Roy R. Zeng M, et al. Nutrients. 2025 Feb 24;17(5):781. doi: 10.3390/nu17050781. Nutrients. 2025. PMID: 40077651 Free PMC article.
Social Media and Youth Mental Health: Scoping Review of Platform and Policy Recommendations.
Chhabra J, Pilkington V, Benakovic R, Wilson MJ, La Sala L, Seidler Z. Chhabra J, et al. J Med Internet Res. 2025 Jun 20;27:e72061. doi: 10.2196/72061. J Med Internet Res. 2025. PMID: 40540734 Free PMC article.

See all "Cited by" articles

References

1. Ahmed H, Traore I, Saad S (2017) Detection of online fake news using N-Gram analysis and machine learning techniques. In: Traore I, Woungang I, Awad A (eds) Intelligent, secure, and dependable systems in distributed and cloud environments. ISDDC. Lecture Notes in Computer Science, Vol. 10618, pp 127–138. 10.1007/978-3-319-69155-8_9.
1. Amrutha BR, Bindu KR (2019) Detecting hate speech in tweets using different deep neural network architectures. In: Proceedings of the international conference on intelligent computing and control systems (ICICCS 2019) IEEE, pp 923–926. 10.1109/ICCS45141.2019.9065763.
1. Andersen JS, Zukunft O, Maalej W (2021) REM: efficient semi-automated real-time moderation of online forums. In: Proceedings of the joint conference of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing: system demonstrations. pp 142–149.
1. Ayo FE, Folorunso O, Ibharalu FT, Osinuga IA. Machine learning techniques for hate speech classification of twitter data: State-of-the-art, future challenges and research directions. Comput Sci Rev Elsevier. 2020 doi: 10.1016/j.cosrev.2020.100311. - DOI
1. Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: 26th international conference on world wide web companion, Perth, Australia, pp 759–760. 10.1145/3041021.3054223.

Publication types

Actions

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Detection and moderation of detrimental content on social media platforms: current status and future directions

Affiliations

Detection and moderation of detrimental content on social media platforms: current status and future directions

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

LinkOut - more resources

Full Text Sources