Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Dec 20;10(12):230964.
doi: 10.1098/rsos.230964. eCollection 2023 Dec.

Multimodal analysis of disinformation and misinformation

Affiliations
Review

Multimodal analysis of disinformation and misinformation

Anna Wilson et al. R Soc Open Sci. .

Abstract

The use of disinformation and misinformation campaigns in the media has attracted much attention from academics and policy-makers. Multimodal analysis or the analysis of two or more semiotic systems-language, gestures, images, sounds, among others-in their interrelation and interaction is essential to understanding dis-/misinformation efforts because most human communication goes beyond just words. There is a confluence of many disciplines (e.g. computer science, linguistics, political science, communication studies) that are developing methods and analytical models of multimodal communication. This literature review brings research strands from these disciplines together, providing a map of the multi- and interdisciplinary landscape for multimodal analysis of dis-/misinformation. It records the substantial growth starting from the second quarter of 2020-the start of the COVID-19 epidemic in Western Europe-in the number of studies on multimodal dis-/misinformation coming from the field of computer science. The review examines that category of studies in more detail. Finally, the review identifies gaps in multimodal research on dis-/misinformation and suggests ways to bridge these gaps including future cross-disciplinary research directions. Our review provides scholars from different disciplines working on dis-/misinformation with a much needed bird's-eye view of the rapidly emerging research of multimodal dis-/misinformation.

Keywords: literature review; machine learning; multimodal dis-/misinformation; qualitative analysis.

PubMed Disclaimer

Conflict of interest statement

We declare we have no competing interests.

Figures

Figure 1.
Figure 1.
PRISMA statement. The number of records included at each step are shown.
Figure 2.
Figure 2.
(a) Number of records published each year (1900 to 31 March 2020) within scope of Stage 1. (b) The number of records for each type of document.
Figure 3.
Figure 3.
The number of records by the same author (a) or records published in the same source (b).
Figure 4.
Figure 4.
Subject breakdown. The breakdown of subjects for all eligible records (a) and the breakdown into sub-subjects for the records in the social sciences (b).
Figure 5.
Figure 5.
Methodology. The distribution of subjects for content-specific eligible records (a), and the type of methodology used in these records (b).
Figure 6.
Figure 6.
The 20 most common words in the abstracts after stop words were removed.
Figure 7.
Figure 7.
Coherence scores for LDA models for different numbers of topics (minimum 2). The red-highlighted points indicate possible choices of topic numbers.
Figure 8.
Figure 8.
The frequency with which each topic was the primary topic in an abstract (a) and the normalized frequency with which topics co-occurred with other topics (b).
Figure 9.
Figure 9.
The frequency with which each topic was the primary topic in an abstract.
Figure 10.
Figure 10.
This chart compares the number of CS versus non CS publications across both Stages 1 and 2 using the stricter selection criteria outlined in Stage 2. All papers were screened to check that they were applicable to dis-/misinformation; the earliest example dated from 2015.
Figure 11.
Figure 11.
This is a general schematic of a bimodal disinformation detector. The blue boxes labelled 1 and 2 are modules that process each modality of the input data. The role of module 3 is to pass information, no matter how complex, between modalities. The outputs of these modules are first aggregated before then being passed into a black box classifier; concatenation and ‘multi-layer perceptrons’ were the most commonly observed ML methods, respectively.
Figure 12.
Figure 12.
Shown here is where the data originates and how these sources mix. The breakdowns were chosen to highlight the sorts of disinformation studied. A source was counted if the paper’s data source was distinct (e.g. datasets ‘X’ and ‘Y’ count as two). We counted 91 instances of sources, hence an average mixing of 1.25 sources per paper—largely independent. (a) Depicting the mixing of data sources and (b) breakdown of ‘Miscellaneous’ in (a).

Similar articles

References

    1. Altay S, Berriche M, Acerbi A. 2023. Misinformation on misinformation: conceptual and methodological challenges. Soc. Media Soc. 9, 1-13.
    1. Pennycook G, Rand DG. 2021. The psychology of fake news. Trends Cogn. Sci. 25, 388-402. (10.1016/j.tics.2021.02.007) - DOI - PubMed
    1. Pantazi M, Hale S, Klein O. 2021. Social and cognitive aspects of the vulnerability to political misinformation. Pol. Psychol. 42, 267-304. (10.1111/pops.12797) - DOI
    1. Verrall N, Mason D. 2018. The taming of the shrewd. RUSI J. 163, 20-28. (10.1080/03071847.2018.1445169) - DOI
    1. Lewandowsky S, Ecker UK, Cook J. 2017. Beyond misinformation: understanding and coping with the ‘Post-Truth’ Era. J. Appl. Res. Mem. Cogn. 6, 353-369. (10.1016/j.jarmac.2017.07.008) - DOI

LinkOut - more resources