Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 4;14(1):15443.
doi: 10.1038/s41598-024-63649-w.

Fake views removal and popularity on YouTube

Affiliations

Fake views removal and popularity on YouTube

Maria Castaldo et al. Sci Rep. .

Abstract

This paper analyses how YouTube authenticates engagement metrics and, more specifically, how the platform corrects view counts by removing "fake views" (i.e., views considered artificial or illegitimate by the platform). Working with one and a half years of data extracted from a thousand French YouTube channels, we show the massive extent of the corrections done by YouTube, which concern the large majority of the channels and over 78% of the videos in our corpus. Our analysis shows that corrections are not done continuously as videos collect new views, but instead occur in batches, generally around 5 p.m. every day. More significantly, most corrections occur relatively late in the life of the videos, after they have reached most of their audience, and the delay in correction is not independent of the final popularity of videos: videos corrected later in their life are more popular on average than those corrected earlier. We discuss the probable causes of this phenomenon and its possible negative consequences on content diffusion. By inflating view counts, fake views could make videos appear more popular than they are and unwarrantedly encourage their recommendation, thus potentially altering the public debate on the platform. This could have implications on the spread of online misinformation, but their in-depth exploration requires first-hand information on view corrections, which YouTube does not provide through its API. This paper presents a series of experimental techniques to work around this limitation, offering a practical contribution to the study of online attention cycles (as described in the "Data and methods" section). At the same time, this paper is also a call for greater transparency by YouTube and other online platforms about information with crucial implications for the quality of online debate.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
(A) 5 sample videos and their hourly evolution of views. (B) Fraction of corrections over real views (histogram) and number of views (blue line) for the 20 most corrected channels in terms of fraction of corrections. (C) Fraction of corrections over real views (histogram) and number of views (blue line) for the 20 most viewed channels in the dataset. (D) Lorenz curve of the distribution of views and corrections among different videos. (E) Percentage of videos affected by corrections.
Figure 2
Figure 2
(A) Distribution of interventions per hour of the day. (B) Distribution of corrections per hour of the day. (C) Distribution of views per hour of the day. (D) Normalized number of interventions, corrections and views per hour following publication. (E) Normalized number of interventions, corrections and views per day after publication.
Figure 3
Figure 3
(A) Fraction of corrections occurring after different percentages of real views. (B) Correlation between real and fake views per channel.
Figure 4
Figure 4
(A) Distribution of interventions in the days following the publication. (B) Distribution of popularity for videos corrected earlier, later and uncorrected.
Figure 5
Figure 5
Error introduced by the benchmark method, varying the time window.
Figure 6
Figure 6
XGBoost parameter tuning. Performances in terms of F1 score associated with different combinations of parameters’ values.
Figure 7
Figure 7
Consistency with 5-min dataset. (AC) Analysis on the distribution of intervention and corrections. (D,E) Analysis on the relation between speed of correction and popularity.

References

    1. Google - YouTube Terms of Service. How engagement metrics are counted (2022). https://support.google.com/YouTube/answer/2991785?hl=en%E2%80%8B. Accessed 12 July 2022.
    1. Gayle, D. YouTube cancels billions of music industry video views after finding they were fake or ’dead’. Daily Mail (2012). https://www.dailymail.co.uk/sciencetech/article-2254181/YouTube-wipes-bi.... Accessed 12 July 2022.
    1. Hoffberger, C. YouTube strips universal and sony of 2 billion fake views (2012). https://www.dailydot.com/unclick/YouTube-universal-sony-fake-views-black.... Accessed 12 July 2022.
    1. Fake YouTube views cut by 2 billion as google audits record companies’ video channels (2012). https://www.huffpost.com/entry/fake-youtube-views-cut-google-audit_n_238.... Accessed 16 June 2022.
    1. Dredge, S. Google goes to war on ’fraudulent’ YouTube video views. The Guardian (2014). http://www.theguardian.com/technology/2014/feb/05/YouTube-fake-views-cou.... 12 Accessed July 2022.

LinkOut - more resources