J Appl Stat. 2020 Aug 10;49(1):214-229.
doi: 10.1080/02664763.2020.1803810. eCollection 2022.

A model-based approach to Spotify data analysis: a Beta GLMM

Mariangela Sciandra et al. J Appl Stat.

Abstract

Digital music distribution is increasingly powered by automated mechanisms that continuously capture, sort and analyze large amounts of Web-based data. This paper deals with the management of songs' audio features from a statistical point of view. In particular, it explores the data-collection mechanisms enabled by the Spotify Web API and suggests statistical tools for the analysis of these data. Special attention is devoted to song popularity, and a Beta model with random effects is proposed to give a first answer to questions such as: what are the determinants of popularity? Identifying a model able to describe this relationship, and determining which characteristics matter most in making a song popular, is of great interest to those who aim to predict the success of new products.
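
As an illustration of what a Beta model with random effects can look like, the following is a minimal Python sketch of the conditional log-likelihood of a Beta regression with a random intercept. It is not the authors' code. Several choices here are assumptions that the abstract does not state: the logit link, the mean-precision parameterization, the grouping of songs by album, the rescaling of a 0-100 popularity index into the open interval (0, 1), and all function and field names.

```python
import math

def rescale_popularity(index, n_obs):
    """Map a 0-100 index into the open interval (0, 1).

    Assumption: a common squeeze transform, (y * (n - 1) + 0.5) / n,
    used so that the boundary values 0 and 100 stay inside (0, 1).
    """
    y = index / 100.0
    return (y * (n_obs - 1) + 0.5) / n_obs

def inv_logit(eta):
    """Inverse of the logit link (assumed link function)."""
    return 1.0 / (1.0 + math.exp(-eta))

def beta_logpdf(y, mu, phi):
    """Log density of a Beta distribution with mean mu and precision phi."""
    a = mu * phi
    b = (1.0 - mu) * phi
    return (math.lgamma(phi) - math.lgamma(a) - math.lgamma(b)
            + (a - 1.0) * math.log(y) + (b - 1.0) * math.log(1.0 - y))

def conditional_loglik(songs, beta, intercepts, phi):
    """Log-likelihood given the random intercepts (hypothetical layout).

    songs: list of dicts with keys "album", "features", "popularity".
    beta: fixed-effect coefficients, one per audio feature.
    intercepts: dict mapping each album to its random intercept.
    """
    n = len(songs)
    total = 0.0
    for song in songs:
        eta = intercepts[song["album"]]
        eta += sum(b * x for b, x in zip(beta, song["features"]))
        y = rescale_popularity(song["popularity"], n)
        total += beta_logpdf(y, inv_logit(eta), phi)
    return total
```

A full fit would also integrate over the distribution of the random intercepts (typically Gaussian), which dedicated mixed-model software handles; the sketch only shows how the fixed effects, the group-level intercept and the precision parameter enter the Beta density.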

Keywords: 62; 62H; 62P; Beta GLMM; Spotify web API; audio features; popularity index.

Conflict of interest statement

No potential conflict of interest was reported by the author(s).

Figures

Figure 1. Average values of Spotify features over time.
Figure 2. Distribution of songs according to the Popularity Index, conditional on album (13 November 2019).
Figure 3. Scatterplot of popularity vs. acousticness and danceability.
Figure 4. Scatterplot of popularity vs. liveness and speechiness.
Figure 5. Scatterplot of popularity vs. time and duration.
Figure 6. Scatterplot of popularity vs. valence and loudness.
Figure 7. Scatterplot of popularity vs. instrumentalness and energy.
Figure 8. Random intercepts with 95% confidence intervals.
