Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 18:55:110743.
doi: 10.1016/j.dib.2024.110743. eCollection 2024 Aug.

SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations

Affiliations

SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations

M Civit et al. Data Brief. .

Abstract

The SunoCaps dataset aims to provide an innovative contribution to music data. Expert description of human-made musical pieces, from the widely used MusicCaps dataset, are used as prompts for generating complete songs for this dataset. This Automatic Music Generation is done with the state-of-the-art Suno generator of audio-based music. A subset of 64 pieces from MusicCaps is currently included, with a total of 256 generated entries. This total stems from generating four different variations for each human piece; two versions based on the original caption and two versions based on the original aspect description. As an AI-generated music dataset, SunoCaps also includes expert-based information on prompt alignment, with the main differences between prompt and final generation annotated. Furthermore, annotations describing the main discrete emotions induced by the piece. This dataset can have an array of implementations, such as creating and improving music generation validation tools, training systems for multi-layered architectures and the optimization of music emotion estimation systems.

Keywords: Artificial intelligence; Automatic music generation; Data; Emotion feature; Generative AI; Prompt alignment.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig 1:
Fig. 1
SunoCaps Song Creation Process.

References

    1. M. Civit, SunoCaps, (2024). 10.34740/KAGGLE/DS/4891165. - DOI
    1. A. Agostinelli, T.I. Denk, Z. Borsos, J. Engel, M. Verzetti, A. Caillon, Q. Huang, A. Jansen, A. Roberts, M. Tagliasacchi, Musiclm: generating music from text, ArXiv Preprint ArXiv:2301.11325 (2023).
    1. Inc. Suno, make a song about anything, (2024). http://www.suno.com (accessed May 7, 2024).
    1. Harmon-Jones E., Harmon-Jones C., Summerell E. On the importance of both dimensional and discrete models of emotion. Behav. Sci. 2017;7:66. - PMC - PubMed
    1. Girard J.M., Cohn J.F., Jeni L.A., Lucey S., la Torre F. 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) 2015. How much training data for facial action unit detection? pp. 1–8. - PMC - PubMed

LinkOut - more resources