
Developing a benchmark for emotional analysis of music

Anna Aljanaki et al. PLoS One. 2017.

Abstract

The field of music emotion recognition (MER) has expanded rapidly over the last decade. Many new methods and audio features have been developed to improve the performance of MER algorithms. However, comparing the performance of new methods is difficult because of the diversity of data representations and the scarcity of publicly available data. In this paper, we address these problems by creating a data set and a benchmark for MER. The data set that we release, the MediaEval Database for Emotional Analysis in Music (DEAM), is the largest available data set of dynamic annotations (valence and arousal annotations for 1,802 songs and song excerpts licensed under Creative Commons, with 2 Hz time resolution). Using DEAM, we organized the 'Emotion in Music' task at the MediaEval Multimedia Evaluation Campaign from 2013 to 2015. In total, the benchmark attracted 21 active teams to the challenge. We analyze the results of the benchmark: the winning algorithms and feature sets. We also describe the design of the benchmark, the evaluation procedures, and the data cleaning and transformations that we suggest. The results of the benchmark suggest that recurrent-neural-network-based approaches combined with large feature sets work best for dynamic MER.
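To make the conclusion concrete, the sketch below illustrates what an RNN-based dynamic MER regressor of the kind favored by the benchmark might look like. It is not the code of any benchmark submission; it assumes PyTorch, a placeholder feature dimensionality of 260, and hypothetical 45-second excerpts annotated at 2 Hz (90 frames).

```python
# Minimal sketch (not benchmark code): a GRU regressor mapping frame-level
# audio features to per-frame valence/arousal, in the spirit of the
# RNN-based approaches the benchmark found to work best.
# Assumptions: 260 is a placeholder feature dimensionality; 90 frames
# stands in for a 45-second excerpt annotated at 2 Hz.
import torch
import torch.nn as nn

class DynamicMERNet(nn.Module):
    def __init__(self, n_features=260, hidden_size=64):
        super().__init__()
        self.rnn = nn.GRU(n_features, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, 2)   # valence and arousal

    def forward(self, x):             # x: (batch, time, n_features)
        out, _ = self.rnn(x)          # (batch, time, hidden_size)
        return self.head(out)         # (batch, time, 2)

model = DynamicMERNet()
features = torch.randn(8, 90, 260)    # batch of dummy feature sequences
targets = torch.randn(8, 90, 2)       # dummy dynamic valence/arousal labels
loss = nn.MSELoss()(model(features), targets)
loss.backward()                       # an optimizer step would follow
```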


Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures


Fig 1. A labyrinth of data representation choices for a MER algorithm.

The choices that we made for the benchmark are highlighted in red.


Fig 2. Annotation interface for both continuous (upper-left corner) and static per-song (middle; using the self-assessment manikins [43]) ratings of arousal.


Fig 3. Fitted GAMs for the arousal and valence annotations of two songs.
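As an illustration only, the sketch below fits a generalized additive model (GAM) with a single smooth term over time to a synthetic arousal trace, analogous to the per-song fits shown in Fig 3. It assumes the third-party pygam package; the annotation values are made up, not DEAM data.

```python
# Illustrative sketch: fit a GAM with one smooth term (time) to a
# dynamic annotation trace. Assumes the third-party pygam package;
# the arousal values below are synthetic.
import numpy as np
from pygam import LinearGAM, s

t = np.arange(0, 30, 0.5)                        # 2 Hz time stamps (seconds)
arousal = 0.3 * np.sin(t / 4.0) + 0.05 * np.random.randn(t.size)

gam = LinearGAM(s(0, n_splines=20)).fit(t.reshape(-1, 1), arousal)
smoothed = gam.predict(t.reshape(-1, 1))         # fitted curve, as in Fig 3
```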

Fig 4. Liking of the music and confidence in rating for a) valence, Spearman’s ρ = 0.37, p-value = 2.2 × 10⁻¹⁶; b) arousal, Spearman’s ρ = 0.29, p-value = 2.2 × 10⁻¹⁶.
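The values in Fig 4 are Spearman rank correlations between per-song liking and rating confidence. A correlation of this kind can be computed with scipy as in the hypothetical example below; the ratings are synthetic, not the DEAM annotations.

```python
# Hypothetical example: Spearman correlation between liking and
# confidence ratings (synthetic data).
import numpy as np
from scipy.stats import spearmanr

liking = np.array([1, 3, 4, 2, 5, 4, 3, 2, 5, 1])
confidence = np.array([2, 3, 5, 2, 4, 4, 3, 1, 5, 2])
rho, p_value = spearmanr(liking, confidence)
print(f"rho = {rho:.2f}, p = {p_value:.3g}")
```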


Fig 5. Krippendorff’s α of dynamic annotations in 2015, averaged over all dynamic samples.
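Fig 5 reports Krippendorff’s α as the inter-rater agreement measure for the dynamic annotations. The sketch below computes α for interval-level data (squared-difference distance), following the standard formulation; the rater-by-frame matrix is synthetic, and NaN marks missing ratings.

```python
# Sketch of Krippendorff's alpha for interval data (squared differences),
# the agreement measure reported in Fig 5. Rows are raters, columns are
# time frames, NaN = missing; the matrix below is synthetic.
import numpy as np

def krippendorff_alpha_interval(ratings):
    # Keep only units (columns) with at least two ratings.
    units = []
    for u in range(ratings.shape[1]):
        vals = ratings[:, u]
        vals = vals[~np.isnan(vals)]
        if vals.size >= 2:
            units.append(vals)
    n = sum(v.size for v in units)             # total pairable values

    # Observed disagreement: within-unit squared differences.
    d_obs = 0.0
    for vals in units:
        diffs = (vals[:, None] - vals[None, :]) ** 2
        d_obs += diffs.sum() / (vals.size - 1)
    d_obs /= n

    # Expected disagreement: squared differences across all pooled values.
    pooled = np.concatenate(units)
    d_exp = ((pooled[:, None] - pooled[None, :]) ** 2).sum() / (n * (n - 1))
    return 1.0 - d_obs / d_exp

ratings = np.array([[0.2, 0.3, np.nan, 0.5],
                    [0.1, 0.4, 0.4,    0.6],
                    [0.3, 0.2, 0.5,    np.nan]])
print(krippendorff_alpha_interval(ratings))
```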

Fig 6. Distribution of the labels on the arousal-valence plane for a) the development set; b) the evaluation set.



References

    1. Inskip C, Macfarlane A, Rafferty P. Towards the disintermediation of creative music search: analysing queries to determine important facets. International Journal on Digital Libraries. 2012;12(2):137–147. doi:10.1007/s00799-012-0084-1
    2. Yang YH, Chen HH. Machine recognition of music emotion: A review. ACM Transactions on Intelligent Systems and Technology. 2012;3(3). doi:10.1145/2168752.2168754
    3. Kim YE, Schmidt EM, Migneco R, Morton BG, Richardson P, Scott J, et al. Music emotion recognition: A state of the art review. In: Proceedings of the International Society for Music Information Retrieval Conference; 2010. p. 255–266.
    4. Laurier C, Lartillot O, Eerola T, Toiviainen P. Exploring relationships between audio features and emotion in music. In: Proceedings of the Triennial Conference of the European Society for the Cognitive Sciences of Music; 2009. p. 260–264.
    5. Yang YH, Lin YC, Su YF, Chen HH. A regression approach to music emotion recognition. IEEE Transactions on Audio, Speech, and Language Processing. 2008;16(2):448–457. doi:10.1109/TASL.2007.911513


Grants and funding

AA’s work was supported by the COMMIT/ project (commit-nl.nl). MS was supported by the Ambizione program of the Swiss National Science Foundation (grant number PZ00P2_154981).
