6533b85ffe1ef96bd12c1d45

RESEARCH PRODUCT

Genre-adaptive Semantic Computing and Audio-based Modelling for Music Mood Annotation

Tuomas EerolaGyörgy FazekasOlivier LartillotPasi SaariMathieu BarthetMark Sandler

subject

ExploitMusic information retrievalmusic information retrievalcomputer.software_genre050105 experimental psychologyGenre-adaptive.030507 speech-language pathology & audiology03 medical and health sciencesAnnotationPopular musicSemantic computingMusic information retrieval0501 psychology and cognitive sciencesValence (psychology)genre-adaptivesocial tagsta113music genrebusiness.industry05 social sciencesComputingMilieux_PERSONALCOMPUTINGmood predictionMusic moodHuman-Computer InteractionMoodta6131semantic computingArtificial intelligence0305 other medical sciencebusinessPsychologycomputerSoftwareNatural language processing

description

This study investigates whether taking genre into account is beneficial for automatic music mood annotation in terms of core affects valence, arousal, and tension, as well as several other mood scales. Novel techniques employing genre-adaptive semantic computing and audio-based modelling are proposed. A technique called the ACTwg employs genre-adaptive semantic computing of mood-related social tags, whereas ACTwg-SLPwg combines semantic computing and audio-based modelling, both in a genre-adaptive manner. The proposed techniques are experimentally evaluated at predicting listener ratings related to a set of 600 popular music tracks spanning multiple genres. The results show that ACTwg outperforms a semantic computing technique that does not exploit genre information, and ACTwg-SLPwg outperforms conventional techniques and other genre-adaptive alternatives. In particular, improvements in the prediction rates are obtained for the valence dimension which is typically the most challenging core affect dimension for audio-based annotation. The specificity of genre categories is not crucial for the performance of ACTwg-SLPwg. The study also presents analytical insights into inferring a concise tag-based genre representation for genre-adaptive music mood analysis.

10.1109/taffc.2015.2462841https://vbn.aau.dk/da/publications/e606b23c-1b08-46bb-b797-acb4abd96ef4