Bi-Modal Music Mood Recognition with Audio and Lyrics

This Repository is an implementation of Music Mood Detection Based On Audio And Lyrics With Deep Neural Net by R. Delbouys et al. This model uses two CNN layers and two dense layers to solve Music Emotion Recognition problem. It is using Multi-modal Architecture in Regression task. This bi-modal deep learning structure is expected to combine data from two different domains and reflect information that can not be covered by one domain.

Dataset

For datasets, the Deezer Mood Detection Dataset and parts of the Million Song Dataset was used. The Deezer Mood detection dataset didn't include the audio and the lyrics due to copyright issues and thus had to be supplemented used the Million Song Dataset. However with a few adjustments a dataset can be chosen and used for the task from this website

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
audio_augment.py		audio_augment.py
config.py		config.py
data_loader.py		data_loader.py
feature.py		feature.py
lyrics_preprocess.py		lyrics_preprocess.py
model.py		model.py
train_classification.py		train_classification.py
train_regression.py		train_regression.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bi-Modal Music Mood Recognition with Audio and Lyrics

Dataset

About

Releases

Packages

Languages

Jarvis1000x/Music_Mood_Recognition

Folders and files

Latest commit

History

Repository files navigation

Bi-Modal Music Mood Recognition with Audio and Lyrics

Dataset

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages