AMAAI Lab
The Audio, Music, and AI Lab at Singapore University of Technology and Design (SUTD)
Singapore
Pinned Repositories
awesome-MER
A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks
megamusicaps
MidiCaps
A large-scale dataset of caption-annotated MIDI files.
mirflex
Music Information Retrieval Feature Library for Extraction
Music2Emotion
Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models
mustango
Mustango: Toward Controllable Text-to-Music Generation
MuVi
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses
Text2midi
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
Video2Music
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
AMAAI Lab's Repositories
AMAAI-Lab/mustango
Mustango: Toward Controllable Text-to-Music Generation
AMAAI-Lab/Video2Music
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
AMAAI-Lab/Text2midi
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
AMAAI-Lab/MidiCaps
A large-scale dataset of caption-annotated MIDI files.
AMAAI-Lab/JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks
AMAAI-Lab/awesome-MER
A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
AMAAI-Lab/MuVi
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses
AMAAI-Lab/Music2Emotion
Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models
AMAAI-Lab/mirflex
Music Information Retrieval Feature Library for Extraction
AMAAI-Lab/megamusicaps
AMAAI-Lab/DART
Demo for DART, an Audio Imagination workshop submission at NeurIPS 2024
AMAAI-Lab/PreBit
This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin"
AMAAI-Lab/cross-dataset-emotion-alignment
Code for "Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction"
AMAAI-Lab/DisfluencySpeech
Resources for DisfluencySpeech
AMAAI-Lab/Accented-TTS-MLVAE-ADV
AMAAI-Lab/ai-audio-datasets-list
A list of speech, music, and sound-effect datasets that provide training data for generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
AMAAI-Lab/Audio-Music-AI-Research-Resources
AMAAI-Lab/CM-HRNN
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure
AMAAI-Lab/CVAE-Tacotron
Conditional VAE for Accented Speech Generation
AMAAI-Lab/emotionweb
Website for emotion guidance
AMAAI-Lab/genmusic_demo_list
A list of demo websites for automatic music generation research
AMAAI-Lab/IAMM
An exploration of how generative text-to-music AI models can be used for emotion guidance
AMAAI-Lab/kylo-ren-app
Web interface for AI music generation models
AMAAI-Lab/singapore-music-classifier
Code for the paper "A dataset and classification model for Malay, Hindi, Tamil and Chinese music"
AMAAI-Lab/survey-music-nlp
Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"
AMAAI-Lab/nnAudio
Audio processing using PyTorch 1D convolutional networks
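nnAudio's core idea is to compute spectrograms on the GPU by expressing the short-time Fourier transform as a strided 1D convolution with fixed sinusoidal kernels. A minimal NumPy sketch of that idea (not nnAudio's actual API; all names here are illustrative):

```python
import numpy as np

def conv_spectrogram(signal, n_fft=256, hop=128):
    """Toy magnitude spectrogram via Fourier-basis kernels.

    Each kernel is a Hann-windowed complex sinusoid; sliding it over the
    signal with stride `hop` is equivalent to a strided 1D convolution,
    which is how nnAudio maps the STFT onto a conv layer.
    """
    window = np.hanning(n_fft)
    freqs = np.arange(n_fft // 2 + 1)          # keep non-negative bins only
    t = np.arange(n_fft)
    kernels = window * np.exp(-2j * np.pi * freqs[:, None] * t / n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop : i * hop + n_fft]
                       for i in range(n_frames)])
    # Frame-wise dot products == strided convolution with the kernels.
    return np.abs(frames @ kernels.T)          # (n_frames, n_fft//2 + 1)

# A 1 kHz sine at 8 kHz sampling should peak in bin 1000/8000 * 256 = 32.
sr = 8000
x = np.sin(2 * np.pi * 1000 * np.arange(sr) / sr)
spec = conv_spectrogram(x)
```

In nnAudio itself the kernels live in a PyTorch conv layer, so they can optionally be made trainable and the whole transform runs on the GPU alongside the model.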
AMAAI-Lab/AudioLoader
PyTorch Dataset for Speech and Music audio
AMAAI-Lab/FundamentalMusicEmbedding
Fundamental Music Embedding (FME)
AMAAI-Lab/Jointist
Official Implementation of Jointist
AMAAI-Lab/ReconVAT
ReconVAT: a semi-supervised automatic music transcription (AMT) model