ta603's Stars
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
affige/genmusic_demo_list
a list of demo websites for automatic music generation research
jeanfeydy/geomloss
Geometric loss functions between point clouds, images and volumes
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Stanford-TML/EDGE
Official PyTorch Implementation of EDGE (CVPR 2023)
lisiyao21/Bailando
Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
SJTMusicTeam/Muskits
An opensource music processing toolkit
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
wzk1015/video-bgm-generation
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
lingjzhu/charsiu
Charsiu: A neural phonetic aligner.
arkrow/PyMusicLooper
A python program for repeating music endlessly and creating seamless music loops, with play/export/tagging support.
L-YeZhu/CDCD
[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).
jthickstun/anticipation
Anticipatory Autoregressive Models
keums/icassp2022-vocal-transcription
Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"
Yikai-Liao/symusic
A swift and unified toolkit for symbolic music processing
eloimoliner/CQTdiff
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
L-YeZhu/D2M-GAN
[ECCV2022] D2M-GAN for music generation from dance videos
rabitt/ismir2017-deepsalience
Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"
f90/jamendolyrics
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
sony/hFT-Transformer
Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
bill317996/Melody-extraction-with-melodic-segnet
The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"
york135/singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
gulnazaki/lyrics-melody
Lyrics and Vocal Melody Generation conditioned on Accompaniment
seyong92/phoneme-informed-note-level-singing-transcription
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
gunagg/Dance2Music
Automatic Dance-driven Music Generation
amanteur/CHAD
Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)
ta603/RefinPaint
seyong92/CSD_reannotation
Re-annotation for CSD dataset for singing transcription
ta603/Self-supervised_Metric_Learning