ta603

ta603's Stars

haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language: Python 1.1k 26 57107
affige/genmusic_demo_list
a list of demo websites for automatic music generation research
609 33 741
jeanfeydy/geomloss
Geometric loss functions between point clouds, images and volumes
Language: Python 586 13 7257
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Language: Python 514 16 2558
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Language: Python 489 12 519
Stanford-TML/EDGE
Official PyTorch Implementation of EDGE (CVPR 2023)
Language: Python 437 11 4565
lisiyao21/Bailando
Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
Language: Python 382 12 5261
SJTMusicTeam/Muskits
An open-source music processing toolkit
Language: Python 310 16 3144
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
Language: Python 301 15 1621
wzk1015/video-bgm-generation
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
Language: Python 284 9 2733
lingjzhu/charsiu
Charsiu: A neural phonetic aligner.
Language: Jupyter Notebook 270 8 1733
arkrow/PyMusicLooper
A Python program for repeating music endlessly and creating seamless music loops, with play/export/tagging support.
Language: Python 249 6 3023
L-YeZhu/CDCD
[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).
Language: Python 154 6 119
jthickstun/anticipation
Anticipatory Autoregressive Models
Language: Python 147 5 1427
keums/icassp2022-vocal-transcription
Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"
Language: Python 139 1 418
Yikai-Liao/symusic
A swift and unified toolkit for symbolic music processing
Language: C++ 126 4 328
eloimoliner/CQTdiff
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
Language: Jupyter Notebook 104 3 211
L-YeZhu/D2M-GAN
[ECCV2022] D2M-GAN for music generation from dance videos
Language: Python 85 5 1214
rabitt/ismir2017-deepsalience
Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"
Language: Jupyter Notebook 83 5 119
f90/jamendolyrics
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
Language: Python 73 9 410
sony/hFT-Transformer
PyTorch implementation of an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
Language: Python 70 3 310
bill317996/Melody-extraction-with-melodic-segnet
The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"
Language: Python 69 3 413
york135/singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
Language: Python 52 2 215
gulnazaki/lyrics-melody
Lyrics and Vocal Melody Generation conditioned on Accompaniment
Language: Python 27 2 03
seyong92/phoneme-informed-note-level-singing-transcription
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
Language: Python 24 5 21
gunagg/Dance2Music
Automatic Dance-driven Music Generation
Language: Python 16 1 15
amanteur/CHAD
Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)
Language: Python 14 3 01
ta603/RefinPaint
Language: Python 101
seyong92/CSD_reannotation
Re-annotation for CSD dataset for singing transcription
5 1 00
ta603/Self-supervised_Metric_Learning
Language: Python 4 1 00