ShunyaYamagami

Pinned Repositories

demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language: Python 8.1k 152 5411k
Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAI's Whisper to transcribe and diarize audio files
Language: Jupyter Notebook 278 6 840
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 5.9k 71 988754
DPLCLIP
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
Language: Python 0 0
test-time-adaptation
A repository and benchmark for online test-time adaptation.
Language: Python 0 0
video-bgm-generation
Music generation that matches the mood of videos. ref: Video Background Music Generation with Controllable Music Transformer
Language: Python 0 0

ShunyaYamagami's Repositories

ShunyaYamagami/DPLCLIP
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
Language: Python 0 0
ShunyaYamagami/test-time-adaptation
A repository and benchmark for online test-time adaptation.
Language: Python 0 0
ShunyaYamagami/video-bgm-generation
Music generation that matches the mood of videos. ref: Video Background Music Generation with Controllable Music Transformer
Language: Python 0 0