Pinned Repositories
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAI's Whisper to transcribe and diarize audio files
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
DPLCLIP
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
test-time-adaptation
A repository and benchmark for online test-time adaptation.
video-bgm-generation
Music generation that matches the mood of videos. ref: Video Background Music Generation with Controllable Music Transformer
ShunyaYamagami's Repositories
ShunyaYamagami/DPLCLIP
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
ShunyaYamagami/test-time-adaptation
A repository and benchmark for online test-time adaptation.
ShunyaYamagami/video-bgm-generation
Music generation that matches the mood of videos. ref: Video Background Music Generation with Controllable Music Transformer