Pinned Repositories
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
awesome-LoRA
A curated list of Parameter Efficient Fine-tuning papers with a TL;DR
DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
DreamSound
Code for Investigating Personalization Methods in Text to Music Generation
KaldiLongAligner
Speech to Text Alignment tool implemented with Python and Kaldi
local_pnp
This repo contains experiments for local editing in Diffusion Models
localdiff-demo
A repo containing a demo for Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis
Reading-Diffusion
A collection of interesting papers on Diffusion Models
wsac
This repository contains code for Weakly Supervised Automated Audio Captioning via Text Only Training
zelaki's Repositories
zelaki/DreamSound
Code for Investigating Personalization Methods in Text to Music Generation
zelaki/DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
zelaki/KaldiLongAligner
Speech to Text Alignment tool implemented with Python and Kaldi
zelaki/awesome-LoRA
A curated list of Parameter Efficient Fine-tuning papers with a TL;DR
zelaki/Reading-Diffusion
A collection of interesting papers on Diffusion Models
zelaki/localdiff-demo
A repo containing a demo for Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis
zelaki/wsac
This repository contains code for Weakly Supervised Automated Audio Captioning via Text Only Training
zelaki/local_pnp
This repo contains experiments for local editing in Diffusion Models
zelaki/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
zelaki/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
zelaki/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
zelaki/Folder-Structure-Conventions
Folder / directory structure options and naming conventions for software projects
zelaki/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
zelaki/kaldi-long-audio-alignment
Long audio alignment using Kaldi
zelaki/passt_hear21
zelaki/presentations
A repo for storing Marp presentations
zelaki/sail_align
SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a Perl library, but its functionality also depends on freely available software, namely HTK, SRILM and sclite.
zelaki/secretsanta
Host a secret santa without leaking your guests' information 🎄
zelaki/SEffCaps
Automated Audio Captioning of Sound Effects in Movies and Videos
zelaki/user_study_templates
zelaki/wavetransformer
Code base for WaveTransformer: A novel architecture for automated audio captioning
zelaki/zelaki.github.io