Pinned Repositories
Cacophony
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
Filler-semi-CRF
Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]
GenerativeSourceSeparation
Open-source code for the paper 'Music Source Separation with Generative Flow'
gzhu06.github.io
Personal webpage
Manifold-Constrained-Gradient-ipynb
Unofficial implementation of the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints' (https://arxiv.org/abs/2206.00941)
PodcastFillers_Utils
Utility functions for preprocessing PodcastFillers dataset
TDspkr-mismatch-study
Codebase for "A study of the robustness of raw waveform based speaker embeddings under mismatched conditions"
Unconditional-Audio-Generation-Benchmark
Unconditional audio generation benchmark
Waveform-Synthesizer-with-Diffusion
archived
Y-vector
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
gzhu06's Repositories
gzhu06/Cacophony
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
gzhu06/Y-vector
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
gzhu06/GenerativeSourceSeparation
Open-source code for the paper 'Music Source Separation with Generative Flow'
gzhu06/Manifold-Constrained-Gradient-ipynb
Unofficial implementation of the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints' (https://arxiv.org/abs/2206.00941)
gzhu06/Filler-semi-CRF
Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]
gzhu06/Unconditional-Audio-Generation-Benchmark
Unconditional audio generation benchmark
gzhu06/PodcastFillers_Utils
Utility functions for preprocessing PodcastFillers dataset
gzhu06/TDspkr-mismatch-study
Codebase for "A study of the robustness of raw waveform based speaker embeddings under mismatched conditions"
gzhu06/Waveform-Synthesizer-with-Diffusion
archived
gzhu06/gzhu06.github.io
Personal webpage
gzhu06/openSFX-TFShard
A codebase for sharding open-source SFX data into TFRecord files