gallilmaimon's Stars
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM: Interleaved Spoken and Written Language Model".
jonkahana/ProbeGen
An official implementation of ProbeGen
MoSalama98/DSiRe
Official implementation of "Dataset Size Recovery from LoRA Weights" paper.
slp-rl/HebTTS
The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
guyyariv/vLMIG
This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation
ShovalMessica/NAST
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11037
eliahuhorwitz/MoTHer
Official PyTorch Implementation for the "Model Tree Heritage Recovery" paper.
slp-rl/salmon
The official code for the SALMon benchmark
AsafShul/PoDD
Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.
eliahuhorwitz/Spectral-DeTuning
Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).
guyyariv/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
slp-rl/SLM-Discrete-Representations
This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)
slp-rl/DISSC
This is a fork of the official repository of "Speaking Style Conversion With Discrete Self-Supervised Units"
guyyariv/AudioToken
This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
slp-rl/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
gallilmaimon/DISSC
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730
RoySheffer/im2wav
Official implementation of the pipeline presented in "I hear your true colors: Image Guided Audio Generation"
OdedH/textual-pca
Official implementation of "Describing Sets of Images with Textual-PCA".
facebookresearch/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
TzviLederer/silent-killer
Implementation of the paper Silent Killer
jonkahana/CLIPPR
An official PyTorch implementation for CLIPPR
eliahuhorwitz/Conffusion
Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.
gallilmaimon/LUNATC
This is the official implementation of "A Universal Adversarial Policy for Text Classifiers", Neural Networks (2022), https://doi.org/10.1016/j.neunet.2022.06.018
amosy3/Text2Model
jonkahana/DCoDR
PyTorch Implementation of DCoDR
slp-rl/SC-PhASE
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)