Pinned Repositories
AdvancedAutomaticSpeechRecognition
audio-gen-dreambooth
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
hf_image_uploader
metaseq
Repo for external large-scale work
notebooks
Some notebooks for NLP
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Wav2Vec2_ParlanceCTCDecode
Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
whisper-tools
patrickvonplaten's Repositories
patrickvonplaten/Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
patrickvonplaten/AdvancedAutomaticSpeechRecognition
patrickvonplaten/metaseq
Repo for external large-scale work
patrickvonplaten/t5-mtf-to-hf-converter
patrickvonplaten/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
patrickvonplaten/blog
Public repo for HF blog posts
patrickvonplaten/pytorch_diffusion
PyTorch reimplementation of Diffusion Models
patrickvonplaten/TRexGameRL
patrickvonplaten/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
patrickvonplaten/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
patrickvonplaten/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
patrickvonplaten/seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
patrickvonplaten/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
patrickvonplaten/data2vec_vision
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
patrickvonplaten/datasets-1
🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas
patrickvonplaten/ddim
Denoising Diffusion Implicit Models
patrickvonplaten/diffusion
Denoising Diffusion Probabilistic Models
patrickvonplaten/huggingface_hub
All the open source things related to the Hugging Face Hub.
patrickvonplaten/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
patrickvonplaten/karlo
patrickvonplaten/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
patrickvonplaten/longt5
patrickvonplaten/markup2im
Diffusion-based markup-to-image generation
patrickvonplaten/PNDM
The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (ICLR 2022) and a generic framework for DDIM-like models
patrickvonplaten/pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
patrickvonplaten/sample-generator
Tools to train a generative model on arbitrary audio samples
patrickvonplaten/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
patrickvonplaten/Symphonia
Pure Rust multimedia format demuxing, tag reading, and audio decoding library
patrickvonplaten/Versatile-Diffusion
patrickvonplaten/VQ-Diffusion
Official implementation of VQ-Diffusion