Pinned Repositories
AdvancedAutomaticSpeechRecognition
audio-gen-dreambooth
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
hf_image_uploader
metaseq
Repo for external large-scale work
notebooks
Some notebooks for NLP
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Wav2Vec2_ParlanceCTCDecode
Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
whisper-tools
patrickvonplaten's Repositories
patrickvonplaten/notebooks
Some notebooks for NLP
patrickvonplaten/audio-gen-dreambooth
patrickvonplaten/hf_image_uploader
patrickvonplaten/whisper-tools
patrickvonplaten/scientific_images
patrickvonplaten/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
patrickvonplaten/generative-models
Generative Models by Stability AI
patrickvonplaten/InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
patrickvonplaten/ml-engineering
Machine Learning Engineering Open Book
patrickvonplaten/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
patrickvonplaten/doc-builder
The package used to build the documentation of our Hugging Face repos
patrickvonplaten/instruct-pix2pix
patrickvonplaten/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
patrickvonplaten/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
patrickvonplaten/stable-diffusion-1
Latent Text-to-Image Diffusion
patrickvonplaten/whisper
patrickvonplaten/whisper-long-form
patrickvonplaten/candle
Minimalist ML framework for Rust
patrickvonplaten/compel
A prompting enhancement library for transformers-type text embedding systems
patrickvonplaten/fairscale
PyTorch extensions for high performance and large scale training.
patrickvonplaten/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
patrickvonplaten/fetch-configs
patrickvonplaten/flash-attention
Fast and memory-efficient exact attention
patrickvonplaten/indexing
patrickvonplaten/invisible-watermark
python library for invisible image watermark (blind image watermark)
patrickvonplaten/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
patrickvonplaten/prodigy
The Prodigy optimizer and its variants for training neural networks.
patrickvonplaten/stable-diffusion
patrickvonplaten/UniPC
patrickvonplaten/wavem