Pinned Repositories
AdvancedAutomaticSpeechRecognition
audio-gen-dreambooth
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
hf_image_uploader
metaseq
Repo for external large-scale work
notebooks
Some notebooks for NLP
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Wav2Vec2_ParlanceCTCDecode
Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
whisper-tools
patrickvonplaten's Repositories
patrickvonplaten/notebooks
Some notebooks for NLP
patrickvonplaten/audio-gen-dreambooth
patrickvonplaten/hf_image_uploader
patrickvonplaten/scientific_images
patrickvonplaten/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
patrickvonplaten/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
patrickvonplaten/generative-models
Generative Models by Stability AI
patrickvonplaten/InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
patrickvonplaten/ml-engineering
Machine Learning Engineering Open Book
patrickvonplaten/stable-diffusion
patrickvonplaten/stable-diffusion-1
Latent Text-to-Image Diffusion
patrickvonplaten/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
patrickvonplaten/doc-builder
The package used to build the documentation of our Hugging Face repos
patrickvonplaten/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
patrickvonplaten/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
patrickvonplaten/whisper
patrickvonplaten/whisper-long-form
patrickvonplaten/candle
Minimalist ML framework for Rust
patrickvonplaten/compel
A prompting enhancement library for transformers-type text embedding systems
patrickvonplaten/fairscale
PyTorch extensions for high performance and large scale training.
patrickvonplaten/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
patrickvonplaten/fetch-configs
patrickvonplaten/flash-attention
Fast and memory-efficient exact attention
patrickvonplaten/indexing
patrickvonplaten/invisible-watermark
python library for invisible image watermark (blind image watermark)
patrickvonplaten/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
patrickvonplaten/prodigy
The Prodigy optimizer and its variants for training neural networks.
patrickvonplaten/UniPC
patrickvonplaten/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
patrickvonplaten/wavem