Pinned Repositories
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
finetrainers
Scalable and memory-optimized training of diffusion models
instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
Adventures-in-TensorFlow-Lite
This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.
diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
ml-deployment-k8s-fastapi
This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.
nanoDiT
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
probing-vits
Probing the representations of Vision Transformers.
TF-2.0-Hacks
Contains my explorations of TensorFlow 2.x
tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
sayakpaul's Repositories
sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
sayakpaul/tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
sayakpaul/caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
sayakpaul/cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
sayakpaul/maxim-tf
Implementation of MAXIM in TensorFlow.
sayakpaul/nanoDiT
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
sayakpaul/flux-image-editing
Scripts to teach Flux the task of image editing from language with the Flux Control framework.
sayakpaul/single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
sayakpaul/stable-diffusion-keras-ft
Fine-tuning Stable Diffusion using Keras.
sayakpaul/hf-codegen
A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.
sayakpaul/simple-image-recaptioning
Recaption large (Web)Datasets with vLLM and save the artifacts.
sayakpaul/keras-xla-benchmarks
Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.
sayakpaul/portfolio
Personal site of Sayak Paul.
sayakpaul/q8-ltx-video
This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on Ada GPUs.
sayakpaul/instructpix2pix-sdxl
Training InstructPix2Pix with SDXL.
sayakpaul/cait-tf
Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.
sayakpaul/you-dont-know-tensorflow
Contains materials for my talk "You don't know TensorFlow".
sayakpaul/SSD-1B
SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.
sayakpaul/auto-diffusers-docs
Use an LLM to generate reasonable code snippets in a hardware-aware manner for Diffusers.
sayakpaul/model-card-generation-hf
Shows how to bulk generate model cards for models on 🤗 Hub.
sayakpaul/quarto-blog
sayakpaul/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
sayakpaul/sayakpaul
Sayak Paul Portfolio
sayakpaul/ao
PyTorch native quantization and sparsity for training and inference
sayakpaul/blossom
Repository to test automated PyPI publishing through GitHub Actions.
sayakpaul/notebooks
Notebooks using the Hugging Face libraries 🤗
sayakpaul/watermark-detection
Model for watermark classification implemented with PyTorch
sayakpaul/diffusion-to-gguf
sayakpaul/img-ds-creation-scripts
Scripts pair-coded with LLMs to format and prepare large-scale image datasets.
sayakpaul/poc-style-bot