LouisRouss
AI research Engineer particularly interested in generative AI and more broadly in Computer Vision
IRT Saint ExuperyToulouse
LouisRouss's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
lucidrains/rectified-flow-pytorch
Implementation of rectified flow and some of its followup research / improvements in Pytorch
Lucas-rbnt/DRIM
[MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
black-forest-labs/flux
Official inference repo for FLUX.1 models
Zian-Xu/Swin-MAE
Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805
KwaiVGI/LivePortrait
Bring portraits to life!
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
lucidrains/multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
mcmonkeyprojects/SwarmUI
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
deel-ai/xplique
👋 Xplique is a Neural Networks Explainability Toolbox
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
SuReLI/RRLS
Robust Reinforcement Learning Suite
mosaicml/diffusion
lllyasviel/Omost
Your image is almost there!
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
chaiNNer-org/chaiNNer
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.
RotsteinNoam/Paint-by-Inpaint
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis