LouisRouss
AI Research Engineer, particularly interested in generative AI and, more broadly, in computer vision
IRT Saint Exupery, Toulouse
LouisRouss's Stars
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
nupurkmr9/vision-aided-gan
Ensembling Off-the-shelf Models for GAN Training (CVPR 2022 Oral)
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
upscayl/upscayl
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
charlax/professional-programming
A collection of learning resources for curious software engineers
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
state-spaces/mamba
Mamba SSM architecture
rlabbe/filterpy
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
lixinustc/Awesome-diffusion-model-for-image-processing
A summary of diffusion-based image processing, including restoration, enhancement, coding, and quality assessment
cyclomon/UNSB
Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)
ybbbbt/DreamSpace
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
cheind/py-motmetrics
📊 Benchmark multiple object trackers (MOT) in Python
xingyizhou/CenterTrack
Simultaneous object detection and tracking using center points.
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
lllyasviel/Fooocus
Focus on prompting and generating
gnobitab/InstaFlow
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
594422814/UDT
THUDM/RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation