LouisRouss

AI research Engineer particularly interested in generative AI and more broadly in Computer Vision

IRT Saint ExuperyToulouse

LouisRouss's Stars

FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4.3k317
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
Language:Python1.9k65
nupurkmr9/vision-aided-gan
Ensembling Off-the-shelf Models for GAN Training (CVPR 2022 Oral)
Language:Python38426
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Language:Python53824
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Language:Python1.7k190
upscayl/upscayl
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Language:TypeScript31.4k1.5k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python58.1k6.2k
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Language:Python1.1k57
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
2k27
charlax/professional-programming
A collection of learning resources for curious software engineers
Language:Python46.8k3.7k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python56.6k5.8k
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.6k243
state-spaces/mamba
Mamba SSM architecture
Language:Python13.3k1.1k
rlabbe/filterpy
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.
Language:Python3.4k628
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Language:Python8.8k1.2k
lixinustc/Awesome-diffusion-model-for-image-processing
one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment
64250
cyclomon/UNSB
Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)
Language:Python1719
ybbbbt/DreamSpace
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
1071
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
Language:Python5k399
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Language:Python4.4k231
cheind/py-motmetrics
:bar_chart: Benchmark multiple object trackers (MOT) in Python
Language:Python1.4k258
xingyizhou/CenterTrack
Simultaneous object detection and tracking using center points.
Language:Python2.4k527
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.3k65
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
Language:Python1.7k329
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Language:Jupyter Notebook945426
lllyasviel/Fooocus
Focus on prompting and generating
Language:Python41.7k5.9k
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
Language:Python1.2k39
594422814/UDT
Language:MATLAB15824
THUDM/RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
Language:Python27319
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook11k1.1k

LouisRouss

LouisRouss's Stars

FoundationVision/VAR

facebookresearch/schedule_free

nupurkmr9/vision-aided-gan

NVlabs/edm2

GaParmar/img2img-turbo

upscayl/upscayl

comfyanonymous/ComfyUI

TencentQQGYLab/ELLA

lllyasviel/LayerDiffuse

charlax/professional-programming

labmlai/annotated_deep_learning_paper_implementations

Luodian/Otter

state-spaces/mamba

rlabbe/filterpy

voicepaw/so-vits-svc-fork

lixinustc/Awesome-diffusion-model-for-image-processing

cyclomon/UNSB

ybbbbt/DreamSpace

aigc-apps/sd-webui-EasyPhoto

luosiallen/latent-consistency-model

cheind/py-motmetrics

xingyizhou/CenterTrack

wangkai930418/awesome-diffusion-categorized

real-stanford/diffusion_policy

NirAharon/BoT-SORT

lllyasviel/Fooocus

gnobitab/InstaFlow

594422814/UDT

THUDM/RelayDiffusion

facebookresearch/seamless_communication