LouisRouss
AI research Engineer particularly interested in generative AI and more broadly in Computer Vision
IRT Saint ExuperyToulouse
LouisRouss's Stars
AntonOsika/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
facefusion/facefusion
Industry leading face manipulation platform
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
mlfoundations/open_clip
An open source implementation of CLIP.
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
KaiyangZhou/deep-person-reid
Torchreid: Deep learning person re-identification in PyTorch.
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
lucidrains/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
mlco2/codecarbon
Track emissions from Compute and recommend ways to reduce their impact on the environment.
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
havakv/pycox
Survival analysis with PyTorch
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
MC-E/DragonDiffusion
ICLR 2024 (Spotlight)
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
jaredleekatzman/DeepSurv
DeepSurv is a deep learning approach to survival analysis.
ChenWu98/cycle-diffusion
[ICCV 2023] A latent space for stochastic diffusion models
THUDM/RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
kfirgoldberg/ConceptLab
Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"
lorenzo-stacchio/Stable-Diffusion-Inpaint
Stable diffusion for inpainting
594422814/UDT
BaratiLab/Diffusion-based-Fluid-Super-resolution
PyTorch implementation of the diffusion-based method for CFD data super-resolution proposed in the paper "A Physics-informed Diffusion Model for High-fidelity Flow Field Reconstruction".
UCSB-NLP-Chang/CoPaint
Implementation of paper 'Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models'
deel-ai/oodeel
Simple, compact, and hackable post-hoc deep OOD detection for already trained tensorflow or pytorch image classifiers.