gitlabspy's Stars
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
black-forest-labs/flux
Official inference repo for FLUX.1 models
shadps4-emu/shadPS4
PlayStation 4 emulator for Windows, Linux and macOS written in C++
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
KwaiVGI/LivePortrait
Bring portraits to life!
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
Kwai-Kolors/Kolors
Kolors Team
XLabs-AI/x-flux
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
dvlab-research/ControlNeXt
Controllable video and image generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with PyTorch.
Algolzw/daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
NVlabs/DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
cloneofsimo/minSDXL
Hugging Face-compatible SDXL UNet implementation that is readily hackable
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in PyTorch
bytedance/tarsier
Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video understanding capability.
VINHYU/CoSeR
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
instantX-research/CSGO
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
lucidrains/rectified-flow-pytorch
Implementation of rectified flow and some of its follow-up research and improvements in PyTorch
ViTAE-Transformer/QFormer
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
jimmycv07/DiffIR2VR-Zero
hp-l33/AiM
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
xhinker/sd_embed
Generate long weighted prompt embeddings for Stable Diffusion
lxa9867/ControlVAR
This is the official implementation for ControlVAR.
instantX-research/IP-Adapter-for-SD3
More suitable IP-Adapter for the DiT architecture