gitlabspy

gitlabspy's Stars

hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
Language: Python 48.8k 308 6937.2k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python 21.1k 175 2041.5k
shadps4-emu/shadPS4
PlayStation 4 emulator for Windows, Linux and macOS written in C++
Language: C++ 21k 172 9121.3k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
14.5k 270 135937
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python 14.5k 126 4181.6k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Language: Python 7.4k 57 845567
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
6k 82 21563
Kwai-Kolors/Kolors
Kolors Team
Language: Python 4.3k 44 157322
XLabs-AI/x-flux
Language: Python 2k 33 133141
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python 1.6k 21 7874
dvlab-research/ControlNeXt
Controllable video and image generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language: Python 1.5k 26 8279
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language: Python 1.4k 18 8178
cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with PyTorch.
Language: Python 937 10 7127
Algolzw/daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Language: Python 738 9 9939
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language: Python 553 5 3525
NVlabs/DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
488 54 518
cloneofsimo/minSDXL
Huggingface-compatible SDXL Unet implementation that is readily hackable
Language: Jupyter Notebook 415 5 433
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
415 17 215
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in PyTorch
Language: Python 365 12 610
bytedance/tarsier
Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with good general video understanding capability.
Language: Python 338 8 2319
VINHYU/CoSeR
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
337 28 1210
instantX-research/CSGO
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
Language: Jupyter Notebook 314 16 2110
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
Language: Python 310 13 4711
lucidrains/rectified-flow-pytorch
Implementation of rectified flow and some of its follow-up research and improvements in PyTorch
Language: Python 266 9 1013
ViTAE-Transformer/QFormer
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
Language: Python 201 2 310
jimmycv07/DiffIR2VR-Zero
Language: Python 153 3 1214
hp-l33/AiM
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
Language: Python 127 2 107
xhinker/sd_embed
Generate long weighted prompt embeddings for Stable Diffusion
Language: Python 110 2 1615
lxa9867/ControlVAR
This is the official implementation for ControlVAR.
Language: Python 100 3 223
instantX-research/IP-Adapter-for-SD3
More suitable IP-Adapter for the DiT architecture
29 8 11