chenqi1126

Sun Yat-sen University (SYSU)

chenqi1126's Stars

comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python 61.6k 432 4.1k 6.6k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 37.4k 352 1.8k 4.6k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 27.7k 230 2733.2k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python 26.9k 211 4.4k 5.5k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 22.9k 189 5242.3k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 20.9k 158 1.6k 2.3k
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
14k 116 501.4k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.3k 258 128840
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language: Python 10.8k 101 370878
mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 10.7k 81 5051k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10.1k 97 676980
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language: Jupyter Notebook 5.5k 61 399350
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language: Python 3.9k 29 462236
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.7k 141 28209
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
Language: Python 3.6k 20 64318
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
2.7k 125 10227
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
2k 114 3628
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language: Python 1.6k 21 120118
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Language: Python 1.4k 50 3674
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
848 53 1436
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Language: Jupyter Notebook 741 13 5446
Bujiazi/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Language: Python 370 18 1528
JiuTian-VL/JiuTian-LION
[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Language: Jupyter Notebook 125 13 66
kevin-ssy/CLIP_as_RNN
Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
101 16 113
PangzeCheung/SingDiffusion
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Language: Python 65 4 13
vpulab/ovam
Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024
Language: Python 58 3 36
CodeGoat24/DreamText
Official implementation of High Fidelity Scene Text Synthesis.
Language: Python 36 6 50
LinlyAC/VDT-AGPReID
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (CVPR'24)
Language: Python 35 2 41
ccccwb/Multimodal-Detection-and-Tracking-UAV
A Multimodal Detection and Tracking System based on DJI Payload SDK and Mobile SDK.
Language: C 11 1 52
WondrousWisdomcard/DiffuseQR
A Progressive Optimization Method for Text-Guided Aesthetic QR Code Generation
2 1 00
