ModestYjx

The closer you are to your dream, the more motivated you are！

Peking UniversityBeijing

ModestYjx's Stars

CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook68.9k 563 71710.2k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.6k 449 3155.1k
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python31k 220 5572.8k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.7k 325 4053.4k
google/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language:C++25.6k 491 4.9k5k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.9k 158 1.6k2.3k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook10.1k 97 676980
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language:Jupyter Notebook9.2k 89 348860
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.8k 77 579634
facebookresearch/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python6.7k 97 6921.2k
kohya-ss/sd-scripts
Language:Python5.5k 56 1.2k901
Akegarasu/lora-scripts
SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
Language:Python4.8k 30 521585
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python3.1k 29 200219
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Language:Python2.6k 28 238398
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Language:Python2.5k 37 71176
AIGCDesignGroup/ReplaceAnything
2.4k 128 2096
IceClear/StableSR
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
Language:Python2.3k 25 151149
dreamoving/dreamoving-project
Official implementation of DreaMoving
1.8k 130 1097
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language:Python1.7k 16 24104
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language:Python1.7k 22 8885
Sense-X/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Language:Python1.1k 10 191124
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Language:Python903 7 6644
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Language:Python499 8 6431
e4s2022/e4s
(CVPR 2023) E4S: Fine-grained Face Swapping via Regional GAN Inversion
Language:Python374 20 4033
UX-Decoder/LLaVA-Grounding
Language:Python370 20 2614
sail-sg/CLoT
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".
Language:Python303 8 2315
VPGTrans/VPGTrans
Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.
Language:Python271 6 1925
facebookresearch/DCI
Densely Captioned Images (DCI) dataset repository.
Language:Python163 4 155
bytedance/FreeSeg
Language:Python132 5 1616
shan-mx/Video-CLIP-Indexer
Language:Python51 2 115

ModestYjx

ModestYjx's Stars

CompVis/stable-diffusion

Stability-AI/stablediffusion

lllyasviel/ControlNet

openai/CLIP

google/mediapipe

haotian-liu/LLaVA

salesforce/LAVIS

modelscope/facechain

facebookresearch/xformers

facebookresearch/SlowFast

kohya-ss/sd-scripts

Akegarasu/lora-scripts

PKU-YuanGroup/Video-LLaVA

facebookresearch/Mask2Former

baaivision/Painter

AIGCDesignGroup/ReplaceAnything

IceClear/StableSR

dreamoving/dreamoving-project

ttengwang/Caption-Anything

baaivision/Emu

Sense-X/Co-DETR

PKU-YuanGroup/Chat-UniVi

shenyunhang/APE

e4s2022/e4s

UX-Decoder/LLaVA-Grounding

sail-sg/CLoT

VPGTrans/VPGTrans

facebookresearch/DCI

bytedance/FreeSeg

shan-mx/Video-CLIP-Indexer