liuheng92's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
lllyasviel/Fooocus
Focus on prompting and generating
chenfei-wu/TaskMatrix
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
yoheinakajima/babyagi
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
apple/ml-ferret
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
apple/ml-mgie
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
AIGCDesignGroup/ReplaceAnything
microsoft/GLIP
Grounded Language-Image Pre-training
pymatting/pymatting
A Python library for alpha matting
rlawjdghek/StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
OFA-Sys/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
lichengunc/refer
Referring Expression Datasets API
aimagelab/multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
Cheems-Seminar/grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
UX-Decoder/LLaVA-Grounding
michaelowenliu/awesome-image-matting
A collection of AWESOME things about image matting.
MaverickRen/PixelLM
PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM was accepted at CVPR 2024.
BryanPlummer/flickr30k_entities
Flickr30K Entities Dataset
bytedance/coconut_cvpr2024
YigitEkin/CLIPAway
[NeurIPS 2024] Official Implementation of CLIPAway