liuheng92

liuheng92's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook95.5k 692 7.9k15.5k
lllyasviel/Fooocus
Focus on prompting and generating
Language:Python41.7k 325 1.5k5.9k
chenfei-wu/TaskMatrix
Language:Python34.6k 301 3553.3k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.7k 383 1812k
yoheinakajima/babyagi
Language:Python20.5k 302 1512.7k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.5k 159 1.6k2.3k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.3k 114 3901.4k
apple/ml-ferret
Language:Python8.5k 159 0500
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language:Jupyter Notebook7.6k 92 148795
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
Language:Jupyter Notebook7k 97 7071.1k
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
5.7k 214 57431
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.1k 49 453385
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
3.9k 29 2261
apple/ml-mgie
Language:Python3.9k 62 0252
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Language:Python3.3k 39 57193
AIGCDesignGroup/ReplaceAnything
2.4k 127 2096
microsoft/GLIP
Grounded Language-Image Pre-training
Language:Python2.2k 46 171194
pymatting/pymatting
A Python library for alpha matting
Language:Python1.8k 41 66221
rlawjdghek/StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Language:Python1k 50 0165
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
1k 28 598
OFA-Sys/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Language:Python975 14 5763
lichengunc/refer
Referring Expression Datasets API
Language:Jupyter Notebook467 7 2278
aimagelab/multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
Language:Python413 28 3048
Cheems-Seminar/grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
Language:Jupyter Notebook390 4 1018
UX-Decoder/LLaVA-Grounding
Language:Python355 20 2614
michaelowenliu/awesome-image-matting
A collection of AWESOME things about image matting.
288 25 626
MaverickRen/PixelLM
PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM is accepted by CVPR 2024.
Language:Python184 4 265
BryanPlummer/flickr30k_entities
Flickr30K Entities Dataset
Language:MATLAB167 3 126
bytedance/coconut_cvpr2024
Language:Jupyter Notebook148 4 296
YigitEkin/CLIPAway
[NeurIPS 2024] Official Implementation of CLIPAway
Language:Python53 4 32