liewjunhao

liewjunhao's Stars

Significant-Gravitas/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language:Python148k 1.6k 1.9k32k
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Language:C++71.7k 647 2k7.8k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook48.5k 313 6815.7k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.7k 449 3155.1k
microsoft/TaskMatrix
Language:Python34.3k 316 3393.4k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.9k 381 1822k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.6k 115 3971.4k
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Language:Python12.1k 93 1761k
facebookresearch/DensePose
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
Language:Jupyter Notebook7k 250 2501.3k
openai/consistency_models
Official repo for consistency models.
Language:Python6.2k 59 54423
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Language:Python4.1k 67 73355
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.8k 48 176290
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Language:Python3.4k 40 57195
OpenGVLab/InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Language:Python2.6k 35 268241
isl-org/ZoeDepth
Metric depth estimation from a single image
Language:Jupyter Notebook2.4k 35 119222
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Language:Python2.2k 19 58139
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Language:Python1.3k 25 5290
s9roll7/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
Language:Python1.3k 10 140128
lupantech/chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
Language:Jupyter Notebook1.1k 19 1190
ChenyangLEI/All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
Language:Python719 23 3442
kakaobrain/karlo
Language:Python694 12 1240
LambdaLabsML/lambda-diffusers
Language:Jupyter Notebook567 8 1590
ma-xu/Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
Language:Python549 10 3740
Picsart-AI-Research/PAIR-Diffusion
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Language:Python508 18 1522
ziqihuangg/ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
Language:Python496 20 919
mkocabas/PARE
Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation
Language:Python387 15 4273
zju3dv/Wis3D
A web-based 3D visualization tool for 3D computer vision.
Language:TypeScript289 16 321
TZW1998/Taming-Stable-Diffusion-with-Human-Ranking-Feedback
This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et al. https://arxiv.org/abs/2303.03751
Language:Jupyter Notebook196 7 221
ZZWENG/Diffusion_HPC
Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)
Language:Python42 3 32
RaymondWang987/FMNet
The official project of ACM MM 2022 paper "Less is More: Consistent Video Depth Estimation with Masked Frames Modeling".
Language:Python35 9 32