liewjunhao's Stars
Significant-Gravitas/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
microsoft/TaskMatrix
microsoft/JARVIS
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
facebookresearch/DensePose
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
openai/consistency_models
Official repo for consistency models.
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
OpenGVLab/InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
isl-org/ZoeDepth
Metric depth estimation from a single image
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
s9roll7/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
lupantech/chameleon-llm
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
ChenyangLEI/All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
kakaobrain/karlo
LambdaLabsML/lambda-diffusers
ma-xu/Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
Picsart-AI-Research/PAIR-Diffusion
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
ziqihuangg/ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
mkocabas/PARE
Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation
zju3dv/Wis3D
A web-based 3D visualization tool for 3D computer vision.
TZW1998/Taming-Stable-Diffusion-with-Human-Ranking-Feedback
This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et al. https://arxiv.org/abs/2303.03751
ZZWENG/Diffusion_HPC
Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)
RaymondWang987/FMNet
The official project of ACM MM 2022 paper "Less is More: Consistent Video Depth Estimation with Masked Frames Modeling".