HeliosZhao's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
lllyasviel/ControlNet
Let us control diffusion models!
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
openai/consistency_models
Official repo for consistency models.
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
TencentARC/T2I-Adapter
T2I-Adapter
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Anything-of-anything/Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
NVlabs/ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
nihaomiao/CVPR23_LFDM
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
sihyun-yu/PVDM
Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).
patrickvonplaten/controlnet_aux
CVMI-Lab/PLA
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
real-stanford/semantic-abstraction
[CoRL 2022] This repository contains code for generating relevancies, training, and evaluating Semantic Abstraction.
ICTMCG/FakeSV
Official repository for "FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms", AAAI 2023.
showlab/ShowAnything
AmingWu/Single-DGOD
saltoricristiano/cosmix-uda
Official PyTorch implementation of the ECCV 2022 paper "CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation"
LuigiRiz/NOPS
Official implementation of the CVPR 2023 paper "Novel Class Discovery for 3D Point Cloud Semantic Segmentation"
HeliosZhao/SHADE-VisualDG
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization
VITA-Group/MLSP
[ECCV 2022] "Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction" by Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang
HeliosZhao/SFOCDA
Source-Free Open Compound Domain Adaptation in Semantic Segmentation. IEEE TCSVT