sxyu's Stars
lllyasviel/Fooocus
Focus on prompting and generating
Sanster/IOPaint
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
THUDM/CogVLM
A state-of-the-art-level open visual language model | multimodal pre-trained model
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Bing-su/adetailer
Automatic detection, masking, and inpainting with a detection model.
TencentARC/T2I-Adapter
T2I-Adapter
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
zju3dv/EasyVolcap
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
nv-tlabs/nglod
Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
bytedance/ImageDream
Code release for https://image-dream.github.io/
baegwangbin/DSINE
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
zju3dv/LoG
Level of Gaussians
princeton-vl/DPVO
Deep Patch Visual Odometry/SLAM
jonbarron/camp_zipnerf
dakenf/diffusers.js
diffusers implementation for node.js and browser
sherwinbahmani/4dfy
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
google-research/sparf
This is the official code release for SPARF: Neural Radiance Fields from Sparse and Noisy Poses [CVPR 2023-Highlight]
ashawkey/kiuikit
A toolkit for 3D computer vision tasks.
Shopify/screenshot-glb
A command line utility for taking screenshots of glTF 2.0 Binary 3D model files
sayakpaul/cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
YooPaul/dreamsparse
Geometry-aware Novel View Synthesis with Pre-trained 2D Prior
praveen-palanisamy/webgym
WebGym: Web-browser-based tasks for RL Agents