krahets's Stars
black-forest-labs/flux
Official inference repo for FLUX.1 models
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
guoyww/AnimateDiff
Official implementation of AnimateDiff.
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
mli/autocut
用文本编辑器剪视频
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
kohya-ss/sd-scripts
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Kwai-Kolors/Kolors
Kolors Team
richzhang/PerceptualSimilarity
LPIPS metric. pip install lpips
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
donydchen/mvsplat
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
bytedance/ImageDream
The code releasing for https://image-dream.github.io/
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
apple/ARKitScenes
This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and process assets, and training code described in our paper.
zju3dv/EasyVolcap
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
ytrock/THuman2.0-Dataset
zhenzhiwang/HumanVid
[NeurIPS D&B Track 2024] Official implementation of HumanVid
SangHunHan92/2K2K
Official Code and Dataset for "High-fidelity 3D Human Digitization from Single 2K Resolution Images" (CVPR 2023 Highlight)
GAP-LAB-CUHK-SZ/MVHumanNet
scannetpp/scannetpp
[ICCV 2023 Oral] ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes
BayesRays/BayesRays
Official Code for Bayes' Rays Paper