v-prgmr's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Lightning-AI/LitServe
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
autonomousvision/mip-splatting
[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting
facebookresearch/MobileLLM
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
dcharatan/pixelsplat
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann
minghanqin/LangSplat
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
modelscope/richdreamer
Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
princeton-computational-imaging/NSF
Official code repository for the paper: "Neural Spline Fields for Burst Image Fusion and Layer Separation"
magic-research/InstaDrag
Experiencing lightning fast (~1s) and accurate drag-based image editing
shivangi-aneja/FaceTalk
[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
JeremyCJM/DiffSHEG
[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
Understanding-Visual-Datasets/VisDiff
Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)
LLM360/amber-data-prep
Data preparation code for Amber 7B LLM
Carmenw1203/DanceCamera3D-Official
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance. [CVPR 2024] Official PyTorch implementation
akx/ggify
Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp
ZhanxyR/SHERT
[CVPR'24 Oral] Official Pytorch implementation for Semantic Human Mesh Reconstruction with Textures.
cnhaox/NeRF-HuGS
Reference implementation of CVPR 2024 (Oral) paper "NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation"
zhuyr97/Reflection_RemoVal_CVPR2024
clova-tool/CLOVA-tool
r4dl/LAENeRF
Original reference implementation of "LAENeRF: Local Appearance Editing for Neural Radiance Fields"
facebookresearch/SIEVE
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)
VJWQ/AV-CONV
Official code release for "The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective"
KVGandikota/Text-guidedSR
Code for the paper Text-guided Explorable Image Super-resolution
valleballe/depthfusion
iamlemec/llama.cpp
Port of Facebook's LLaMA model in C/C++
yilong2001/berts.cpp
基于GGML 的 bert 模型家族推理服务,支持分类模型、seq2seq 文本生成模型等等