yanxinhao's Stars
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
ziqihuangg/Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
VAST-AI-Research/TripoSR
SonSang/dmesh
Official implementation for "DMesh: A Differentiable Representation for General Meshes" (NeurIPS 2024)
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
hwjiang1510/LEAP
[ICLR 2024] Code for LEAP: Liberate Sparse-view 3D Modeling from Camera Poses
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
salesforce/ULIP
guochengqian/PointNeXt
[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
qizekun/ShapeLLM
[ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
VITA-Group/LightGaussian
[NeurIPS 2024 Spotlight]"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
mikeqzy/3dgs-avatar-release
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting
LMD0311/PointMamba
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
bytedance/DEADiff
[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"
state-spaces/mamba
Mamba SSM architecture
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
instantX-research/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
justimyhxu/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
OpenRobotLab/PointLLM
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
agiresearch/AIOS
AIOS: LLM Agent Operating System
3DTopia/ThemeStation
[SIGGRAPH 2024] ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
donydchen/mvsplat
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.