yanxinhao's Stars

isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
Language: C++, 11.3k stars, 2.3k forks
ziqihuangg/Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
17111
VAST-AI-Research/TripoSR
Language: Python, 4.4k stars, 505 forks
SonSang/dmesh
Official implementation for "DMesh: A Differentiable Representation for General Meshes" (NeurIPS 2024)
Language:Python2376
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Language:Python73344
hwjiang1510/LEAP
[ICLR 2024] Code for LEAP: Liberate Sparse-view 3D Modeling from Camera Poses
Language:Python1706
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Python, 4k stars, 303 forks
salesforce/ULIP
Language:Python42139
guochengqian/PointNeXt
[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
Language:Shell770111
qizekun/ShapeLLM
[ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Language:Python1289
VITA-Group/LightGaussian
[NeurIPS 2024 Spotlight] "LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
Language:Python55450
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language: Python, 2.9k stars, 188 forks
mikeqzy/3dgs-avatar-release
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting
Language:Python31730
LMD0311/PointMamba
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
Language:Python33823
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language: Python, 3.2k stars, 277 forks
bytedance/DEADiff
[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"
Language:Python2144
state-spaces/mamba
Mamba SSM architecture
Language: Python, 12.8k stars, 1.1k forks
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
80510
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Language: Python, 3.1k stars, 329 forks
instantX-research/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Language: Jupyter Notebook, 1.6k stars, 102 forks
justimyhxu/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
53432
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python, 10.4k stars, 669 forks
OpenRobotLab/PointLLM
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
Language:Python55524
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
Language: Python, 1.1k stars, 134 forks
agiresearch/AIOS
AIOS: LLM Agent Operating System
Language: Python, 3.3k stars, 393 forks
3DTopia/ThemeStation
[SIGGRAPH 2024] ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
Language:Python2038
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language: Jupyter Notebook, 5.1k stars, 333 forks
donydchen/mvsplat
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Language:Python75336
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language: Python, 25.4k stars, 5.3k forks
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python, 52.6k stars, 5.6k forks