XuweiyiChen
1st-year Ph.D. student at UVA | MS at UMich | BS at UW. Working on 3D Computer Vision and Language.
Pinned Repositories
SelfExplain
3D-GRAND
Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
chat-with-nerf
Chat with NeRF enables users to interact with a NeRF model by typing in natural language.
moh
Official Repository of Multi-Object Hallucination in Vision-Language Models (NeurIPS 2024)
3D-Diffusion-Policy
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
FastApi-PostgreSQL-RESTful-APIs
Pedestrian_vehicles
A project that helps identify pedestrians and vehicles in urban environments.
Pix2Gif
UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
vocabularyApp
An app that helps users remember vocabulary.
XuweiyiChen's Repositories
XuweiyiChen/UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
XuweiyiChen/3D-Diffusion-Policy
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
XuweiyiChen/awesome-multimodal-self-correction
XuweiyiChen/blender
Official mirror of Blender
XuweiyiChen/BlenderGPT
Use commands in English to control Blender with OpenAI's GPT-4
XuweiyiChen/busy_gpu
XuweiyiChen/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
XuweiyiChen/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
XuweiyiChen/feature-3dgs
[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
XuweiyiChen/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
XuweiyiChen/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
XuweiyiChen/manim
Animation engine for explanatory math videos
XuweiyiChen/mast3r
Grounding Image Matching in 3D with MASt3R
XuweiyiChen/mdlm
Simplified Masked Diffusion Language Model
XuweiyiChen/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
XuweiyiChen/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
XuweiyiChen/octo-template
XuweiyiChen/OpenLRM
An open-source impl. of Large Reconstruction Models
XuweiyiChen/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
XuweiyiChen/probe3d
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
XuweiyiChen/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
XuweiyiChen/scannetpp
XuweiyiChen/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
XuweiyiChen/shape-of-motion
XuweiyiChen/SOLO
Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
XuweiyiChen/spann3r
3D Reconstruction with Spatial Memory
XuweiyiChen/ssl_eval_protocols
XuweiyiChen/StoryDiffusion
Create Magic Story!
XuweiyiChen/xuweiyichen.github.oi
XuweiyiChen/zeno-hub-uva
AI Evaluation Platform