XuweiyiChen

Ph.D at UVA | MS at UMich | BS at UW Working on 3D Computer Vision and Language.

Pinned Repositories

SelfExplain
Language:Python36 4 813
3D-GRAND
Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
18 5 00
chat-with-nerf
Chat with NeRF enables users to interact with a NeRF model by typing in natural language.
Language:Python273 5 1116
3D-VisTA
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
Language:Python00
FastApi-PostgreSQL-RESTful-APIs
Language:Python1 3 00
Pedestrian_vehicles
This is a project that can help identify pedestrian and vehicles in urban environment.
Language:Jupyter Notebook2 1 20
Pix2Gif
Language:Python6 0 05
SSC
Semantic Scene Completion
Language:Python0 0 00
UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
52 4 32
vocabularyApp
A app targets on how to remember vocabulary
Language:JavaScript1 1 00

XuweiyiChen's Repositories

XuweiyiChen/UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
52 4 32
XuweiyiChen/Pix2Gif
Language:Python6 0 05
XuweiyiChen/AnimateDiff
Official implementation of AnimateDiff.
Language:Python0 0 00
XuweiyiChen/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Language:Python0 0
XuweiyiChen/blender
Official mirror of Blender
XuweiyiChen/busy_gpu
Language:Shell
XuweiyiChen/CameraCtrl
Language:Python0 0
XuweiyiChen/ControlNet
Let us control diffusion models!
Language:Python0 0
XuweiyiChen/DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
Language:Python0 0
XuweiyiChen/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
XuweiyiChen/deformable-attention
Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"
XuweiyiChen/dream-in-4d
Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]
Language:Python0 0
XuweiyiChen/dust3r
DUSt3R: Geometric 3D Vision Made Easy
Language:Python0 0
XuweiyiChen/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
XuweiyiChen/embodied-generalist
Official code repository for 3D embodied generalist agent LEO
Language:Python0 0
XuweiyiChen/FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Language:Jupyter Notebook0 0
XuweiyiChen/grok-1
Grok open release
Language:Python0 0
XuweiyiChen/GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python0 0
XuweiyiChen/InternVL
[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B
Language:Jupyter Notebook0 0
XuweiyiChen/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
XuweiyiChen/jtd-remote
Example of Just the Docs as a remote theme
Language:SCSS0 0
XuweiyiChen/LLaVA_Attn_Control
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python0 0
XuweiyiChen/open_flamingo
An open-source framework for training large multimodal models.
XuweiyiChen/StoryDiffusion
Create Magic Story!
XuweiyiChen/transformer-debugger
Language:Python0 0
XuweiyiChen/transformers_attn_control
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
XuweiyiChen/TripoSR
Language:Python0 0
XuweiyiChen/trl
Train transformer language models with reinforcement learning.
XuweiyiChen/VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Language:Python0 0
XuweiyiChen/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Language:Python0 0