Pinned Repositories
SelfExplain
3D-GRAND
Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
chat-with-nerf
Chat with NeRF enables users to interact with a NeRF model by typing in natural language.
3D-VisTA
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
FastApi-PostgreSQL-RESTful-APIs
Pedestrian_vehicles
A project that helps identify pedestrians and vehicles in urban environments.
Pix2Gif
SSC
Semantic Scene Completion
UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
vocabularyApp
An app that helps users remember vocabulary.
XuweiyiChen's Repositories
XuweiyiChen/UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
XuweiyiChen/Pix2Gif
XuweiyiChen/AnimateDiff
Official implementation of AnimateDiff.
XuweiyiChen/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
XuweiyiChen/blender
Official mirror of Blender
XuweiyiChen/busy_gpu
XuweiyiChen/CameraCtrl
XuweiyiChen/ControlNet
Let us control diffusion models!
XuweiyiChen/DAT
Repository of Vision Transformer with Deformable Attention (CVPR 2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
XuweiyiChen/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
XuweiyiChen/deformable-attention
Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"
XuweiyiChen/dream-in-4d
Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]
XuweiyiChen/dust3r
DUSt3R: Geometric 3D Vision Made Easy
XuweiyiChen/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
XuweiyiChen/embodied-generalist
Official code repository for 3D embodied generalist agent LEO
XuweiyiChen/FeatUp
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
XuweiyiChen/grok-1
Grok open release
XuweiyiChen/GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
XuweiyiChen/InternVL
[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. An open-source alternative to ViT-22B.
XuweiyiChen/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
XuweiyiChen/jtd-remote
Example of Just the Docs as a remote theme
XuweiyiChen/LLaVA_Attn_Control
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
XuweiyiChen/open_flamingo
An open-source framework for training large multimodal models.
XuweiyiChen/StoryDiffusion
Create Magic Story!
XuweiyiChen/transformer-debugger
XuweiyiChen/transformers_attn_control
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
XuweiyiChen/TripoSR
XuweiyiChen/trl
Train transformer language models with reinforcement learning.
XuweiyiChen/VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
XuweiyiChen/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.