hjy-u's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
huggingface/blog
Public repo for HF blog posts
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
stepjam/RLBench
A large-scale benchmark and learning environment.
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
minghanqin/LangSplat
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
mbreuss/diffusion-literature-for-robotics
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
jlin816/dynalang
Code for "Learning to Model the World with Language." ICML 2024 Oral.
liruiw/GenSim
Generating Robotic Simulation Tasks via Large Language Models
vimalabs/VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
flow-diffusion/AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
LostXine/LLaRA
LLaRA: Large Language and Robotics Assistant
H-Freax/Awesome-Video-Robotic-Papers
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
lucidrains/diffusion-policy
Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robotics
LeapLabTHU/GSVA
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
SiyuanHuang95/ManipVQA
[IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Rubics-Xuan/MRES
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
microsoft/PACT
Perception-Action Causal Transformer
SudeepDasari/data4robotics
real-stanford/xskill
[CoRL 2023] XSkill: cross embodiment skill discovery
Dantong88/LLARVA
Azurehappen/UrbanRTK-INS-OutlierOpt
Risk-Averse Optimization framework for RTK-GNSS/INS urban navigation.
yunyikristy/DualMind
abliao/PIVOT-R