embodied-ai
There are 67 repositories under the embodied-ai topic.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
unrealcv/unrealcv
UnrealCV: Connecting Computer Vision to Unreal Engine
facebookresearch/theseus
A library for differentiable nonlinear optimization
dora-rs/dora
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low-latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
haosulab/ManiSkill
SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
OpenDriveLab/DriveAGI
[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System
allenai/procthor
🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses
huangwl18/language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
geng-haoran/Simulately
A universal summary of current robotics simulators
MarSaKi/VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
MarSaKi/ETPNav
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
haoranD/Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
thunlp/LEGENT
Open Platform for Embodied Agents
rllab-snu/RNR-Map
Official GitHub repository for "Renderable Neural Radiance Map for Visual Navigation". (CVPR 2023)
simpler-env/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
allenai/manipulathor
ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm
MarSaKi/NvEM
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
zd11024/NaviLLM
[CVPR 2024] Code for the paper 'Towards Learning a Generalist Model for Embodied Navigation'
YicongHong/Discrete-Continuous-VLN
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
eric-ai-lab/VLMbench
NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
Xiaoming-Zhao/PointNav-VO
[ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"
BraveGroup/SheetCopilot
We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.
csiro-robotics/Uncertainty-LPR
📣 [IEEE IROS 2023] Official Repository of IROS 23 paper "Uncertainty-Aware Lidar Place Recognition in Novel Environments"
rllab-snu/Visual-Graph-Memory
Official GitHub Repository for paper "Visual Graph Memory with Unsupervised Representation for Visual Navigation", ICCV 2021
3dlg-hcvc/hssd
Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.
HCPLab-SYSU/Book-of-MLM
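Book "Multimodal Large Models: A New-Generation Paradigm of Artificial Intelligence Technology" (《多模态大模型:新一代人工智能技术范式》), by Liu Yang and Lin Liang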
HanqingWangAI/Active_VLN
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
2toinf/DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
FudanDISC/ReForm-Eval
A benchmark for evaluating the capabilities of large vision-language models (LVLMs)
allenai/phone2proc
📱👉🏠 Perform conditional procedural generation to generate houses like your own!
CEC-Agent/CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"