embodied-ai

There are 67 repositories under the embodied-ai topic.

  • Luodian/Otter

    🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

    Language: Python · Stars: 3.5k
  • unrealcv/unrealcv

    UnrealCV: Connecting Computer Vision to Unreal Engine

    Language: C++ · Stars: 1.8k
  • facebookresearch/theseus

    A library for differentiable nonlinear optimization

    Language: Python · Stars: 1.6k
  • dora-rs/dora

    DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low-latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.

    Language: Rust · Stars: 1.3k
  • hyp1231/awesome-llm-powered-agent

    Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

  • zchoi/Awesome-Embodied-Agent-with-LLMs

    This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!

  • OpenRL-Lab/openrl

    Unified Reinforcement Learning Framework

    Language: Python · Stars: 580
  • haosulab/ManiSkill

    SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

    Language: Python · Stars: 524
  • huangwl18/VoxPoser

    VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

    Language: Python · Stars: 424
  • OpenDriveLab/DriveAGI

    [Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System

    Language: Python · Stars: 419
  • allenai/procthor

    🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

    Language: Python · Stars: 235
  • huangwl18/language-planner

    Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

    Language: Jupyter Notebook · Stars: 229
  • geng-haoran/Simulately

    A universal summary of current robotics simulators

    Language: TypeScript · Stars: 199
  • MarSaKi/VLN-BEVBert

    [ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"

    Language: Python · Stars: 167
  • MarSaKi/ETPNav

    [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

    Language: Python · Stars: 166
  • haoranD/Awesome-Embodied-AI

    A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

  • thunlp/LEGENT

    Open Platform for Embodied Agents

    Language: Python · Stars: 142
  • rllab-snu/RNR-Map

    Official GitHub repository for "Renderable Neural Radiance Map for Visual Navigation" (CVPR 2023).

  • simpler-env/SimplerEnv

    Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)

    Language: Jupyter Notebook · Stars: 105
  • allenai/manipulathor

    ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

    Language: Jupyter Notebook · Stars: 88
  • MarSaKi/NvEM

    [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"

    Language: C++ · Stars: 77
  • zd11024/NaviLLM

    [CVPR 2024] Code for the paper "Towards Learning a Generalist Model for Embodied Navigation"

    Language: Python · Stars: 77
  • YicongHong/Discrete-Continuous-VLN

    Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation

    Language: Python · Stars: 76
  • eric-ai-lab/VLMbench

    NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

    Language: Python · Stars: 74
  • Xiaoming-Zhao/PointNav-VO

    [ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"

    Language: Python · Stars: 60
  • BraveGroup/SheetCopilot

    We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.

    Language: Python · Stars: 58
  • csiro-robotics/Uncertainty-LPR

    📣 [IEEE IROS 2023] Official Repository of IROS 23 paper "Uncertainty-Aware Lidar Place Recognition in Novel Environments"

    Language: Python · Stars: 57
  • yyvhang/lemon_3d

    Language: Python · Stars: 56
  • rllab-snu/Visual-Graph-Memory

    Official GitHub Repository for paper "Visual Graph Memory with Unsupervised Representation for Visual Navigation", ICCV 2021

    Language: Python · Stars: 53
  • 3dlg-hcvc/hssd

    Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.

    Language: Python · Stars: 46
  • HCPLab-SYSU/Book-of-MLM

    "Multimodal Large Models: A New-Generation Paradigm for Artificial Intelligence Technology", by Liu Yang and Lin Liang

    Language: HTML · Stars: 46
  • HanqingWangAI/Active_VLN

    Repository for the ECCV 2020 paper "Active Visual Information Gathering for Vision-Language Navigation"

    Language: Python · Stars: 43
  • 2toinf/DecisionNCE

    [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

    Language: Python · Stars: 42
  • FudanDISC/ReForm-Eval

    A benchmark for evaluating the capabilities of large vision-language models (LVLMs)

    Language: Python · Stars: 32
  • allenai/phone2proc

    📱👉🏠 Perform conditional procedural generation to generate houses like your own!

    Language: Python · Stars: 30
  • CEC-Agent/CEC

    Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"

    Language: Python · Stars: 30