/ai2thor

An open-source platform for Visual AI.

Primary LanguageC#Apache License 2.0Apache-2.0

A Near Photo-Realistic Interactable Framework for Embodied AI Agents

GitHub release License Documentation

🏡 Environments

iTHOR ManipulaTHOR RoboTHOR
A high-level interaction framework that facilitates research in embodied common sense reasoning. A mid-level interaction framework that facilitates visual manipulation of objects using a robotic arm. A framework that facilitates Sim2Real research with a collection of simlated scene counterparts in the physical world.

🌍 Features

🏡 Scenes. 200+ custom built high-quality scenes. The scenes can be explored on our demo page. We are working on rapidly expanding the number of available scenes and domain randomization within each scene.

🪑 Objects. 1500+ custom designed household objects across 100+ object types. Each object is heavily annotated, which allows for near-realistic physics interaction.

🤖 Agent Types. Multi-agent support, a custom built LoCoBot agent, a Kinova 3 inspired robotic manipulation agent, and a drone agent.

🦾 Actions. 200+ actions that facilitate research in a wide range of interaction and navigation based embodied AI tasks.

🖼 Images. First-class support for many image modalities and camera adjustments. Some modalities include ego-centric RGB images, instance segmentation, semantic segmentation, depth frames, normals frames, top-down frames, orthographic projections, and third-person camera frames. User's can also easily change camera properties, such as the size of the images and field of view.

🗺 Metadata. After each step in the environment, there is a large amount of sensory data available about the state of the environment. This information can be used to build highly complex custom reward functions.

📰 Latest Announcements

Date Announcement
5/2021
RandomizeMaterials.mp4
RandomizeMaterials is now supported! It enables a massive amount of realistic looking domain randomization within each scene. Try it out on the demo
4/2021 We are excited to release ManipulaTHOR, an environment within the AI2-THOR framework that facilitates visual manipulation of objects using a robotic arm. Please see the full 3.0.0 release notes here.
4/2021
iTHOR-RandomizeLighting.mp4
RoboTHOR-RandomizeLighting.mp4
RandomizeLighting is now supported! It includes many tunable parameters to allow for vast control over its effects. Try it out on the demo!

2/2021 We are excited to host the AI2-THOR Rearrangement Challenge, RoboTHOR ObjectNav Challenge, and ALFRED Challenge, held in conjunction with the Embodied AI Workshop at CVPR 2021.
2/2021 AI2-THOR v2.7.0 announces several massive speedups to AI2-THOR! Read more about it here.
6/2020 We've released 🐳 AI2-THOR Docker a mini-framework to simplify running AI2-THOR in Docker.
4/2020 Version 2.4.0 update of the framework is here. All sim objects that aren't explicitly part of the environmental structure are now moveable with physics interactions. New object types have been added, and many new actions have been added. Please see the full 2.4.0 release notes here.
2/2020 AI2-THOR now includes two frameworks: iTHOR and RoboTHOR. iTHOR includes interactive objects and scenes and RoboTHOR consists of simulated scenes and their corresponding real world counterparts.
9/2019 Version 2.1.0 update of the framework has been added. New object types have been added. New Initialization actions have been added. Segmentation image generation has been improved in all scenes.
6/2019 Version 2.0 update of the AI2-THOR framework is now live! We have over quadrupled our action and object states, adding new actions that allow visually distinct state changes such as broken screens on electronics, shattered windows, breakable dishware, liquid fillable containers, cleanable dishware, messy and made beds and more! Along with these new state changes, objects have more physical properties like Temperature, Mass, and Salient Materials that are all reported back in object metadata. To combine all of these new properties and actions, new context sensitive interactions can now automatically change object states. This includes interactions like placing a dirty bowl under running sink water to clean it, placing a mug in a coffee machine to automatically fill it with coffee, putting out a lit candle by placing it in water, or placing an object over an active stove burner or in the fridge to change its temperature. Please see the full 2.0 release notes here to view details on all the changes and new features.

💻 Installation

With pip

pip install ai2thor

With conda

conda install -c conda-forge ai2thor

With Docker

🐳 AI2-THOR Docker can be used, which adds the configuration for running a X server to be used by Unity 3D to render scenes.

Minimal Example

Once you've installed AI2-THOR, you can verify that everything is working correctly by running the following minimal example:

from ai2thor.controller import Controller
controller = Controller(scene="FloorPlan10")
event = controller.step(action="RotateRight")
metadata = event.metadata
print(event, event.metadata.keys())

Requirements

Component Requirement
OS Mac OS X 10.9+, Ubuntu 14.04+
Graphics Card DX9 (shader model 3.0) or DX11 with feature level 9.3 capabilities.
CPU SSE2 instruction set support.
Python Versions 3.5+
Linux X server with GLX module enabled

💬 Support

Questions. If you have any questions on AI2-THOR, please ask them on our GitHub Discussions Page.

Issues. If you encounter any issues while using AI2-THOR, please open an Issue on GitHub.

🏫 Learn more

Section Description
Demo Interact and play with AI2-THOR live in the browser.
iTHOR Documentation Documentation for the iTHOR environment.
ManipulaTHOR Documentation Documentation for the ManipulaTHOR environment.
RoboTHOR Documentation Documentation for the RoboTHOR environment.
AI2-THOR Colab A way to run AI2-THOR freely on the cloud using Google Colab.
AllenAct An Embodied AI Framework build at AI2 that provides first-class support for AI2-THOR.
AI2-THOR Unity Development A (sparse) collection of notes that may be useful if editing on the AI2-THOR backend.
AI2-THOR WebGL Development Documentation on packaging AI2-THOR for the web, which might be useful for annotation based tasks.

📒 Citation

If you use iTHOR, please cite the original AI2-THOR paper:

@article{ai2thor,
  author={Eric Kolve and Roozbeh Mottaghi and Winson Han and
          Eli VanderBilt and Luca Weihs and Alvaro Herrasti and
          Daniel Gordon and Yuke Zhu and Abhinav Gupta and
          Ali Farhadi},
  title={{AI2-THOR: An Interactive 3D Environment for Visual AI}},
  journal={arXiv},
  year={2017}
}

If you use ManipulaTHOR, please cite the following paper:

@inproceedings{manipulathor,
  title={{ManipulaTHOR: A Framework for Visual Object Manipulation}},
  author={Kiana Ehsani and Winson Han and Alvaro Herrasti and
          Eli VanderBilt and Luca Weihs and Eric Kolve and
          Aniruddha Kembhavi and Roozbeh Mottaghi},
  booktitle={CVPR},
  year={2021}
}

If you use RoboTHOR, please cite the following paper:

@inproceedings{robothor,
  author={Matt Deitke and Winson Han and Alvaro Herrasti and
          Aniruddha Kembhavi and Eric Kolve and Roozbeh Mottaghi and
          Jordi Salvador and Dustin Schwenk and Eli VanderBilt and
          Matthew Wallingford and Luca Weihs and Mark Yatskar and
          Ali Farhadi},
  title={{RoboTHOR: An Open Simulation-to-Real Embodied AI Platform}},
  booktitle={CVPR},
  year={2020}
}

👋 Our Team

AI2-THOR is an open-source project built by the PRIOR team at the Allen Institute for AI (AI2). AI2 is a non-profit institute with the mission to contribute to humanity through high-impact AI research and engineering.