TDW Multi-Modal Challenge

Search for a dropped object in a room using the Magnebot API and pre-calculated audio that was generated by a physics simulation of the falling object.

Setup

If you currently have tdw installed: pip3 uninstall tdw
If you currently have magnebot installed: pip3 uninstall magnebot
git clone https://github.com/alters-mit/multimodal_challenge.git
cd multimodal_challenge
git checkout distractors
pip3 install -e .
Download TDW build 1.8.29
(Optional) Download the asset bundles (read this for more information).

`MultiModal` challenge controller

Use a MultiModal controller to initialize a scene, populate it with objects (including the target object), and load the corresponding audio data.

Read the API documentation.

from multimodal_challenge.multimodal import MultiModal

m = MultiModal()
m.init_scene(scene="mm_kitchen_1a", layout=0, trial=57)

Audio Dataset Generation

`init_data.py`

This is a backend tool for TDW developers to convert saved TDW commands into initialization instructions.

Further documentation here.

`occupancy_mapper.py`

This script generates and saves two occupancy maps:

An occupancy map indicating which cells are free, occupied by an object, or not within the scene
An occupancy map indicating where a Magnebot can be added to the scene.

Further documentation here.

`rehearsal.py`

Define DatasetTrial initialization parameters for distractor objects and the target object. Drop the objects. If they land in acceptable positions, and if there is a position to add the Magnebot in the scene, record the DatasetTrial. This will give dataset.py initialization parameters.

Further documentation here.

`dataset.py`

Use the initialization data generated by rehearsal.py to create Trials. A Trial is initialization data for each object in the scene (position, rotation, etc.), initialization data for the Magenbot, a .wav file of the audio, and an occupancy map as a .npy file.

Further documentation here.

alters-mit/multimodal_challenge

TDW Multi-Modal Challenge

Setup

MultiModal challenge controller

Audio Dataset Generation

init_data.py

occupancy_mapper.py

rehearsal.py

dataset.py

Changelog

`MultiModal` challenge controller

`init_data.py`

`occupancy_mapper.py`

`rehearsal.py`

`dataset.py`