alexpashevich/E.T.
Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal transformer that encodes language inputs and the full episode history of visual observations and actions.
CMIT
Issues
- 0
visualization code
#20 opened - 0
Fine-tuning?
#19 opened - 0
- 1
- 0
- 1
Evaluating only sub-goals
#14 opened - 5
- 6
- 0
render_trajs Seems to get stuck
#9 opened - 10
Stuck while rendering trajectory
#8 opened - 1
- 6
Error in trying evaluation task
#6 opened - 2
- 3
Trajectories were skipped
#4 opened - 6
Share 45k synthetic trajectories
#3 opened - 1
- 1
Not able to render on Colab
#1 opened