gComm: A Python repository from anirbanl

gComm: An environment for investigating generalization in Grounded Language Acquisition

gComm is a step towards developing a robust platform to foster research in grounded language acquisition in a more challenging and realistic setting. It comprises a 2-d grid environment with a set of agents (a stationary speaker and a mobile listener connected via a communication channel) exposed to a continuous array of tasks in a partially observable setting. The key to solving these tasks lies in the agents developing linguistic abilities and using them to explore the environment efficiently. The speaker and listener have access to information in different modalities: the speaker's input is a natural language instruction that contains the target and task specifications, and the listener's input is its grid-view. Each must rely on the other to complete the assigned task, and the only way they can do so is to develop and use some form of communication. gComm provides several tools for studying different forms of communication and assessing their generalization.

Getting Started
Baselines
Demos
Additional Features
Publications

This repository contains the code for the environment, including the baselines and metrics.

Getting Started

To set up the environment:

$ git clone https://github.com/SonuDixit/gComm.git
$ cd gComm/
$ python setup.py install  # install the setuptools package before running this line

Run the following to check that the environment works:

$ python test_package.py

Input actions manually: <'left', 'right', 'forward', 'backward', 'push', 'pull', 'pickup', 'drop'>

Important Arguments

Arguments can be found in the file: gComm/arguments.py

Environment arguments: --grid_size, --min_other_objects, --max_objects
Grammar and Vocabulary arguments: --type_grammar, --transitive_verbs, --nouns, --color_adjectives, --size_adjectives, --all_light, --keep_fixed_weights
RL-framework: --num_episodes, --episode_len, --grid_input_type
Communication Channel: --comm_type
Rendering: --render_episode

$ python baselines.py --render_episode
$ python baselines.py --render_episode --wait_time 0.6  # slower rendering (default: 0.3)
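
As a rough illustration of how flags like these are typically declared, here is a minimal argparse sketch. It is not the contents of gComm/arguments.py: the function name `build_parser`, the argument types, and the comma-splitting of `--transitive_verbs` are assumptions made for this example. Only the flag names and the 0.3 default for `--wait_time` come from this README.

```python
import argparse


def build_parser():
    """Hypothetical subset of the gComm flags; types and defaults are assumed."""
    parser = argparse.ArgumentParser(description="Illustrative gComm-style flags")
    # Environment
    parser.add_argument("--grid_size", type=int, default=None)
    # Grammar and vocabulary (values seen in this README: simple_intrans, simple_trans)
    parser.add_argument("--type_grammar", type=str, default=None)
    # Assumption: a comma-separated list such as "push,pull" is split into a Python list
    parser.add_argument("--transitive_verbs", type=lambda s: s.split(","), default=None)
    # Communication channel (values seen in this README include categorical, random, fixed)
    parser.add_argument("--comm_type", type=str, default=None)
    # Rendering: a boolean switch plus the wait time between frames (README default: 0.3)
    parser.add_argument("--render_episode", action="store_true")
    parser.add_argument("--wait_time", type=float, default=0.3)
    return parser


args = build_parser().parse_args(
    ["--type_grammar", "simple_trans", "--transitive_verbs", "push,pull",
     "--render_episode", "--wait_time", "0.6"]
)
print(args.transitive_verbs, args.render_episode, args.wait_time)
```

Consult gComm/arguments.py for the actual definitions and defaults.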

Baselines

Task	Baseline	Convergence Rewards
Walk	Simple Speaker	0.70
Walk	Random Speaker	0.40
Walk	Fixed Speaker	0.43
Walk	Perfect Speaker	0.95
Walk	Oracle Listener	0.99
Push & Pull	Simple Speaker	0.55
Push & Pull	Random Speaker	0.19
Push & Pull	Fixed Speaker	0.15
Push & Pull	Perfect Speaker	0.85
Push & Pull	Oracle Listener	0.90

To run each baseline:

Simple Speaker (Categorical)

# walk
$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type categorical

# push and pull
$ python baselines.py --type_grammar simple_trans --transitive_verbs push,pull --min_other_objects 2 --max_objects 2 --grid_input_type vector --all_light --num_episodes 400000 --episode_len 10 --comm_type categorical

Random Speaker

# walk
$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 200000 --episode_len 10 --comm_type random

# push and pull
$ python baselines.py --type_grammar simple_trans --transitive_verbs push,pull --min_other_objects 2 --max_objects 2 --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type random

Fixed Speaker

# walk
$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 200000 --episode_len 10 --comm_type fixed

# push and pull
$ python baselines.py --type_grammar simple_trans --transitive_verbs push,pull --min_other_objects 2 --max_objects 2 --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type fixed

Perfect Speaker

# walk
$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 200000 --episode_len 10 --comm_type perfect

# push and pull
$ python baselines.py --type_grammar simple_trans --transitive_verbs push,pull --min_other_objects 2 --max_objects 2 --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type perfect

Oracle Listener

# walk
$ python baselines.py --type_grammar simple_intrans --grid_input_type with_target --all_light --num_episodes 200000 --episode_len 10 --comm_type oracle

# push and pull
$ python baselines.py --type_grammar simple_trans --transitive_verbs push,pull --min_other_objects 2 --max_objects 2 --grid_input_type with_target --all_light --num_episodes 300000 --episode_len 10 --comm_type oracle
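
The baseline commands above share most of their flags and differ only in the task, the communication type, the episode count, and the grid input type. The sketch below builds such command lines programmatically. The helper `baseline_command` and the task keys `"walk"` and `"push_pull"` are hypothetical names introduced for this example and are not part of gComm. The flag values are copied from the commands in this section.

```python
import shlex


def baseline_command(task, comm_type, num_episodes, grid_input_type="vector"):
    """Return a baselines.py command line for the given task (hypothetical helper)."""
    if task == "walk":
        task_flags = ["--type_grammar", "simple_intrans"]
    elif task == "push_pull":
        task_flags = [
            "--type_grammar", "simple_trans",
            "--transitive_verbs", "push,pull",
            "--min_other_objects", "2",
            "--max_objects", "2",
        ]
    else:
        raise ValueError(f"unknown task: {task!r}")

    argv = [
        "python", "baselines.py", *task_flags,
        "--grid_input_type", grid_input_type,
        "--all_light",
        "--num_episodes", str(num_episodes),
        "--episode_len", "10",
        "--comm_type", comm_type,
    ]
    # shlex.join quotes arguments where needed, so the string is safe to paste into a shell
    return shlex.join(argv)


# Simple Speaker on the walk task
print(baseline_command("walk", "categorical", 300000))
# Oracle Listener on push and pull: note the different grid input type
print(baseline_command("push_pull", "oracle", 300000, grid_input_type="with_target"))
```

To launch a run from Python instead of printing it, the same argument list could be passed to `subprocess.run`.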

Demos

1. WALK; 2. PUSH; 3. PULL

Additional Features

1. Levels: mazes and obstacles

Maze parameters: --obstacles_flag, --num_obstacles, --enable_maze, --maze_complexity, --maze_density

$ python baselines.py --enable_maze --maze_complexity 0.3 --maze_density 0.3 --render_episode

# test on a bigger grid
$ python test_package.py --enable_maze --maze_density 0.3 --maze_complexity 0.3 --grid_size 8 --max_objects 12 --render_episode

2. Lights Out

$ python baselines.py --lights_out
$ python baselines.py --lights_out --render_episode  # for rendering

3. Metrics

topsim: measure compositionality of messages

$ python topsim.py

 ============ protocol: perfectly compositional =============
Concept         Messages  
green box       aa        
blue box        ba        
green circle    ab        
blue circle     bb        
pearson_corr = 1.0 , spearman_corr = 1.0

 ============ protocol: surjective (not injective) =============
Concept         Messages  
green box       ab        
blue box        ba        
green circle    ab        
blue circle     bb        
pearson_corr = 0.6793662204867574 , spearman_corr = 0.694022093788567

 ============ protocol: holistic =============
Concept         Messages  
green box       ba        
blue box        aa        
green circle    ab        
blue circle     bb        
pearson_corr = 0.5 , spearman_corr = 0.5

 ============ protocol: ambiguous language =============
Concept         Messages  
green box       aa        
blue box        aa        
green circle    aa        
blue circle     aa        
pearson_corr = 0.3651483716701107 , spearman_corr = 0.36514837167011077
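
For intuition, topographic similarity is commonly computed as the correlation between pairwise distances in concept space and pairwise distances in message space. The sketch below is a simplified stand-in, not the implementation in topsim.py: it assumes Hamming distance in both spaces and a hand-written Pearson correlation, and the function names are hypothetical. Because of these assumptions, its values need not match the output shown above.

```python
from itertools import combinations
from math import sqrt


def hamming(a, b):
    """Count the positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def pearson(xs, ys):
    """Pearson correlation of two equal-length lists; NaN if either is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")
    return cov / sqrt(var_x * var_y)


def topographic_similarity(protocol):
    """Correlate concept distances with message distances over all concept pairs."""
    pairs = list(combinations(protocol.items(), 2))
    concept_dists = [hamming(c1, c2) for (c1, _), (c2, _) in pairs]
    message_dists = [hamming(m1, m2) for (_, m1), (_, m2) in pairs]
    return pearson(concept_dists, message_dists)


# Concepts are (color, shape) tuples; messages are two-symbol strings.
compositional = {
    ("green", "box"): "aa",
    ("blue", "box"): "ba",
    ("green", "circle"): "ab",
    ("blue", "circle"): "bb",
}
# Each symbol tracks exactly one attribute, so the two distance lists are identical.
print(topographic_similarity(compositional))
```

In this protocol the first symbol encodes the color and the second the shape, which is why the concept and message distances agree for every pair.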

4. Other types of communication

Continuous Messages

$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type continuous

Binary Messages

$ python baselines.py --type_grammar simple_intrans --grid_input_type vector --all_light --num_episodes 300000 --episode_len 10 --comm_type binary
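
To make the difference between these channel types concrete, the sketch below turns the same vector of speaker scores into three message forms. It reflects the usual meaning of these terms in emergent-communication work (one-hot symbols, bit vectors, real-valued vectors) and is an assumption about gComm rather than its actual code. All function names and the threshold are hypothetical.

```python
def categorical_message(scores):
    """One-hot vector marking the highest-scoring symbol."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return [1 if i == best else 0 for i in range(len(scores))]


def binary_message(scores, threshold=0.0):
    """Bit vector: each position is 1 if its score exceeds the threshold."""
    return [1 if s > threshold else 0 for s in scores]


def continuous_message(scores):
    """Real-valued vector passed through unchanged."""
    return [float(s) for s in scores]


scores = [0.2, -1.3, 2.5, 0.7]  # illustrative speaker outputs, not real data
print(categorical_message(scores))
print(binary_message(scores))
print(continuous_message(scores))
```

A categorical message carries a single discrete symbol, a binary message can switch on several bits at once, and a continuous message is not discretized at all.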

Publications

[1] Rishi Hazra and Sonu Dixit, 2021. "gComm: An environment for investigating generalization in Grounded Language Acquisition". In NAACL 2021 Workshop: ViGIL.

[2] Rishi Hazra*, Sonu Dixit*, and Sayambhu Sen, 2021. "Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent Protocols". In NAACL 2021 Workshop: ViGIL.

[3] Rishi Hazra*, Sonu Dixit*, and Sayambhu Sen, 2020. "Infinite use of finite means: Zero-Shot Generalization using Compositional Emergent Protocols".
