OrgSim-RL Platform

Repo maintainer: Maximilian W. Hofer (maximilian.hofer@epfl.ch)

AUTHOR:  Maximilian W. Hofer  
SOURCE:  https://github.com/mxhofer/OrgSim-RL  
LICENSE: Access to this code is provided under an MIT License.

The OrgSim-RL platform is a reinforcement learning simulation tool to model organizational returns to specialization.

Key directories and files

dyna-q/: dyna-Q algorithm
vm/: virtual machine scripts
.streamlit/: Streamlit authentication
config.yaml: all parameter configurations
dashboard.py: Streamlit dashboard

Usage

Fork repo

Create your own copy of the repository to freely experiment with OrgSim-RL.

Install dependencies

git clone https://github.com/mxhofer/OrgSim-RL.git

pip install -r requirements.txt

Prepare Google Cloud environment

You will need to enable the following services:

BigQuery
AppEngine (make sure you have deployment rights)
Container Registry
IAM (make sure you have permission to create a new service account)

Configure the virtual machine (VM)

Create a VM with a disk (e.g. 200GB)
Add to the disk:
1. vm/rpc.sh script
2. When you stop this first VM, keep the disk around so you can re-use copies of the disk for future VMs
Stop the VM
Create a new image from the disk. Future VMs will use a copy of this image.
Update VM configurations in vm/vm.py:
1. GitHub URL
  1. Format: https://<username>:<token>@<github_url>
2. Project name
3. Image name
4. Compute zone

Run simulation on local machine (not recommended)

The config.yaml file contains all parameter values. The parameter values for specializaiton (lambda), automation (tau), environmental change (delta) are passed through the command line.

Beware: the simulation with default parameters takes > 12 hours to complete on an M1 MacBook Pro.

In your terminal, navigate to the dyna-q/ directory.
Run the simulation:

python dynaQ.py lambda <par_val> tau <par_val> delta <par_val>
Review results in dyna-q/outputs/results/<simulation_date>/

Run simulation on GCE (recommended)

The vm/vm.py file starts VMs and overwrites parameter values for specializaiton (lambda), automation (tau), environmental change (delta).

Set parameters in config.yaml
Set a value for the TAG variable at the top of dyna-q/dynaQ.py to identify the simulation run.
Push repository to GitHub
Set parameter ranges in vm/vm.py
Run vm/vm.py
Go to Google Cloud / Compute Engine / VM instances
1. Click on a VM
2. Open Serial Port 1 (console) to check stdout log
Check Google Cloud / Cloud Storage / Browser for simulation ouputs:
1. Check bucket simulation-output/results/
Validate outputs
1. Use the validate_outputs function in vm/vm.py

Ingest results into BigQuery

Follow the general guide on ingesting .csv files into BigQuery here
1. Data set ID: date + TAG
2. Create table from: select a .csv output file in the appropriate bucket. Then edit the filename to *.csv to ingest all files in that bucket.
3. Table name: results (hard requirement as the SQL queries expect this table name!)
4. Schema: Auto detect.
5. Cluster by delta, lambda to speed up querying performance.
6. In Advanced Options. Header rows to skip: 1
7. Create table.
Validate number of rows in table
1. Click on the table name
2. Navigate to Details
3. Check Number of rows. Number of rows should equal:
  1. # of values for lambda x # of values for tau x # of values lambda x # episodes x # runs

Configure Streamlit

On Google Cloud:
1. Create a service account with Viewer permissions. See details here.
2. Create a key.
3. Download the key as a JSON.
Save the key file in .streamlit/
Update the path to the key file in dashboard.py
Add the key file to .gitignore

Deploy dashboard

Test that the dashboard is running locally:
1. streamlit run dashboard.py --server.port=8080 --server.address=0.0.0.0
2. Running the dashboard will write .csv files of the outputs to disk in dyna-q/outputs
Test that the dashboard is running locally in a Docker container:
1. docker build . -t dashboard
2. docker run -p 8080:8080 dashboard
Deploy dashboard:
1. gcloud app deploy dashboard.yaml
2. Click on the URL of the deployed service

Your deployed Streamlit dashboard is ready to use!