HumanCompatibleAI

CHAI seeks to develop the conceptual and technical wherewithal to reorient the general thrust of AI research towards provably beneficial systems.

Pinned Repositories

adversarial-policies
Find best-response to a fixed policy in multi-agent RL
Language: Python 283 13 2246
evaluating-rewards
Library to compare and evaluate reward functions
Language: Python 65 8 58
human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
Language: Python 108 9 1746
imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python 1.4k 17 346260
overcooked-demo
Web application where humans can play Overcooked with AI agents.
Language: JavaScript 58 7 1927
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Language: Jupyter Notebook 778 19 71173
rlsp
Reward Learning by Simulating the Past
Language: Python 44 8 06
seals
Benchmark environments for reward modelling and imitation learning algorithms.
Language: Python 46 10 146
tensor-trust
A prompt injection game to collect data for robust ML research
Language: Python 54 6 1645
tensor-trust-data
Dataset for the Tensor Trust project
Language: Jupyter Notebook 39 4 25

HumanCompatibleAI's Repositories

HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python 1.4k 17 346260
HumanCompatibleAI/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Language: Jupyter Notebook 778 19 71173
HumanCompatibleAI/adversarial-policies
Find best-response to a fixed policy in multi-agent RL
Language: Python 283 13 2246
HumanCompatibleAI/human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
Language: Python 108 9 1746
HumanCompatibleAI/evaluating-rewards
Library to compare and evaluate reward functions
Language: Python 65 8 58
HumanCompatibleAI/overcooked-demo
Web application where humans can play Overcooked with AI agents.
Language: JavaScript 58 7 1927
HumanCompatibleAI/tensor-trust
A prompt injection game to collect data for robust ML research
Language: Python 54 6 1645
HumanCompatibleAI/seals
Benchmark environments for reward modelling and imitation learning algorithms.
Language: Python 46 10 146
HumanCompatibleAI/tensor-trust-data
Dataset for the Tensor Trust project
Language: Jupyter Notebook 39 4 25
HumanCompatibleAI/eirli
An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21
Language: Python 36 8 24
HumanCompatibleAI/ranking-challenge
Testing ranking algorithms to improve social cohesion
Language: Python 29 7 73
HumanCompatibleAI/leela-interp
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
Language: Jupyter Notebook 20 0 08
HumanCompatibleAI/nn-clustering-pytorch
Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.
Language: Python 6 4 02
HumanCompatibleAI/recon-email
Script for automatically creating the reconnaissance email.
Language: HTML 5 3 01
HumanCompatibleAI/reward-preprocessing
Preprocessing reward functions to make them more interpretable
Language: Python 5 3 00
HumanCompatibleAI/multiagent-competition
Code for the paper "Emergent Complexity via Multi-agent Competition"
Language: Python 4 6 04
HumanCompatibleAI/assistance-games
Supporting code for Assistance Games as a Framework paper
Language: Python 3 7 0
HumanCompatibleAI/reducing-exploitability
Language: Python 3 2 40
HumanCompatibleAI/stable-baselines3
PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
Language: Python 3 2 01
HumanCompatibleAI/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Language: Python 2 3 0
HumanCompatibleAI/ranking-challenge-perspective
Prosocial Ranking Challenge Perspective Ranker
Language: Jupyter Notebook 1
HumanCompatibleAI/reward-function-interpretability
Language: Jupyter Notebook 1 3 50
HumanCompatibleAI/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python 1 1 0
HumanCompatibleAI/sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Language: Python 1 2 0
HumanCompatibleAI/katago-driver-bug-repro
Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/211270/3
Language: Dockerfile 5 0
HumanCompatibleAI/pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
Language: Python 2 0
HumanCompatibleAI/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language: Python 2 01
HumanCompatibleAI/rc-submission-civirank
PRC: Civirank submission
HumanCompatibleAI/rc-submission-dante
PRC: Testing ranking algorithms to improve social cohesion
Language: JavaScript
HumanCompatibleAI/sgf-viewer
A simple webpage that can visualize an SGF string encoded as a URL fragment.
Language: CSS 3 0