Center for Human-Compatible AI
CHAI seeks to develop the conceptual and technical wherewithal to reorient the general thrust of AI research towards provably beneficial systems.
Pinned Repositories
adversarial-policies
Find a best response to a fixed policy in multi-agent RL
eirli
An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21
evaluating-rewards
Library to compare and evaluate reward functions
human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
imitation
Clean PyTorch implementations of imitation and reward learning algorithms (a minimal behavioral-cloning sketch follows this list)
overcooked-demo
Web application where humans can play Overcooked with AI agents.
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
rlsp
Reward Learning by Simulating the Past
seals
Benchmark environments for reward modelling and imitation learning algorithms.
tensor-trust
A prompt injection game to collect data for robust ML research
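
Several of the pinned projects learn policies or reward functions from demonstrations. For readers new to the area, here is a minimal, self-contained sketch of behavioral cloning, the simplest of the algorithm families the imitation repository covers. It is illustrative only: it does not use the library's API, and every name, dimension, and dataset in it is hypothetical.

    # Behavioral-cloning sketch (illustrative; not the imitation library's API).
    # The idea: reduce imitation to supervised learning by regressing the
    # expert's actions from the observations it saw.
    import torch
    import torch.nn as nn

    obs_dim, n_actions = 4, 2  # hypothetical sizes for a small discrete-action task

    # Hypothetical expert dataset: observations and the actions the expert took.
    expert_obs = torch.randn(1024, obs_dim)
    expert_acts = torch.randint(0, n_actions, (1024,))

    policy = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(), nn.Linear(64, n_actions))
    optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()

    for epoch in range(10):
        logits = policy(expert_obs)          # action logits for each observation
        loss = loss_fn(logits, expert_acts)  # match the expert's action choices
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

More sophisticated methods in this space (inverse RL, reward modelling) replace the direct action regression with learning a reward function that explains the demonstrations.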
Center for Human-Compatible AI's Repositories
HumanCompatibleAI/rlsp
Reward Learning by Simulating the Past
HumanCompatibleAI/atari-irl
HumanCompatibleAI/deep-rlsp
Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.
HumanCompatibleAI/population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
HumanCompatibleAI/learning_biases
Infer the ways in which agents plan suboptimally, for example whether they are hyperbolic time discounters (a discounting sketch follows at the end of this repository list).
HumanCompatibleAI/human_ai_robustness
HumanCompatibleAI/better-adversarial-defenses
Training in bursts for defending against adversarial policies
HumanCompatibleAI/interpreting-rewards
Experiments in applying interpretability techniques to learned reward functions.
HumanCompatibleAI/derail
Supporting code for the seals diagnostic-tasks paper
HumanCompatibleAI/minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
HumanCompatibleAI/multi-agent
HumanCompatibleAI/cs294-149-fa18-notes
LaTeX Notes from the Fall 2018 version of CS294-149: AGI Safety and Control
HumanCompatibleAI/ilqr
Iterative Linear Quadratic Regulator with auto-differentiable dynamics models
HumanCompatibleAI/logical-active-classification
Use active learning to classify data represented as boundaries of regions in parameter space where a parametrised logical formula holds.
HumanCompatibleAI/simulation-awareness
(Experimental) RL agents should be more aligned if they do not know whether they are in simulation or in the real world
HumanCompatibleAI/interactive-behaviour-design
HumanCompatibleAI/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
HumanCompatibleAI/carla-autoware
Integration of AutoWare AV software with the CARLA simulator
HumanCompatibleAI/coiltraine
Training framework for conditional imitation learning
HumanCompatibleAI/gym
A toolkit for developing and comparing reinforcement learning algorithms.
HumanCompatibleAI/interactive-behaviour-design-baselines
HumanCompatibleAI/interactive-behaviour-design-basicfetch
HumanCompatibleAI/interactive-behaviour-design-gym
HumanCompatibleAI/malmo
Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presented by this unique environment.
HumanCompatibleAI/scenario_runner
Traffic scenario definition and execution engine
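
The learning_biases repository above models planners with systematic biases such as hyperbolic time discounting. As a quick, self-contained illustration with made-up parameter values: an exponential discounter weights a reward t steps away by gamma**t, so its relative preferences never change as time passes, while a hyperbolic discounter weights it by 1 / (1 + k*t), which falls off steeply at first and slowly later.

    # Compare exponential vs. hyperbolic discount weights (illustrative values).
    gamma, k = 0.95, 0.5  # hypothetical discount parameters

    def exponential(t):
        return gamma ** t           # time-consistent: per-step ratio is constant

    def hyperbolic(t):
        return 1.0 / (1.0 + k * t)  # time-inconsistent: near-term rewards loom large

    for t in [0, 1, 5, 20]:
        print(f"t={t:2d}  exp={exponential(t):.3f}  hyp={hyperbolic(t):.3f}")

Because the hyperbolic ratio between adjacent steps shrinks as t grows, such an agent can prefer a larger-later reward from a distance yet switch to the smaller-sooner one as it becomes imminent; inferring this kind of bias from observed behavior is the sort of problem that repository studies.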