lrhammond

Hello world.

University of Oxford, Oxford, UK

lrhammond's Stars

hendrycks/apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
Language:Python39651
UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
Language:Python55995
METR/task-standard
METR Task Standard
Language:TypeScript11328
acsresearch/interlab
Language:Jupyter Notebook172
fiezt/ICML-2020-Implicit-Stackelberg-Learning
Language:Jupyter Notebook11
fiezt/Stackelberg-Code
Code for "Convergence of Learning Dynamics in Stackelberg Games"
Language:Jupyter Notebook131
openai/weak-to-strong
Language:Python2.5k303
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Language:Jupyter Notebook2.2k163
eareyan/pysegta
Language:Jupyter Notebook32
PKM-er/obsidian-zotlit
A third-party project that aims to facilitate the integration between Obsidian.md and Zotero, by providing a set of community plugins for both Obsidian and Zotero.
Language:TypeScript64229
zkml-community/awesome-zkml
Aggregator for amazing ZKML resources
38425
google-deepmind/deep-verify
Language:Python167
google-deepmind/meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
Language:Python590120
eugenevinitsky/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
Language:Python385132
kentsommer/pytorch-value-iteration-networks
Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)
Language:Python31662
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.8k382
longtermrisk/marltoolbox
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
Language:Python293
psf/black
The uncompromising Python code formatter
Language:Python38.7k2.4k
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language:C#17k4.1k
gto76/python-cheatsheet
Comprehensive Python Cheatsheet
Language:Python36.2k6.5k
chloechsu/revisiting-ppo
Language:Jupyter Notebook477
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python10k2.2k
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python15.7k4.9k
riveSunder/OpenSafety
Open Safety Gym with PyBullet
Language:Python7
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30.1k2.8k
rosewang2008/gym-cooking
🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.
Language:Python18437
pyutils/line_profiler
Line-by-line profiling for Python
Language:Python2.7k119
sebdumancic/pylo2
Python wrapper around several Prolog engines. Hoping to make symbolic AI a part of standard AI toolkit.
Language:Python847
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python2.1k603
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.3k785
