lrhammond's Stars
hendrycks/apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
METR/task-standard
METR Task Standard
acsresearch/interlab
fiezt/ICML-2020-Implicit-Stackelberg-Learning
fiezt/Stackelberg-Code
Code for "Convergence of Learning Dynamics in Stackelberg Games"
openai/weak-to-strong
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
eareyan/pysegta
PKM-er/obsidian-zotlit
A third-party project that aims to facilitate the integration between Obsidian.md and Zotero, by providing a set of community plugins for both Obsidian and Zotero.
zkml-community/awesome-zkml
Aggregator for amazing ZKML resources
google-deepmind/deep-verify
google-deepmind/meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
eugenevinitsky/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
kentsommer/pytorch-value-iteration-networks
Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
longtermrisk/marltoolbox
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
psf/black
The uncompromising Python code formatter
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
gto76/python-cheatsheet
Comprehensive Python Cheatsheet
chloechsu/revisiting-ppo
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
riveSunder/OpenSafety
Open Safety Gym with PyBullet
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
rosewang2008/gym-cooking
🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.
pyutils/line_profiler
Line-by-line profiling for Python
sebdumancic/pylo2
Python wrapper around several Prolog engines. Hoping to make symbolic AI a part of standard AI toolkit.
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"