DavidSlayback's Stars
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
pydata/numexpr
Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
viblo/pymunk
Pymunk is a easy-to-use pythonic 2d physics library that can be used whenever you need 2d rigid body physics from Python
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
helenahartmann/awesome-PhD
All the resources I wish I knew when starting my PhD. This repository is aimed to be a living, constantly developing resource where everybody can contribute with new resources!
dfm/extending-jax
Extending JAX with custom C++ and CUDA code
Farama-Foundation/miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
Farama-Foundation/MicroRTS-Py
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
Div99/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
rohanpsingh/mujoco-python-viewer
Simple renderer for use with MuJoCo (>=2.1.2) Python Bindings.
ArnaudFickinger/gym-multigrid
Lightweight multi-agent gridworld Gym environment
ykwon0407/WeightedSHAP
WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
jurgisp/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
OutpostUniverse/OPHD
OutpostHD - Open source remake of Sierra On-Line's Outpost
tianjunz/NovelD
aijunbai/taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
RedTachyon/coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
GUT-AI/gut-ai
Documentation, content and meta files about GUT-AI.
ludc/gymecs
AdaCompNUS/magic
Macro-Action Generator-Critic (MAGIC) - Learning Macro-actions for online POMDP planning
Farama-Foundation/Procgen-Staging
Procgen2: A community maintained fork of procgen
lebrice/Tutorials
Source code for the Mila Tutorials
masud99r/bae
Code for the paper: Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning. https://arxiv.org/abs/2210.07312