DavidSlayback

@offerfit

DavidSlayback's Stars

plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Language: Python 12.3k 92 477402
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Language: Python 7.8k 50 488881
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language: Python 2.5k 44 657328
pydata/numexpr
Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more
Language: Python 2.3k 61 382212
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language: Python 2.2k 39 194616
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Language: Jupyter Notebook 1.5k 29 17494
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python 1.4k 19 342254
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python 1.1k 17 28136
viblo/pymunk
Pymunk is an easy-to-use pythonic 2D physics library that can be used whenever you need 2D rigid body physics from Python
Language: Python 954 19 233190
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
Language: Python 542 8 63145
helenahartmann/awesome-PhD
All the resources I wish I had known about when starting my PhD. This repository aims to be a living, constantly developing resource to which everybody can contribute new resources!
447 26 025
dfm/extending-jax
Extending JAX with custom C++ and CUDA code
Language: Python 380 10 623
Farama-Foundation/miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
Language: HTML 299 15 2548
Farama-Foundation/MicroRTS-Py
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
Language: Python 236 11 3946
Div99/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
Language: Python 209 3 2032
rohanpsingh/mujoco-python-viewer
Simple renderer for use with MuJoCo (>=2.1.2) Python Bindings.
Language: Python 203 2 2432
ArnaudFickinger/gym-multigrid
Lightweight multi-agent gridworld Gym environment
Language: Python 199 2 342
ykwon0407/WeightedSHAP
WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)
Language: Jupyter Notebook 160 2 119
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Language: Python 149 2 322
jurgisp/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
Language: Python 137 3 3014
OutpostUniverse/OPHD
OutpostHD - Open source remake of Sierra On-Line's Outpost
Language: C++ 110 13 38721
tianjunz/NovelD
Language: Python 39 2 36
aijunbai/taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
Language: C++ 30 6 111
RedTachyon/coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
Language: Python 26 3 142
GUT-AI/gut-ai
Documentation, content and meta files about GUT-AI.
24 3 02
ludc/gymecs
Language: Python 23 4 00
AdaCompNUS/magic
Macro-Action Generator-Critic (MAGIC) - Learning Macro-actions for online POMDP planning
Language: C++ 17 22 32
Farama-Foundation/Procgen-Staging
Procgen2: A community maintained fork of procgen
Language: C++ 11 3 18
lebrice/Tutorials
Source code for the Mila Tutorials
Language: Python 7 3 00
masud99r/bae
Code for the paper: Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning. https://arxiv.org/abs/2210.07312
Language: Python 3 1 02
