Howuhh

RL Researcher @ dunnolab

Pinned Repositories

xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Language: Python 276 11 1719
average_reward_ppo
Implementation of the paper "Average-Reward Reinforcement Learning with Trust Region Methods".
Language: Python 8 1 01
chess_minimax
Minimax algorithm for chess with alpha-beta pruning
Language: Jupyter Notebook 8 1 05
evolution_strategies_openai
Implementation of the OpenAI paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
Language: Python 20 2 02
faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search
Language: Python 110 1 714
link_pred_spark
Similarity between graph nodes based on local information, computed with PySpark
Language: Python 9 1 01
prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
Language: Python 76 1 110
sac-n-jax
Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch
Language: Python 52 1 13
CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python 1.2k 17 28144
sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Language: Python 52 3 05

Howuhh's Repositories

Howuhh/faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search
Language: Python 110 1 714
Howuhh/prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
Language: Python 76 1 110
Howuhh/sac-n-jax
Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch
Language: Python 52 1 13
Howuhh/evolution_strategies_openai
Implementation of the OpenAI paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
Language: Python 20 2 02
Howuhh/link_pred_spark
Similarity between graph nodes based on local information, computed with PySpark
Language: Python 9 1 01
Howuhh/average_reward_ppo
Implementation of the paper "Average-Reward Reinforcement Learning with Trust Region Methods".
Language: Python 8 1 01
Howuhh/chess_minimax
Minimax algorithm for chess with alpha-beta pruning
Language: Jupyter Notebook 8 1 05
Howuhh/MHRW
Metropolis-Hastings random walk with PySpark
Language: Jupyter Notebook 7 1 0
Howuhh/cic_gym
Adaptation of the original "Contrastive Intrinsic Control for Unsupervised Skill Discovery" implementation to OpenAI Gym
Language: Python 3 1 11
Howuhh/Howuhh.github.io
Language: HTML 3 1 0
Howuhh/autograd_but_smaller
Simple implementation of reverse-mode automatic differentiation on numpy arrays
Language: Jupyter Notebook 2 2 01
Howuhh/halfcheetah_experts
Expert policies for forward and backflip HalfCheetah environments
Language: Python 2 1 0
Howuhh/Predators-and-Preys
Language: Python 2 0 01
Howuhh/cleanrl
High-quality single-file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 1 0 0
Howuhh/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language: Python 1 0 0
Howuhh/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
Language: Python 1 0 0
Howuhh/pgx
🎲 Vectorized RL game environments written in JAX with end-to-end AlphaZero examples
Language: Python 1 0 0
Howuhh/cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Language: Python 0 0
Howuhh/d4rl
A benchmark for offline reinforcement learning.
Language: Python 0 0
Howuhh/dul_2021
Language: Jupyter Notebook 0 0
Howuhh/hse_bayesian_ml
Language: Jupyter Notebook 1 0
Howuhh/hse_recsys
HSE recommender systems course
Language: Jupyter Notebook 1 0
Howuhh/hse_reinforcement_learning
HSE Reinforcement Learning course
Language: Jupyter Notebook 2 0
Howuhh/linear-transformer-experiments
Experiments using fast linear transformer
Language: Python 0 0
Howuhh/link_pred
Link prediction in a social network based on node neighborhoods
Language: Jupyter Notebook 0 0
Howuhh/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Language: Python 0 0
Howuhh/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Language: Cython 0 0
Howuhh/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Language: Python 0 0
Howuhh/TTS_HW
Language: Python 0 0
Howuhh/vector-quantize-pytorch
Vector Quantization, in Pytorch
Language: Python 0 0