Yangli0505's Stars
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
optuna/optuna
A hyperparameter optimization framework
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
araffin/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
wandb/examples
Example deep learning projects that use wandb's features.
openai/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
louisnino/RLcode
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
bitsauce/Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Stable-Baselines-Team/rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
g6ling/Reinforcement-Learning-Pytorch-Cartpole
Simple Cartpole example writed with pytorch.
ikeepo/stable-baselines-zh
Stable Baselines官方文档中文版
mrkulk/hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
modriczhang/HRL-Rec
"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580
AlgTUDelft/WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
rmst/rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
martin6336/DrawFigureForPaper
Some python scripts for drawing figures in scientific papers
eager-dev/eagerx_tutorials
Tutorials on how to use EAGERx
bramdemoor-BE/Reward-shaping-to-improve-the-performance-of-DRL-in-inventory-management
Link to paper: https://www.ssrn.com/abstract=3804655
zhihanyang2022/drqn
Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert
0xangelo/gym-industrial
A fork of the Industrial Benchmark, refactored and packaged for PyPI
jvgemert/jvgemert.github.io
NeteaseFuxiRL/action-balance-exploration
INFLUENCEorg/IAOP
stevencarrau/RL-POMDP-MEM
Memory-based approaches to Reinforcement learning for POMDPs
danialkamran/highway-env
A minimalist environment for decision-making in autonomous driving
thiagopbueno/thiagopbueno.github.io
About me page!
tk2232/sac_discrete
SAC discrete action space