Yangli0505

China

Yangli0505's Stars

academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript11.9k 91 36342.7k
optuna/optuna
A hyperparameter optimization framework
Language:Python10.6k 117 1.7k1k
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language:Python1.3k 18 340244
araffin/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language:Python1.1k 32 86208
wandb/examples
Example deep learning projects that use wandb's features.
Language:Jupyter Notebook1.1k 16 84289
openai/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Language:Python873 26 20160
louisnino/RLcode
Language:Python854 2 8284
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Language:Jupyter Notebook753 11 1746
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Language:Jupyter Notebook604 11 13114
Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Language:Python469 16 139173
bitsauce/Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Language:Python228 4 2556
Stable-Baselines-Team/rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
Language:Jupyter Notebook202 6 637
g6ling/Reinforcement-Learning-Pytorch-Cartpole
Simple Cartpole example writed with pytorch.
Language:Python165 9 323
ikeepo/stable-baselines-zh
Stable Baselines官方文档中文版
Language:Python93 1 014
mrkulk/hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
Language:Lua86 14 123
modriczhang/HRL-Rec
"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580
Language:Python52 5 66
AlgTUDelft/WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
Language:Python50 4 518
rmst/rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
Language:Python38 6 18
martin6336/DrawFigureForPaper
Some python scripts for drawing figures in scientific papers
Language:Python26 1 08
eager-dev/eagerx_tutorials
Tutorials on how to use EAGERx
Language:Jupyter Notebook16 1 24
bramdemoor-BE/Reward-shaping-to-improve-the-performance-of-DRL-in-inventory-management
Link to paper: https://www.ssrn.com/abstract=3804655
Language:Python134
zhihanyang2022/drqn
Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert
Language:Python8 1 00
0xangelo/gym-industrial
A fork of the Industrial Benchmark, refactored and packaged for PyPI
Language:Python4 1 00
jvgemert/jvgemert.github.io
Language:HTML4 2 00
NeteaseFuxiRL/action-balance-exploration
Language:Python4 2 02
INFLUENCEorg/IAOP
Language:C++3 3 11
stevencarrau/RL-POMDP-MEM
Memory-based approaches to Reinforcement learning for POMDPs
Language:Jupyter Notebook3 2 01
danialkamran/highway-env
A minimalist environment for decision-making in autonomous driving
Language:Python13
thiagopbueno/thiagopbueno.github.io
About me page!
Language:CSS1
tk2232/sac_discrete
SAC discrete action space
Language:Python1 0 00

Yangli0505

Yangli0505's Stars

academicpages/academicpages.github.io

optuna/optuna

HumanCompatibleAI/imitation

araffin/rl-baselines-zoo

wandb/examples

openai/random-network-distillation

louisnino/RLcode

google-research/rliable

araffin/rl-tutorial-jnrr19

Stable-Baselines-Team/stable-baselines3-contrib

bitsauce/Carla-ppo

Stable-Baselines-Team/rl-colab-notebooks

g6ling/Reinforcement-Learning-Pytorch-Cartpole

ikeepo/stable-baselines-zh

mrkulk/hierarchical-deep-RL

modriczhang/HRL-Rec

AlgTUDelft/WCSAC

rmst/rlrd

martin6336/DrawFigureForPaper

eager-dev/eagerx_tutorials

bramdemoor-BE/Reward-shaping-to-improve-the-performance-of-DRL-in-inventory-management

zhihanyang2022/drqn

0xangelo/gym-industrial

jvgemert/jvgemert.github.io

NeteaseFuxiRL/action-balance-exploration

INFLUENCEorg/IAOP

stevencarrau/RL-POMDP-MEM

danialkamran/highway-env

thiagopbueno/thiagopbueno.github.io

tk2232/sac_discrete