Yangli0505

China

Yangli0505's Stars

ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python33.3k 475 18.5k5.6k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python8.8k 63 1.5k1.7k
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python5.6k 106 711.2k
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python3.9k 36 34844
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 67 229829
rll/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
Language:Python2.9k 162 183801
yangwohenmai/LSTM
基于LSTM的时间序列预测研究
Language:Python2.7k 20 20690
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
Language:Python2.6k 29 464739
devsisters/DQN-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
Language:Python2.5k 146 57764
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
Language:Python1.9k 56 1k310
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.8k 29 130382
chickenbestlover/RNN-Time-series-Anomaly-Detection
RNN based Time-series Anomaly detector model implemented in Pytorch.
Language:Python1.2k 37 52318
haarnoja/sac
Soft Actor-Critic
Language:Python976 29 27234
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Language:Python935 13 1k187
eleurent/phd-bibliography
References on Optimal Control, Reinforcement Learning and Motion Planning
918 38 2205
laiguokun/LSTNet
Language:Python655 25 22169
umbertogriffo/Predictive-Maintenance-using-LSTM
Example of Multiple Multivariate Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras.
Language:Python624 26 5238
PatientEz/CNN-BiLSTM-Attention-Time-Series-Prediction_Keras
CNN+BiLSTM+Attention Multivariate Time Series Prediction implemented by Keras
Language:Python559 6 693
cjy1992/gym-carla
An OpenAI gym wrapper for CARLA simulator
Language:Python530 10 47109
jachiam/cpo
Constrained Policy Optimization
Language:Python305 8 582
guillaume-chevalier/Linear-Attention-Recurrent-Neural-Network
A recurrent attention module consisting of an LSTM cell which can query its own past cell states by the means of windowed multi-head attention. The formulas are derived from the BN-LSTM and the Transformer Network. The LARNN cell with attention can be easily used inside a loop on the cell state, just like any other RNN. (LARNN)
Language:Jupyter Notebook144 9 032
chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Language:Python103 3 131
skumar9876/Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https://arxiv.org/pdf/1604.06057.pdf
Language:Python79 3 116
microsoft/oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
Language:Python68 7 1123
wm5920/mnist_web_tensorflow_demo
网页手写数字，后台通过回归和cnn及时识别
Language:Python6823
zbzhu99/Constrained-Decision-Making-Paper-List
Paper list for constrained policy optimization in reinforcement learning.
66 3 111
UniqueAndys/Host-Load-Prediction-with-LSTM
host load prediction with Long Short-Term Memory in cloud computing
Language:Python34 3 114
sisl/AutonomousMerging.jl
Implementation of a highway merging scenario
Language:Julia30 14 49
MohamedAliRashad/NeurIPs-2020-SlidesLive
Links to Presentations happened in NeurIPs 2020 via SlidesLive
3 2 03
yang0110/RL-Algorithms-Implementation
Language:Python2 1 00

Yangli0505

Yangli0505's Stars

ray-project/ray

DLR-RM/stable-baselines3

p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

sweetice/Deep-reinforcement-learning-with-pytorch

ikostrikov/pytorch-a2c-ppo-acktr-gail

rll/rllab

yangwohenmai/LSTM

Farama-Foundation/HighwayEnv

devsisters/DQN-tensorflow

rlworkgroup/garage

oxwhirl/pymarl

chickenbestlover/RNN-Time-series-Anomaly-Detection

haarnoja/sac

huawei-noah/SMARTS

eleurent/phd-bibliography

laiguokun/LSTNet

umbertogriffo/Predictive-Maintenance-using-LSTM

PatientEz/CNN-BiLSTM-Attention-Time-Series-Prediction_Keras

cjy1992/gym-carla

jachiam/cpo

guillaume-chevalier/Linear-Attention-Recurrent-Neural-Network

chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

skumar9876/Hierarchical-DQN

microsoft/oac-explore

wm5920/mnist_web_tensorflow_demo

zbzhu99/Constrained-Decision-Making-Paper-List

UniqueAndys/Host-Load-Prediction-with-LSTM

sisl/AutonomousMerging.jl

MohamedAliRashad/NeurIPs-2020-SlidesLive

yang0110/RL-Algorithms-Implementation