Yangli0505's Stars
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
rll/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
yangwohenmai/LSTM
基于LSTM的时间序列预测研究
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
devsisters/DQN-tensorflow
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
chickenbestlover/RNN-Time-series-Anomaly-Detection
RNN based Time-series Anomaly detector model implemented in Pytorch.
haarnoja/sac
Soft Actor-Critic
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
eleurent/phd-bibliography
References on Optimal Control, Reinforcement Learning and Motion Planning
laiguokun/LSTNet
umbertogriffo/Predictive-Maintenance-using-LSTM
Example of Multiple Multivariate Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras.
PatientEz/CNN-BiLSTM-Attention-Time-Series-Prediction_Keras
CNN+BiLSTM+Attention Multivariate Time Series Prediction implemented by Keras
cjy1992/gym-carla
An OpenAI gym wrapper for CARLA simulator
jachiam/cpo
Constrained Policy Optimization
guillaume-chevalier/Linear-Attention-Recurrent-Neural-Network
A recurrent attention module consisting of an LSTM cell which can query its own past cell states by the means of windowed multi-head attention. The formulas are derived from the BN-LSTM and the Transformer Network. The LARNN cell with attention can be easily used inside a loop on the cell state, just like any other RNN. (LARNN)
chauvinSimon/Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
skumar9876/Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https://arxiv.org/pdf/1604.06057.pdf
microsoft/oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
wm5920/mnist_web_tensorflow_demo
网页手写数字,后台通过回归和cnn及时识别
zbzhu99/Constrained-Decision-Making-Paper-List
Paper list for constrained policy optimization in reinforcement learning.
UniqueAndys/Host-Load-Prediction-with-LSTM
host load prediction with Long Short-Term Memory in cloud computing
sisl/AutonomousMerging.jl
Implementation of a highway merging scenario
MohamedAliRashad/NeurIPs-2020-SlidesLive
Links to Presentations happened in NeurIPs 2020 via SlidesLive
yang0110/RL-Algorithms-Implementation