YL03

YL03's Stars

johannes-manner/SeMoDe
SeMoDe is a tool to support lifecycle activities of serverless functions on different platforms. Currently, automated test generation on AWS Lambda is possible, and performance considerations due to the cold start issue are a work in progress.
Language:Java121
Azure-Samples/functions-distributed-tracing-sample
Distributed Tracing sample for Azure Functions and Java with Application Insights
Language:C#67
kaixindelele/DRLib
DRLib: a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.
Language:Python49770
ZYunfeii/DRL_algorithm_library
This is a reinforcement learning algorithm library. The code balances performance and simplicity, with few dependencies.
Language:Python8420
revenol/DROO
Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks
Language:Python516187
apourchot/CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
Language:Python9821
wyjung0625/p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
Language:Python225
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python 1.9k 495
twni2016/Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Language:Python283
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python 8.3k 1.6k
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python 3.5k 831
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python 1.6k 331
facebookresearch/hydra
Hydra is a framework for elegantly configuring complex applications
Language: Python 8.4k 608
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language: Python 5.5k 1.2k
facebookresearch/LearningToLearn
Collection of algorithms to learn loss and reward functions via gradient-based bi-level optimization.
Language:Jupyter Notebook10219
eambutu/snail-pytorch
Implementation of "A Simple Neural Attentive Meta-Learner" (SNAIL, https://arxiv.org/pdf/1707.03141.pdf) in PyTorch
Language:Python14328
chanb/metalearning_RL
Language:Python182
jxx123/rl-tf2
My own implementation of Reinforcement Learning algorithms using TensorFlow 2.0
Language:Python289
quantumiracle/Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmark of existing methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms include DDPG, PPO.
Language:Python269
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language: Python 32k 5.4k
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language: Python 4.1k 728
google-research/google-research
Google Research
Language: Jupyter Notebook 33.4k 7.8k
tristandeleu/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
Language:Python807156
amazon-science/meta-q-learning
Code for the paper "Meta-Q-Learning" (ICLR 2020)
Language:Python10216
toshikwa/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
Language:Python26133
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Language: Jupyter Notebook 1k 119
dragen1860/MAML-Pytorch
Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning (MAML)
Language: Python 2.3k 422
HaiyinPiao/pytorch-a2clstm-DRQN
Using recurrent networks (LSTM) to solve POMDPs
Language:Python332
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language: Python 1.2k 260
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python 3.5k 818