pengxiang-pang's Stars
mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore
MindSpore implementations of deep reinforcement learning algorithms and environments
alirezakazemipour/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
navneet-nmk/Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
metadriverse/TS2C
[ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"
compsciencelab/ppo_D
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https://arxiv.org/abs/2007.03328
fiberleif/POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
ppocma/ppocma
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
adik993/ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
datawhalechina/easy-rl
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
boyu-ai/ma-gym
A collection of multi agent environments based on OpenAI gym.
AmazingAng/WTF-DeepRL
Deep RL algorithm in pytorch
boyu-ai/Hands-on-ML
https://hml.boyuai.com
Paulescu/hands-on-rl
Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻🦸🏽
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
TaoHuang13/hindsight-experience-replay-with-demo
PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation tasks.
jangirrishabh/Overcoming-exploration-from-demos
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines from OpenAI
mermaid-js/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
mermaid-js/mermaid-live-editor
Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
KindXiaoming/pykan
Kolmogorov Arnold Networks
ermongroup/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
decisionforce/MAIRL
[RA-L & ICRA 2021] Adversarial Inverse Reinforcement Learning with Self-attention Dynamics Model
LucasCJYSDL/Multi-task-Hierarchical-AIRL
Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"
yangmuzhi/airl
learning robust rewards with adversarial inverse reinforcement learning
qiwihui/spinningup
OpenAI团队的深度强化学习教程中文版
abubakar-sani/jam_shield_v2_0
This repository contains all the projects, and necessary scripts and files developed for the anti-jamming projects. You can clone it to train and test the DRL agents, as well as to reproduce the results.
Kaixhin/imitation-learning
Imitation learning algorithms
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)