pengxiang-pang

pengxiang-pang's Stars

mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore
MindSpore implementations of deep reinforcement learning algorithms and environments
Language:Python142
alirezakazemipour/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Language:Python6024
navneet-nmk/Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
Language:Python604
metadriverse/TS2C
[ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"
Language:Python121
compsciencelab/ppo_D
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https://arxiv.org/abs/2007.03328
Language:Jupyter Notebook173
fiberleif/POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
Language:Python141
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language:Python1.1k189
ppocma/ppocma
Language:Python7016
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language:C#17.2k4.2k
adik993/ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Language:Python13327
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook9.4k1.9k
boyu-ai/ma-gym
A collection of multi agent environments based on OpenAI gym.
208
AmazingAng/WTF-DeepRL
Deep RL algorithm in pytorch
Language:Jupyter Notebook28757
boyu-ai/Hands-on-ML
https://hml.boyuai.com
Language:Jupyter Notebook34589
Paulescu/hands-on-rl
Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽
Language:Jupyter Notebook1.1k133
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language:Jupyter Notebook2.5k545
TaoHuang13/hindsight-experience-replay-with-demo
PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation tasks.
Language:Python10
jangirrishabh/Overcoming-exploration-from-demos
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines from OpenAI
Language:Python15230
mermaid-js/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
Language:TypeScript71.9k6.5k
mermaid-js/mermaid-live-editor
Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
Language:TypeScript4.2k649
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python65.3k8k
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook15k1.4k
ermongroup/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
Language:Python19927
decisionforce/MAIRL
[RA-L & ICRA 2021] Adversarial Inverse Reinforcement Learning with Self-attention Dynamics Model
Language:Python312
LucasCJYSDL/Multi-task-Hierarchical-AIRL
Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"
Language:Python151
yangmuzhi/airl
learning robust rewards with adversarial inverse reinforcement learning
Language:Python10
qiwihui/spinningup
OpenAI团队的深度强化学习教程中文版
Language:Python247
abubakar-sani/jam_shield_v2_0
This repository contains all the projects, and necessary scripts and files developed for the anti-jamming projects. You can clone it to train and test the DRL agents, as well as to reproduce the results.
Language:Python133
Kaixhin/imitation-learning
Imitation learning algorithms
Language:Python45639
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.6k636