KonoeSubaru's Stars
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
LiSir-HIT/Reinforcement-Learning
kinds of reinforcement learning model by Pytorch
intelligent-environments-lab/CityLearn
Official reinforcement learning environment for demand response and load shaping
HansenHua/MFPO-INFOCOM24
An online federated reinforcement learning algorithm published in INFOCOM2024
DesikRengarajan/FEDORA
[NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning
qiongwu86/Edge-Caching-Based-on-Multi-Agent-Deep-Reinforcement-Learning-and-Federated-Learning
microsoft/HuRL
Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper
Alirezad126/PD-DDPGfD
Code for Primal-Dual Deep Deterministic Policy Gradient From Demonstrations
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
openai/safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
tayalmanan28/Safe_Reinforcement_Learning
Repository containing the code for the paper "Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions". Specifically, an implementation of SAC + Robust Control Barrier Functions (RCBFs) for safe reinforcement learning in two custom environments
ammarhydr/SAC-Lagrangian
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
snu-mllab/DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
david-lindner/idrl
Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021).
PKU-Alignment/Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
LucasCJYSDL/HierAIRL
A novel Hierarchical Imitation Learning algorithm based on AIRL.
FederatedAI/FATE
An Industrial Grade Federated Learning Framework
Jordan-Haidee/FedDDPG
wangyu92/cartpole-ppo-federated-learning
tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
yumath/bertNER
ChineseNER based on BERT, with BiLSTM+CRF layer