Pinned Repositories
-
A distributed implementation of the well-known actor-critic algorithm, based on the paper "Asynchronous Methods for Deep Reinforcement Learning".
-_1
Paper source: https://arxiv.org/abs/2402.03741
2023-code-CDC-A-Distributed-LQDTG-Approach-to-Formation-Control-with-Collision-Avoidance
Code for the paper submitted for ECC 2023
A-Barrier-Lyapunov-Actor-Critic-Reinforcement-Learning-Approach-for-Safe-and-Stable-Control
Adaptive-optimal-control-of-linear-periodic-systems-An-off-policy-value-iteration-approach
Code for the paper proposing a data-driven RL method based on value iteration for linear periodic systems.
AMS-DRL-for-Pursuit-Evasion
This is the official repository for the paper "Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones".
Attack-Resiliency-In-Truck-Platooning-Using-Two-Team-Games
Uses two-team game theory to make truck platoons resilient against malicious attacks.
AVGM
awesome-deep-rl
A curated list of awesome Deep Reinforcement Learning resources.
dmpcrl-concept
Proof of concept example for the idea of using distributed model predictive control as a function approximator in distributed reinforcement learning.
Niufuxi's Repositories
Niufuxi/AVGM
Niufuxi/Robust-and-cooperative-formation-control-of-nonlinear-multi-agent-systems
Yang's PhD work
Niufuxi/penalized-bilevel-gradient-descent
An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.
Niufuxi/Flocking-Multi-Agent
Python implementation of "Flocking for multi-agent dynamic systems: Algorithms and theory" by Olfati-Saber for multi-agent triangular formation.
Niufuxi/bp_lambda
A TD-like model for learning and using synthetic gradients
Niufuxi/on-policy-investigation
This is the official implementation of Multi-Agent PPO (MAPPO). Forked from the original repo for testing purposes.
Niufuxi/Toward-Multi-Agent-Reinforcement-Learning-for-Distributed-Event-Triggered-Control
Repository for the paper described at https://sites.google.com/view/learning-distributed-etc/start
Niufuxi/MAS-Simulation
A library for simulation of multi-agent systems under non-ideal communication
Niufuxi/awesome-safe-reinforcement-learning
Niufuxi/Multi-Agent-Reinforcement-Learning-papers
Multi-Agent Reinforcement Learning (MARL) papers
Niufuxi/Adaptive-optimal-control-of-linear-periodic-systems-An-off-policy-value-iteration-approach
Code for the paper proposing a data-driven RL method based on value iteration for linear periodic systems.
Niufuxi/Data-driven_UAV
UAV Data-driven Control Environment
Niufuxi/nash-dqn
Official code for Nash-DQN, an algorithm for two-player zero-sum Markov games. For details, see the paper "A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games" by Zihan Ding, Dijia Su, Qinghua Liu, and Chi Jin.
Niufuxi/SDMAA2C
Code for the paper "Synchronous and Distributed Multi-Agent Advantage Actor-Critic Method".
Niufuxi/Resilient-MPC-for-Cyber-Physical-Multi-Agent-Systems-under-DoS-Attacks
Niufuxi/MultiRobots_CoverMap
Multi-robot map coverage simulation
Niufuxi/LMPC_Quadrotors
Learning Model Predictive Control (LMPC) for Quadrotor Optimal Path Planning and Obstacle Avoidance
Niufuxi/IROS22_DARL1N
Niufuxi/Safe_Occlusion_Aware_Planning
Repository for "Safe Occlusion-aware Autonomous Driving via Game-Theoretic Active Perception" - RSS 2021
Niufuxi/Multiagent-RL
The official code release for the TJU RL Lab's publications in the MARL field.
Niufuxi/GSL
Generalist-Specialist Learning
Niufuxi/MultiVehicleEnv
Niufuxi/Formation-Control
time-varying formation control of UAVs
Niufuxi/DADAM
DADAM: A Consensus-based Distributed Adaptive Gradient Method for Online Optimization
Niufuxi/Reinforcement-Learning-for-Real-time-Pricing-and-Scheduling-Control-in-EV-Charging-Stations
Reinforcement Learning for Real time Pricing and Scheduling Control in EV Charging Stations
Niufuxi/RL-MPC
Niufuxi/Deep-RL-Policy-Search-for-MPC
This repo is related to Deep Policy search using MPC.
Niufuxi/Resilient-consensus-based-MARL
This repository includes an implementation of the resilient projection-based consensus actor-critic algorithm, designed to withstand adversarial attacks on communication channels.
Niufuxi/ET-distributed-UAV-path-planner
Niufuxi/TRPO-PPO-in-MARL