wulihan20212021

wulihan20212021's Stars

GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language:Python15.2k 84 7922.3k
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4k 242 9725
facebookresearch/rebel
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
Language:C++652 26 33110
chenhongge/StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
116 5 318
santhisenan/SDN_DDoS_Simulation
An attempt to detect and prevent DDoS attacks using reinforcement learning. The simulation was done using Mininet.
Language:Python108 5 727
matlab-deep-learning/rl-agent-based-traffic-control
Develop agent-based traffic management system by model-free reinforcement learning
Language:MATLAB45 8 015
p-casgrain/Nash-DQN
Deep Reinforcement Learning for Nash Equilibria
Language:Jupyter Notebook39 1 021
asokraju/Adv-MARL
Adversarial attacks in consensus-based multi-agent reinforcement learning
Language:Python18 3 26
DeepakKarishetti/Reinforcement_learning-PID-auto-tuning
Auto tuning of PID parameters of a quad-rotor using Q-learning
Language:MATLAB18 2 04
ddfan/swarm_evolve
Model-Based Stochastic Search for Large Scale Optimization of Multi-Agent UAV Swarms
Language:C++15 3 312
wangbx66/differentially-private-q-learning
Language:Python12 2 04
apizbakar/Soft-Actor-Critic-Reinforcement-Learning-Mobile-Robot-Navigation
This example uses Soft Actor Critic(SAC) based reinforcement learning to develop the mobile robot navigation. For a brief summary of the SAC algorithm, see Soft Actor Critic(SAC) Agents. This example scenario trains a mobile robot to navigate from location A to location B to avoid obstacles given range sensor readings that detect obstacles in the map. The objective of the reinforcement learning algorithm is to learn what controls (linear and angular velocity) for navigation from an initial to goal position and during the travel also can avoid colliding into obstacles. This example uses an occupancy map of a known environment to generate range sensor readings, detect obstacles, and check collisions the robot may make. The range sensor readings are the observations for the SAC agent, and the linear and angular velocity controls are the action.
Language:MATLAB110
semanticweights/tarok
:spades: Slovenian Tarok card game environment for the OpenSpiel framework.
Language:C++10 3 92
mnecipkurt/tsg19
Language:MATLAB92
DSS-lab/DRLCyberAssessment_DQNCode
Language:MATLAB7 3 01
abhisikdar/Quickest-Detection-FDI-Remote-Estimation
Code for our paper titled "Quickest detection of false data injection in remote state estimation" published at IEEE ISIT 2021.
Language:Jupyter Notebook6 1 03
xahiru/NetworkSecRLwithPareto
Network Security Attack and Defence Strategy selection using Reinforcement Learning and Pareto efficiency
Language:MATLAB5 2 02
peweetheman/Reinforcement_Learning_In_Two_Player_Simultaneous_Action_Games
Language:Jupyter Notebook33