Pinned Repositories
AFLDDPG
* Wu Q, Wang S, Fan P, et al. Deep Reinforcement Learning Based Vehicle Selection for Asynchronous Federated Learning Enabled Vehicular Edge Computing[J]. arXiv preprint arXiv:2304.02832, 2023. Link: https://arxiv.org/abs/2304.02832 Code: https://github.com/qiongwu86/AFLDDPG
Computing_offloading
DRL-Based-Long-Term-Resource-Planning
Paper published in TNSM, entitled "DRL-Based Long-Term Resource Planning for Task Offloading Policies in Multi-Server Edge Computing Networks"
DRL-for-edge-computing
DRL-MEC
Dynamic Task Software Caching-Assisted Computation Offloading for Multi-Access Edge Computing
DRL-TOBS
Code for the paper "Online Joint Task Offloading and Resource Management in Heterogeneous Mobile Edge Environments".
edge-offloading
Computation offloading in mobile edge computing using reinforcement learning
Game-Theoretic-Deep-Reinforcement-Learning
Code for the paper "Joint Task Offloading and Resource Optimization in NOMA-based Vehicular Edge Computing: A Game-Theoretic DRL Approach", JSA 2022.
Graph-reinforcement-learning-literature
This open-source repository summarizes several years of research papers on graph reinforcement learning for the convenience of researchers.
PeerJ-Computer-Science
yyds-xtt's Repositories
yyds-xtt/Resources-Allocation-in-The-Edge-Computing-Environment-Using-Reinforcement-Learning
Simulates the interaction between edge servers and users with a clear graphical interface. Also implements continuous control with Deep Deterministic Policy Gradient (DDPG) to determine resource allocation (offload targets, computational resources, migration bandwidth) in the edge servers.
yyds-xtt/my_MEC_program
I built this Mobile Edge Computing simulation environment myself and use a customized DDPG reinforcement learning algorithm to make offloading decisions.
yyds-xtt/RL-Ridesharing
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning
yyds-xtt/Bilevel-Optimization-in-Coordination-Game
Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)
yyds-xtt/ComputationOffloadingRL
Matlab project for fairspace
yyds-xtt/deeprl_network
multi-agent deep reinforcement learning for networked system control.
yyds-xtt/delay-aware-MARL
Code for the paper "Delay-Aware Multi-Agent Reinforcement Learning".
yyds-xtt/drl_experiment
yyds-xtt/Federated-Learning-PyTorch
Implementation of Communication-Efficient Learning of Deep Networks from Decentralized Data
yyds-xtt/GAN-DDQN
yyds-xtt/GeneraLight
yyds-xtt/Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
yyds-xtt/hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
yyds-xtt/kaggle-Digit-Recognizer
kaggle-Digit Recognizer
yyds-xtt/KTM-DRL
yyds-xtt/Learn-CompressCSI-RA-V2X-Code
Code for Learn to Compress CSI and Allocate Resources in Vehicular Networks
yyds-xtt/LSTM-based-A2C
yyds-xtt/MADPL
Task-oriented Dialog Policy Learning with Multi-Agent Reinforcement Learning
yyds-xtt/MEC
Solving the MEC computation offloading problem with PSO and ant colony algorithms
yyds-xtt/Multi-Agent-Distributed-PPO-Traffc-light-control
Multi-agent RL for traffic light control in SUMO using distributed PPO
yyds-xtt/MultiAgentPerception
Official source code for the CVPR'20 paper "When2com: Multi-Agent Perception via Communication Graph Grouping"
yyds-xtt/new-actions-rl
yyds-xtt/Online-Flexible-Resource-Allocation
Flexible resource allocation for edge cloud computing with reinforcement learning
yyds-xtt/Paper-code-for-Machine-Learning-based-Wireless-Communication-Optimization-Problems
A collection of papers and code for wireless communication optimization problems with PyTorch-based DL
yyds-xtt/PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
yyds-xtt/policy-dynamics-value-functions
MO-ADC
yyds-xtt/RL_paper
yyds-xtt/Spectrum-Power-Allocation
Code for "Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular Networks"
yyds-xtt/tianshou
An elegant, flexible, and superfast PyTorch deep reinforcement learning platform.
yyds-xtt/UVCO-algorithm