TianLiang96

Pinned Repositories

abbrv.jabref.org
A repository of abbreviations for references, e.g., for conferences, journals, institutes, etc.
Language: Python
CS-Notes
:books: Essential fundamentals for technical interviews, Leetcode, computer operating systems, computer networks, system design, Java, Python, C++
Language: Java
Full-Duplex-User-Pairing
This project addresses user pairing in a full-duplex communication system using deep reinforcement learning (DQN). We train the agent on 32 different groups of users, each group including 6 uplink users and 8 downlink users. First, we formulate the user-pairing problem as a Markov decision process. Then, to accelerate training, we add expert experience to the replay buffer so that the agent learns quickly.
Full-Duplex-User-Pairing_1
This project addresses user pairing in a full-duplex communication system using deep reinforcement learning (DQN). We train the agent on 32 different groups of users, each group including 6 uplink users and 8 downlink users. First, we formulate the user-pairing problem as a Markov decision process. Then, to accelerate training, we add expert experience to the replay buffer so that the agent learns quickly.
Language: Python
Machine-Learning
:zap: Machine Learning in Action (Python3): kNN, decision trees, Bayes, logistic regression, SVM, linear regression, tree regression
Language: HTML
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
Language: Python
ZJU-nCov-Hitcarder-Sample
Sample of https://github.com/Long0x0/ZJU-nCov-Hitcarder.

TianLiang96's Repositories

TianLiang96/Full-Duplex-User-Pairing_1
This project addresses user pairing in a full-duplex communication system using deep reinforcement learning (DQN). We train the agent on 32 different groups of users, each group including 6 uplink users and 8 downlink users. First, we formulate the user-pairing problem as a Markov decision process. Then, to accelerate training, we add expert experience to the replay buffer so that the agent learns quickly.
Language: Python
TianLiang96/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
Language: Python
TianLiang96/abbrv.jabref.org
A repository of abbreviations for references, e.g., for conferences, journals, institutes, etc.
Language: Python
TianLiang96/CS-Notes
:books: Essential fundamentals for technical interviews, Leetcode, computer operating systems, computer networks, system design, Java, Python, C++
Language: Java
TianLiang96/Full-Duplex-User-Pairing
This project addresses user pairing in a full-duplex communication system using deep reinforcement learning (DQN). We train the agent on 32 different groups of users, each group including 6 uplink users and 8 downlink users. First, we formulate the user-pairing problem as a Markov decision process. Then, to accelerate training, we add expert experience to the replay buffer so that the agent learns quickly.
TianLiang96/Machine-Learning
:zap: Machine Learning in Action (Python3): kNN, decision trees, Bayes, logistic regression, SVM, linear regression, tree regression
Language: HTML
TianLiang96/ZJU-nCov-Hitcarder-Sample
Sample of https://github.com/Long0x0/ZJU-nCov-Hitcarder.