zhangjw-THU

Tsinghua Univ.Beijing，China

zhangjw-THU's Stars

nlohmann/json
JSON for Modern C++
Language:C++42.2k 767 2.2k6.7k
itdevbooks/pdf
编程电子书，电子书，编程书籍，包括C，C#，Docker，Elasticsearch，Git，Hadoop，HeadFirst，Java，Javascript，jvm，Kafka，Linux，Maven，MongoDB，MyBatis，MySQL，Netty，Nginx，Python，RabbitMQ，Redis，Scala，Solr，Spark，Spring，SpringBoot，SpringCloud，TCPIP，Tomcat，Zookeeper，人工智能，大数据类，并发编程，数据库类，数据挖掘，新面试题，架构设计，算法系列，计算机类，设计模式，软件测试，重构优化，等更多分类
18.9k 392 05.9k
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook9.1k 78 1431.8k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python8.7k 61 1.5k1.7k
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language:Python3.6k 51 253833
zhoubolei/introRL
Intro to Reinforcement Learning (强化学习纲要）
3.2k 92 8483
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
Language:Python2.6k 29 459735
abisee/pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
Language:Python2.2k 49 155813
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Language:Python1.4k 13 111279
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python1.3k 7 90289
wouterkool/attention-learn-to-route
Attention based model for learning to solve different routing problems
Language:Jupyter Notebook1.1k 23 53340
bstabler/TransportationNetworks
Transportation Networks for Research
Language:Jupyter Notebook761 52 28443
Hanjun-Dai/graph_comb_opt
Implementation of "Learning Combinatorial Optimization Algorithms over Graphs"
Language:C++489 18 33135
mveres01/pytorch-drl4vrp
Implementation of: Nazari, Mohammadreza, et al. "Deep Reinforcement Learning for Solving the Vehicle Routing Problem." arXiv preprint arXiv:1802.04240 (2018).
Language:Python435 14 9118
Acmece/rl-collision-avoidance
Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
Language:Python329 7 2792
praveen-palanisamy/macad-gym
Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
Language:Python327 10 4573
Hanjun-Dai/pytorch_structure2vec
pytorch implementation of structure2vec (https://arxiv.org/abs/1603.05629)
Language:Python305 9 2475
chaitjo/graph-convnet-tsp
Code for the paper 'An Efficient Graph Convolutional Network Technique for the Travelling Salesman Problem' (INFORMS Annual Meeting Session 2019)
Language:Python293 5 1262
Rintarooo/VRP_DRL_MHA
"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
Language:Python168 2 036
Rintarooo/TSP_DRL_PtrNet
"Neural Combinatorial Optimization with Reinforcement Learning"[Bello+, 2016], Traveling Salesman Problem solver
Language:Python158 3 235
eduardohenriquearnold/CODD
Cooperative Driving Dataset: a dataset for multi-agent driving scenarios
Language:Python136 6 622
decisionforce/pgdrive
PGDrive: an open-ended driving simulator with infinite scenes from procedural generation
Language:Python127 8 15216
facebookresearch/CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
Language:Python127 5 1124
vshallc/PtrNets
Pointer Networks
Language:Python100 4 731
yining043/TSP-improve
An improvement-based Deep Reinforcement Learning Algorithm presented in paper https://arxiv.org/abs/1912.05784v2 for solving the TSP problem.
Language:Jupyter Notebook87 2 1128
datawhalechina/releasing-research-code
发布研究论文代码的小技巧
75 3 017
jiminsun/pointer-generator
Pytorch implementation of the ACL paper 'Get To The Point: Summarization with Pointer-Generator Networks (See et al., 2017)', adapted to a Korean dataset
Language:Python32 3 21
eugenevinitsky/decentralized_bottlenecks
Code and figures for bottlenecks paper
Language:Python21 5 23
intelligent-control-lab/Auto_Vehicle_Simulator
Language:Jupyter Notebook21 4 012
decisionforce/pgdrive-generalization-paper
The official material of the paper: "Improving the Generalization of End-to-End Driving through Procedural Generation".
Language:Python4 2 11

zhangjw-THU

zhangjw-THU's Stars

nlohmann/json

itdevbooks/pdf

datawhalechina/easy-rl

DLR-RM/stable-baselines3

AI4Finance-Foundation/ElegantRL

zhoubolei/introRL

Farama-Foundation/HighwayEnv

abisee/pointer-generator

starry-sky6688/MARL-Algorithms

marlbenchmark/on-policy

wouterkool/attention-learn-to-route

bstabler/TransportationNetworks

Hanjun-Dai/graph_comb_opt

mveres01/pytorch-drl4vrp

Acmece/rl-collision-avoidance

praveen-palanisamy/macad-gym

Hanjun-Dai/pytorch_structure2vec

chaitjo/graph-convnet-tsp

Rintarooo/VRP_DRL_MHA

Rintarooo/TSP_DRL_PtrNet

eduardohenriquearnold/CODD

decisionforce/pgdrive

facebookresearch/CollaQ

vshallc/PtrNets

yining043/TSP-improve

datawhalechina/releasing-research-code

jiminsun/pointer-generator

eugenevinitsky/decentralized_bottlenecks

intelligent-control-lab/Auto_Vehicle_Simulator

decisionforce/pgdrive-generalization-paper