zhangjw-THU's Stars
nlohmann/json
JSON for Modern C++
itdevbooks/pdf
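Programming e-books and books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories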
datawhalechina/easy-rl
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
zhoubolei/introRL
Intro to Reinforcement Learning (强化学习纲要, "Outline of Reinforcement Learning")
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
abisee/pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
wouterkool/attention-learn-to-route
Attention based model for learning to solve different routing problems
bstabler/TransportationNetworks
Transportation Networks for Research
Hanjun-Dai/graph_comb_opt
Implementation of "Learning Combinatorial Optimization Algorithms over Graphs"
mveres01/pytorch-drl4vrp
Implementation of: Nazari, Mohammadreza, et al. "Deep Reinforcement Learning for Solving the Vehicle Routing Problem." arXiv preprint arXiv:1802.04240 (2018).
Acmece/rl-collision-avoidance
Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
praveen-palanisamy/macad-gym
Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
Hanjun-Dai/pytorch_structure2vec
PyTorch implementation of structure2vec (https://arxiv.org/abs/1603.05629)
chaitjo/graph-convnet-tsp
Code for the paper 'An Efficient Graph Convolutional Network Technique for the Travelling Salesman Problem' (INFORMS Annual Meeting Session 2019)
Rintarooo/VRP_DRL_MHA
"Attention, Learn to Solve Routing Problems!" [Kool+, 2019], Capacitated Vehicle Routing Problem solver
Rintarooo/TSP_DRL_PtrNet
"Neural Combinatorial Optimization with Reinforcement Learning" [Bello+, 2016], Traveling Salesman Problem solver
eduardohenriquearnold/CODD
Cooperative Driving Dataset: a dataset for multi-agent driving scenarios
decisionforce/pgdrive
PGDrive: an open-ended driving simulator with infinite scenes from procedural generation
facebookresearch/CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
vshallc/PtrNets
Pointer Networks
yining043/TSP-improve
An improvement-based deep reinforcement learning algorithm for solving the TSP, presented in the paper https://arxiv.org/abs/1912.05784v2.
datawhalechina/releasing-research-code
Tips for releasing research paper code
jiminsun/pointer-generator
PyTorch implementation of the ACL paper 'Get To The Point: Summarization with Pointer-Generator Networks' (See et al., 2017), adapted to a Korean dataset
eugenevinitsky/decentralized_bottlenecks
Code and figures for bottlenecks paper
intelligent-control-lab/Auto_Vehicle_Simulator
decisionforce/pgdrive-generalization-paper
The official material of the paper: "Improving the Generalization of End-to-End Driving through Procedural Generation".