wanghuimu

Pinned Repositories

2s-AGCN
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition in CVPR19
Language:Python00
A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Language:Python00
A3C
Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.
Language:Python00
A3C-LSTM-with-Tensorflow
An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.
Language:Python00
A3C_grid_world
Simple tensorflow implementation of Asynchronous Advantage Actor-Critic (A3C) for a 2-D grid environment
Language:Python00
AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
Language:C++00
DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
10
LDG
PyTorch code for "Learning Temporal Attention in Dynamic Graphs with Bilinear Interactions"
Language:Python10
Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
Language:Python10

wanghuimu's Repositories

wanghuimu/DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
10
wanghuimu/Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
Language:Python10
wanghuimu/apple-store-helper
Apple Store iPhone预约助手
wanghuimu/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.
Language:Python1 0
wanghuimu/Batch-Offline--RL-Paper-Lists
Paper Collection for Batch RL with brief introductions.
1 0
wanghuimu/CGCDemandPrediction
wanghuimu/damarl
Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".
wanghuimu/Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, based on UC Berkeley's CS 285 (prev. CS 294-112)
wanghuimu/DeepClustering
Methods and Implements of Deep Clustering
wanghuimu/deeprl_network
multi-agent deep reinforcement learning for networked system control.
wanghuimu/DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (https://arxiv.org/abs/2007.12322)
wanghuimu/exact-k-recommendation
wanghuimu/football-paris
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 8th/1141
wanghuimu/GroupIM
Code for GroupIM: A Mutual Information Maximization Framework for Neural Group Recommendation (SIGIR 2020)
wanghuimu/gumbel_lstm
Experiments with binary LSTM using gumbel-sigmoid
wanghuimu/HuimuWang
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
wanghuimu/jd_seckill
京东茅台抢购，不支持其他商品！愿大家与黄牛站在同一个起跑线，公平的参与这场抢茅大赛。
wanghuimu/LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）
wanghuimu/LIRD
Deep Reinforcement Learning for Movies Recommendation System
wanghuimu/MaCA
wanghuimu/Meta-MAGIC
wanghuimu/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
wanghuimu/Multi-Agent-Coordination-Google-Football
Coordination between Deep RL Agents for Virtual Football
Language:Python1 0
wanghuimu/multiagent_gnn_policies
Learning multi-agent policies for flocking using graph neural networks
wanghuimu/on-policy
This is the official implementation of Multi-Agent PPO.
wanghuimu/pymarl2
Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning
wanghuimu/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
wanghuimu/RSPapers
A Curated List of Must-read Papers on Recommender System.
wanghuimu/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
wanghuimu/VBC
pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"