hunter55555
My name is Xinchao Wei. Master of AI, Northwest Polytechnical University in Xi'an. Research interest: Reinforcement Learning
Northwest Polytechnical UniversityXi'an china
hunter55555's Stars
leekwoon/hrl-nav
[ICRA 2023] Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
XinJingHao/PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
daiweLi/Fast-Combat-Simulation
A tool that provides fast air combat simulation and display
Linaom1214/RL_air-combat
基于强化学习的空战对抗
walder/Skynet-IADS
Adds IADS (integrated air defence) functionality to Digital Combat Simulator.
WxxShirley/fuzzy_expert_system
基于模糊专家系统的笔记本选购推荐系统,人工智能课程期末pj。
KevinWang15/fuzzy-expert-system-buy-dslr-camera
人工智能课程Project —— 使用模糊专家系统做单反相机购买推荐
chgl16/animal-recognition-expert-system
:tiger: 用产生式系统设计的一个简单动物识别专家系统,正向推理,支持规则增删改查
zhangbincheng1997/expert-system
专家系统作业——井字棋、推理机、决策树
zoeyuchao/mappo
This is the official implementation of Multi-Agent PPO.
OpenLMLab/MOSS-RLHF
MOSS-RLHF
forthespada/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
tangwz/DistSysDeepDive
Code repository for the book 'Distributed System Deep Dive'《深入理解分布式系统》代码仓库
CodingDocs/awesome-cs
计算机优质书籍搜罗+学习路线推荐!
Jackpopc/CS-Books-Store
你想要的计算机经典书籍,这里都有!
justjavac/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
iptv-org/iptv
Collection of publicly available IPTV channels from all over the world
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
PhDChe/Poker-1
Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
younggyoseo/pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
ImmanuelXIV/ppo-self-play
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
davidADSP/SIMPLE
Selfplay In MultiPlayer Environments
hkinke/sac_ae
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images (Yarats and al.,2020)