hunter55555

My name is Xinchao Wei. Master of AI, Northwest Polytechnical University in Xi'an. Research interest: Reinforcement Learning

Northwest Polytechnical UniversityXi'an china

hunter55555's Stars

leekwoon/hrl-nav
[ICRA 2023] Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning
Language:Python948
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Language:Python50k16.2k
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python604118
XinJingHao/PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
Language:Python11516
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language:Python1.2k149
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Language:Python61997
daiweLi/Fast-Combat-Simulation
A tool that provides fast air combat simulation and display
Language:C++269
Linaom1214/RL_air-combat
基于强化学习的空战对抗
Language:Python5510
walder/Skynet-IADS
Adds IADS (integrated air defence) functionality to Digital Combat Simulator.
Language:Lua20043
WxxShirley/fuzzy_expert_system
基于模糊专家系统的笔记本选购推荐系统，人工智能课程期末pj。
Language:Python234
KevinWang15/fuzzy-expert-system-buy-dslr-camera
人工智能课程Project —— 使用模糊专家系统做单反相机购买推荐
Language:JavaScript436
chgl16/animal-recognition-expert-system
:tiger: 用产生式系统设计的一个简单动物识别专家系统，正向推理，支持规则增删改查
Language:Java359
zhangbincheng1997/expert-system
专家系统作业——井字棋、推理机、决策树
Language:Python4516
zoeyuchao/mappo
This is the official implementation of Multi-Agent PPO.
Language:Python9019
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.3k98
forthespada/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
20.6k3.7k
tangwz/DistSysDeepDive
Code repository for the book 'Distributed System Deep Dive'《深入理解分布式系统》代码仓库
Language:Go10515
CodingDocs/awesome-cs
计算机优质书籍搜罗+学习路线推荐！
2.4k349
Jackpopc/CS-Books-Store
你想要的计算机经典书籍，这里都有！
4.7k824
justjavac/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍，欢迎投稿
111k28.2k
iptv-org/iptv
Collection of publicly available IPTV channels from all over the world
Language:JavaScript84.9k2.5k
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
Language:Python46655
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python54.4k5.6k
PhDChe/Poker-1
Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping
123
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
Language:Python28729
gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
Language:Python20630
younggyoseo/pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
Language:Python4510
ImmanuelXIV/ppo-self-play
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
Language:Python131
davidADSP/SIMPLE
Selfplay In MultiPlayer Environments
Language:Python297103
hkinke/sac_ae
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images (Yarats and al.,2020)
Language:Python4

hunter55555

hunter55555's Stars

leekwoon/hrl-nav

ultralytics/yolov5

hijkzzz/pymarl2

XinJingHao/PPO-Continuous-Pytorch

XinJingHao/DRL-Pytorch

vwxyzjn/ppo-implementation-details

daiweLi/Fast-Combat-Simulation

Linaom1214/RL_air-combat

walder/Skynet-IADS

WxxShirley/fuzzy_expert_system

KevinWang15/fuzzy-expert-system-buy-dslr-camera

chgl16/animal-recognition-expert-system

zhangbincheng1997/expert-system

zoeyuchao/mappo

OpenLMLab/MOSS-RLHF

forthespada/CS-Books

tangwz/DistSysDeepDive

CodingDocs/awesome-cs

Jackpopc/CS-Books-Store

justjavac/free-programming-books-zh_CN

iptv-org/iptv

PKU-MARL/HARL

labmlai/annotated_deep_learning_paper_implementations

PhDChe/Poker-1

inspirai/TimeChamber

gxywy/rl-plotter

younggyoseo/pytorch-nfsp

ImmanuelXIV/ppo-self-play

davidADSP/SIMPLE

hkinke/sac_ae