hunter55555
My name is Xinchao Wei. Master of AI, Northwest Polytechnical University in Xi'an. Research interest: Reinforcement Learning
Northwest Polytechnical UniversityXi'an china
hunter55555's Stars
Linaom1214/RL_air-combat
基于强化学习的空战对抗
walder/Skynet-IADS
Adds IADS (integrated air defence) functionality to Digital Combat Simulator.
WxxShirley/fuzzy_expert_system
基于模糊专家系统的笔记本选购推荐系统,人工智能课程期末pj。
KevinWang15/fuzzy-expert-system-buy-dslr-camera
人工智能课程Project —— 使用模糊专家系统做单反相机购买推荐
chgl16/animal-recognition-expert-system
:tiger: 用产生式系统设计的一个简单动物识别专家系统,正向推理,支持规则增删改查
zhangbincheng1997/expert-system
专家系统作业——井字棋、推理机、决策树
zoeyuchao/mappo
This is the official implementation of Multi-Agent PPO.
OpenLMLab/MOSS-RLHF
MOSS-RLHF
forthespada/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
tangwz/DistSysDeepDive
Code repository for the book 'Distributed System Deep Dive'《深入理解分布式系统》代码仓库
CodingDocs/awesome-cs
计算机优质书籍搜罗+学习路线推荐!
Jackpopc/CS-Books-Store
你想要的计算机经典书籍,这里都有!
justjavac/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
iptv-org/iptv
Collection of publicly available IPTV channels from all over the world
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
PhDChe/Poker-1
Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
younggyoseo/pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
ImmanuelXIV/ppo-self-play
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
davidADSP/SIMPLE
Selfplay In MultiPlayer Environments
hkinke/sac_ae
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images (Yarats and al.,2020)
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
TJU-DRL-LAB/Multiagent-RL
The official code releasement of publications in MARL field of TJU RL lab.
nasa/XPlaneConnect
The X-Plane Communications Toolbox is a research tool used to interact with the X-Plane flight simulator
xuhao1/FOXTracker
Facial Head Pose Tracker for Gaming
vazgriz/FlightSim
zhangzhishan/deeplearningthesis
master thesis
polossk/LaTeX-Template-For-NPU-Thesis
西北工业大学本科毕业设计论文模版 | Thesis Template for Northwestern Polytechnical University