mumu-peng's Stars
ZiwenZhuang/parkour
[CoRL 2023] Robot Parkour Learning
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
oxwhirl/smacv2
PKU-RL/Plan4MC
Reinforcement learning and planning for Minecraft.
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
makepath/xarray-spatial
Raster-based Spatial Analytics for Python
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
binary-husky/unreal-map
Multiagent research environment toolbox based on Unreal Engine
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
LaoGong-zp/Transformer
Learning materials of Transformer, including my code, XMind, PDF and so on
FengQuanLi/WZCQ
用基于策略梯度得强化学习方法训练AI玩王者荣耀
chscheller/sc2_imitation_learning
StarCraft 2 Imitation Learning
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
me115/linuxtools_rst
Linux工具快速教程
YuhangSong/Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
hijkzzz/noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
dropreg/R-Drop
liuruoze/mini-AlphaStar
(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Research.
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).