mumu-peng

mumu-peng's Stars

ZiwenZhuang/parkour
[CoRL 2023] Robot Parkour Learning
Language:Python55599
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
Language:Python20414
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Language:Jupyter Notebook2.6k161
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
1.2k268
oxwhirl/smacv2
Language:Python21032
PKU-RL/Plan4MC
Reinforcement learning and planning for Minecraft.
Language:Python15520
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language:JavaScript5.6k528
makepath/xarray-spatial
Raster-based Spatial Analytics for Python
Language:Python83585
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
Language:Python63972
binary-husky/unreal-map
Multiagent research environment toolbox based on Unreal Engine
Language:Python19433
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识，包括面试题，各种基础知识，工程能力等等，提升核心竞争力
Language:Python6.9k1.2k
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language:Python3.7k847
LaoGong-zp/Transformer
Learning materials of Transformer, including my code, XMind, PDF and so on
Language:Jupyter Notebook33856
FengQuanLi/WZCQ
用基于策略梯度得强化学习方法训练AI玩王者荣耀
Language:Python1.6k392
chscheller/sc2_imitation_learning
StarCraft 2 Imitation Learning
Language:Python293
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python617122
me115/linuxtools_rst
Linux工具快速教程
Language:HTML5.9k1.4k
YuhangSong/Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
Language:ASP.NET1018
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python3.9k849
hijkzzz/noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
Language:Python546
dropreg/R-Drop
Language:Python869107
liuruoze/mini-AlphaStar
(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Research.
Language:Python31257
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python1.3k296