Pinned Repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
batch-ppo
Efficient Batched Reinforcement Learning in TensorFlow
CompilerProject-2020Spring
Course Project. PKU Compiler Design. Spring, 2020.
CS294_Fall-2017_HW
Assignments for CS294-112 Fall 2017
CS294_Fall-2018_HW
Assignments for CS294-112 Fall 2018
hbjiang.github.io
白嫖一下github的https🤣
infer-policy-feature
MACE
[AAAI 2024] Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
neurips2020-flatland-starter-kit
Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git
robosumo-selfplay
Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
SigmaBM's Repositories
SigmaBM/MACE
[AAAI 2024] Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
SigmaBM/robosumo-selfplay
Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
SigmaBM/CLIP4MC
[ECCV 2024] Reinforcement Learning Friendly Vision-Language Model for Minecraft
SigmaBM/neurips2020-flatland-starter-kit
Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git
SigmaBM/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
SigmaBM/batch-ppo
Efficient Batched Reinforcement Learning in TensorFlow
SigmaBM/CompilerProject-2020Spring
Course Project. PKU Compiler Design. Spring, 2020.
SigmaBM/CS294_Fall-2017_HW
Assignments for CS294-112 Fall 2017
SigmaBM/CS294_Fall-2018_HW
Assignments for CS294-112 Fall 2018
SigmaBM/hbjiang.github.io
白嫖一下github的https🤣
SigmaBM/infer-policy-feature
SigmaBM/lihang-code
《统计学习方法》的代码实现
SigmaBM/COPL
[ECCV 2024] Visual Grounding for Object-Level Generalization in Reinforcement Learning
SigmaBM/meta-mapg-code
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning"
SigmaBM/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
SigmaBM/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
SigmaBM/nd889
Udacity Artificial Intelligence Nanodegree
SigmaBM/openbilibili-go-common
哔哩哔哩 bilibili 网站后台工程 源码
SigmaBM/pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
SigmaBM/robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
SigmaBM/spinningup
An educational resource to help anyone learn deep reinforcement learning.
SigmaBM/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II