SigmaBM

Peking UniversityChina

Pinned Repositories

baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python00
batch-ppo
Efficient Batched Reinforcement Learning in TensorFlow
Language:Python00
CompilerProject-2020Spring
Course Project. PKU Compiler Design. Spring, 2020.
Language:C++00
CS294_Fall-2017_HW
Assignments for CS294-112 Fall 2017
Language:Python00
CS294_Fall-2018_HW
Assignments for CS294-112 Fall 2018
Language:Python00
hbjiang.github.io
白嫖一下github的https🤣
Language:JavaScript0 1 00
infer-policy-feature
Language:Python00
MACE
[AAAI 2024] Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Language:Python6 1 00
neurips2020-flatland-starter-kit
Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git
Language:Jupyter Notebook10
robosumo-selfplay
Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
Language:Python6 2 13

SigmaBM's Repositories

SigmaBM/MACE
[AAAI 2024] Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Language:Python6 1 00
SigmaBM/robosumo-selfplay
Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
Language:Python6 2 13
SigmaBM/CLIP4MC
[ECCV 2024] Reinforcement Learning Friendly Vision-Language Model for Minecraft
1
SigmaBM/neurips2020-flatland-starter-kit
Forked from https://gitlab.aicrowd.com/flatland/neurips2020-flatland-starter-kit.git
Language:Jupyter Notebook10
SigmaBM/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python00
SigmaBM/batch-ppo
Efficient Batched Reinforcement Learning in TensorFlow
Language:Python00
SigmaBM/CompilerProject-2020Spring
Course Project. PKU Compiler Design. Spring, 2020.
Language:C++00
SigmaBM/CS294_Fall-2017_HW
Assignments for CS294-112 Fall 2017
Language:Python00
SigmaBM/CS294_Fall-2018_HW
Assignments for CS294-112 Fall 2018
Language:Python00
SigmaBM/hbjiang.github.io
白嫖一下github的https🤣
Language:JavaScript0 1 00
SigmaBM/infer-policy-feature
Language:Python00
SigmaBM/lihang-code
《统计学习方法》的代码实现
Language:Jupyter Notebook00
SigmaBM/COPL
[ECCV 2024] Visual Grounding for Object-Level Generalization in Reinforcement Learning
Language:Python
SigmaBM/meta-mapg-code
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning"
Language:Python1 0
SigmaBM/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Language:Java
SigmaBM/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python
SigmaBM/nd889
Udacity Artificial Intelligence Nanodegree
Language:Jupyter Notebook
SigmaBM/openbilibili-go-common
哔哩哔哩 bilibili 网站后台工程源码
Language:Go
SigmaBM/pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
Language:Jupyter Notebook
SigmaBM/robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
Language:Python
SigmaBM/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python
SigmaBM/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

SigmaBM

Pinned Repositories

baselines

batch-ppo

CompilerProject-2020Spring

CS294_Fall-2017_HW

CS294_Fall-2018_HW

hbjiang.github.io

infer-policy-feature

MACE

neurips2020-flatland-starter-kit

robosumo-selfplay

SigmaBM's Repositories

SigmaBM/MACE

SigmaBM/robosumo-selfplay

SigmaBM/CLIP4MC

SigmaBM/neurips2020-flatland-starter-kit

SigmaBM/baselines

SigmaBM/batch-ppo

SigmaBM/CompilerProject-2020Spring

SigmaBM/CS294_Fall-2017_HW

SigmaBM/CS294_Fall-2018_HW

SigmaBM/hbjiang.github.io

SigmaBM/infer-policy-feature

SigmaBM/lihang-code

SigmaBM/COPL

SigmaBM/meta-mapg-code

SigmaBM/MineDojo

SigmaBM/Minigrid

SigmaBM/nd889

SigmaBM/openbilibili-go-common

SigmaBM/pomegranate

SigmaBM/robosumo

SigmaBM/spinningup

SigmaBM/StarCraft