Pinned Repositories
AdHoc_AAMAS-17
Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"
AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
cheryyunl.github.io
Make-An-Agent
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
MARL-Tutorial
Paper-List-of-MARL
A new paper list for multi-agent reinforcement learning (actively maintained)
cheryyunl's Repositories
cheryyunl/Paper-List-of-MARL
A new paper list for multi-agent reinforcement learning (actively maintained)
cheryyunl/Make-An-Agent
cheryyunl/MARL-Tutorial
cheryyunl/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
cheryyunl/ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
cheryyunl/Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
cheryyunl/cheryyunl.github.io
cheryyunl/DrM-pretrain
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
cheryyunl/FAMO
Official PyTorch Implementation for Fast Adaptive Multitask Optimization (FAMO)
cheryyunl/homework
Assignments for CS294-112.
cheryyunl/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
cheryyunl/mfrl
Mean Field Multi-Agent Reinforcement Learning
cheryyunl/models
Models and examples built with TensorFlow
cheryyunl/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
cheryyunl/nerfies.github.io
cheryyunl/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters
cheryyunl/nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
cheryyunl/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
cheryyunl/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
cheryyunl/PolicyGenerator
cheryyunl/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating discrete and continuous, fully and partially observable Markov decision processes.
cheryyunl/pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
cheryyunl/pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
cheryyunl/robust_trainer
Code for robust trainer on MuJoCo
cheryyunl/SA_DQN
[NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning
cheryyunl/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
cheryyunl/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
cheryyunl/TensorRT
TensorRT is a C++ library that facilitates high performance inference on NVIDIA GPUs and deep learning accelerators.
cheryyunl/transferlearning
Everything about Transfer Learning and Domain Adaptation--迁移学习
cheryyunl/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.