cheryyunl

Reinforcement Learning, Interactive Learning System

Toronto

Pinned Repositories

AdHoc_AAMAS-17
Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"
Language:Jupyter Notebook00
AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
Language:C++0 1 00
ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
Language:Python0 0 00
Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
Language:Python0 1 00
Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
0 1 00
cheryyunl.github.io
Language:JavaScript0 1 01
Make-An-Agent
Language:Python24 2 50
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
1 1 01
MARL-Tutorial
2 1 01
Paper-List-of-MARL
A new paper list for multi-agent reinforcement learning (actively maintained)
25 5 02

cheryyunl's Repositories

cheryyunl/Paper-List-of-MARL
A new paper list for multi-agent reinforcement learning (actively maintained)
25 5 02
cheryyunl/Make-An-Agent
Language:Python24 2 50
cheryyunl/MARL-Tutorial
2 1 01
cheryyunl/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
Language:C++0 1 00
cheryyunl/ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
Language:Python0 0 00
cheryyunl/Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
0 1 00
cheryyunl/cheryyunl.github.io
Language:JavaScript0 1 01
cheryyunl/DrM-pretrain
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
Language:Python0 0
cheryyunl/FAMO
Official PyTorch Implementation for Fast Adaptive Multitask Optimization (FAMO)
Language:Python0 0
cheryyunl/homework
Assignments for CS294-112.
Language:Python0 0
cheryyunl/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
cheryyunl/mfrl
Mean Field Multi-Agent Reinforcement Learning
Language:Python1 0
cheryyunl/models
Models and examples built with TensorFlow
Language:Python0 0
cheryyunl/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1 0
cheryyunl/nerfies.github.io
cheryyunl/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters
Language:Python
cheryyunl/nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
Language:Python0 0
cheryyunl/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++0 0
cheryyunl/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Language:Python0 0
cheryyunl/PolicyGenerator
Language:Python1 0
cheryyunl/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating discrete and continuous, fully and partially observable Markov decision processes.
Language:Julia0 0
cheryyunl/pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Language:Python0 0
cheryyunl/pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
cheryyunl/robust_trainer
Code for robust trainer on MuJoCo
Language:Python0 0
cheryyunl/SA_DQN
[NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning
Language:Python0 0
cheryyunl/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python1 0
cheryyunl/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Language:Python1 0
cheryyunl/TensorRT
TensorRT is a C++ library that facilitates high performance inference on NVIDIA GPUs and deep learning accelerators.
Language:C++0 0
cheryyunl/transferlearning
Everything about Transfer Learning and Domain Adaptation--迁移学习
Language:Python1 0
cheryyunl/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
Language:Python