HeyuanMingong
An alchemist on reinforcement learning, an academic laborer in Heyuan.
Nanjing UniversityChina
Pinned Repositories
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
dgr
PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017
heyuanmingong.github.io
Personal Homepage
irl
Code for "Incremental reinforcement learning"
irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
HeyuanMingong's Repositories
HeyuanMingong/llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
HeyuanMingong/irl
Code for "Incremental reinforcement learning"
HeyuanMingong/iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
HeyuanMingong/heyuanmingong.github.io
Personal Homepage
HeyuanMingong/irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
HeyuanMingong/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
HeyuanMingong/VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
HeyuanMingong/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
HeyuanMingong/dgr
PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017
HeyuanMingong/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
HeyuanMingong/OLPS
Online Portfolio Selection toolbox
HeyuanMingong/random-network-distillation-pytorch
Random Network Distillation pytorch
HeyuanMingong/Survey_PortfolioSelection
Online portfolio selection is a fundamental problem in computational finance, which has been extensively studied across several research communities, including finance, statistics, artificial intelligence, machine learning, and data mining, etc. This article aims to provide a comprehensive survey and a structural understanding of published online portfolio selection techniques.
HeyuanMingong/attention-learn-to-route
Attention based model for learning to solve different routing problems
HeyuanMingong/ddpm2
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
HeyuanMingong/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
HeyuanMingong/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
HeyuanMingong/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
HeyuanMingong/GAN
PyTorch implementations of Generative Adversarial Networks.
HeyuanMingong/pearl
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
HeyuanMingong/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
HeyuanMingong/SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548