HeyuanMingong

An alchemist on reinforcement learning, an academic laborer in Heyuan.

Nanjing UniversityChina

Pinned Repositories

attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Language:Python10
DiffusionQL
Language:Python2 0 01
heyuanmingong.github.io
Personal Homepage
Language:HTML53
irl
Code for "Incremental reinforcement learning"
Language:Python7 1 14
irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
Language:Python5 1 01
iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
Language:Python6 2 02
llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
Language:Python21 2 03
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python20
sllrl
Code for "Scalable lifelong reinforcement learning"
Language:Python30
VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
Language:Python20

HeyuanMingong's Repositories

HeyuanMingong/llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
Language:Python21 2 03
HeyuanMingong/irl
Code for "Incremental reinforcement learning"
Language:Python7 1 14
HeyuanMingong/iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
Language:Python6 2 02
HeyuanMingong/heyuanmingong.github.io
Personal Homepage
Language:HTML53
HeyuanMingong/irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
Language:Python5 1 01
HeyuanMingong/sllrl
Code for "Scalable lifelong reinforcement learning"
Language:Python30
HeyuanMingong/DiffusionQL
Language:Python2 0 01
HeyuanMingong/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python20
HeyuanMingong/VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
Language:Python20
HeyuanMingong/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Language:Python10
HeyuanMingong/dgr
PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017
Language:Python10
HeyuanMingong/OLPS
Online Portfolio Selection toolbox
Language:MATLAB10
HeyuanMingong/random-network-distillation-pytorch
Random Network Distillation pytorch
1
HeyuanMingong/Survey_PortfolioSelection
Online portfolio selection is a fundamental problem in computational ﬁnance, which has been extensively studied across several research communities, including ﬁnance, statistics, artiﬁcial intelligence, machine learning, and data mining, etc. This article aims to provide a comprehensive survey and a structural understanding of published online portfolio selection techniques.
Language:Python10
HeyuanMingong/attention-learn-to-route
Attention based model for learning to solve different routing problems
HeyuanMingong/CORRO
CORRO code
HeyuanMingong/ddpm2
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
HeyuanMingong/decision-diffuser
HeyuanMingong/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
HeyuanMingong/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
HeyuanMingong/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
HeyuanMingong/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
HeyuanMingong/GAN
PyTorch implementations of Generative Adversarial Networks.
HeyuanMingong/MTDiff
MTDiff
HeyuanMingong/pearl
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Language:Python1 0
HeyuanMingong/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
HeyuanMingong/SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
Language:Python
HeyuanMingong/varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)