HeyuanMingong
An alchemist on reinforcement learning, an academic laborer in Heyuan.
Nanjing UniversityChina
Pinned Repositories
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
DiffusionQL
heyuanmingong.github.io
Personal Homepage
irl
Code for "Incremental reinforcement learning"
irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
sllrl
Code for "Scalable lifelong reinforcement learning"
VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
HeyuanMingong's Repositories
HeyuanMingong/llirl
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
HeyuanMingong/irl
Code for "Incremental reinforcement learning"
HeyuanMingong/iwies
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
HeyuanMingong/heyuanmingong.github.io
Personal Homepage
HeyuanMingong/irl_cs
Code for "Incremental Reinforcement Learning in Continuous Spaces"
HeyuanMingong/sllrl
Code for "Scalable lifelong reinforcement learning"
HeyuanMingong/DiffusionQL
HeyuanMingong/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
HeyuanMingong/VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck
HeyuanMingong/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
HeyuanMingong/dgr
PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017
HeyuanMingong/OLPS
Online Portfolio Selection toolbox
HeyuanMingong/random-network-distillation-pytorch
Random Network Distillation pytorch
HeyuanMingong/Survey_PortfolioSelection
Online portfolio selection is a fundamental problem in computational finance, which has been extensively studied across several research communities, including finance, statistics, artificial intelligence, machine learning, and data mining, etc. This article aims to provide a comprehensive survey and a structural understanding of published online portfolio selection techniques.
HeyuanMingong/attention-learn-to-route
Attention based model for learning to solve different routing problems
HeyuanMingong/CORRO
CORRO code
HeyuanMingong/ddpm2
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
HeyuanMingong/decision-diffuser
HeyuanMingong/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
HeyuanMingong/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
HeyuanMingong/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
HeyuanMingong/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
HeyuanMingong/GAN
PyTorch implementations of Generative Adversarial Networks.
HeyuanMingong/MTDiff
MTDiff
HeyuanMingong/pearl
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
HeyuanMingong/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
HeyuanMingong/SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
HeyuanMingong/varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)