Pinned Repositories
offline_bpr
Official implementation of Behavior Prior Representation learning for Offline Reinforcement Learning
SimSR
AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
a3c-tetris
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
DeepRL
【深度强化学习社区】一个资料与学习内容最全的服务平台
ItChat
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
Offline_Bisimulation
Official implementation of Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
RL_note
zanghyu's Repositories
zanghyu/DeepRL
【深度强化学习社区】一个资料与学习内容最全的服务平台
zanghyu/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
zanghyu/Offline_Bisimulation
Official implementation of Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
zanghyu/RL_note
zanghyu/RLcode
zanghyu/resources
some resources about RL, ML
zanghyu/zanghyu.github.io
Github Pages for academic personal websites
zanghyu/blog
Everything about database,bussiness.(Most for PostgreSQL).
zanghyu/DGI
Add data preprocessing script in this code.
zanghyu/dqn-tetris
zanghyu/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
zanghyu/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
zanghyu/guacamol_baselines
Baselines models for GuacaMol benchmarks
zanghyu/ImageReward
ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
zanghyu/K-FAC-example
zanghyu/Love-1
zanghyu/nn_builder
Build neural networks with less boilerplate code
zanghyu/offline_rl_envs
Implementations of Gridworld, Modelwin, and Modelfail to experiment with offline RL
zanghyu/ORGANIC
Code repo for optimizing distributions of molecules.
zanghyu/python_plot
zanghyu/python_trick
zanghyu/pytorch-seq2seq
An open source framework for seq2seq models in PyTorch.
zanghyu/query_phone_number
手机号码归属地查询
zanghyu/RL100questions
QA about reinforcement learning
zanghyu/seqmnist
zanghyu/stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
zanghyu/tf_project_templete
This is a simple templete of tensorflow project
zanghyu/TMAP
TMAP: Integrating Trust Region and Maximum Entropy with Augmented Bellman Equation for Policy Optimization
zanghyu/toolkit
this is a python toolkit for personal use
zanghyu/visualkit
visualization for rl