daihuiao

student of Tianjin University@Tianjin University

Tianjin University

Pinned Repositories

aricraft_all
Language:Python00
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python00
BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
Language:Python00
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
Language:Python00
CORL_tobedeleted
Language:Python00
CQL
Code for conservative Q-learning
Language:Python00
CQL-1
Conservative Q Learning on top of SAC
Language:Python00
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python1 0 00
openreview_summarizereviews_2024
Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview
Language:Python30
stable-diffusion
Language:Jupyter Notebook1 0 00

daihuiao's Repositories

daihuiao/openreview_summarizereviews_2024
Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview
Language:Python30
daihuiao/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python1 0 00
daihuiao/stable-diffusion
Language:Jupyter Notebook1 0 00
daihuiao/aricraft_all
Language:Python00
daihuiao/BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
Language:Python00
daihuiao/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
Language:Python00
daihuiao/CORL_tobedeleted
Language:Python00
daihuiao/CQL-1
Conservative Q Learning on top of SAC
Language:Python00
daihuiao/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python0 0 00
daihuiao/deep_rl_zoo
A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
daihuiao/Diffusion-Policies-for-Offline-RL
Language:Python
daihuiao/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python
daihuiao/ElegantRL_diffusion_policy
Cloud-native Deep Reinforcement Learning. 🔥
Language:Python
daihuiao/FederalLearning
通过阅读Communication-Efficient Learning of Deep Networks from Decentralized Data与Robust and Communication-Efficient Federated Learning from Non-IID Data两篇论文，复现FedAvg与STC算法，完成LSTM模型+ Shakespeare数据集的字符预测任务
daihuiao/Federated-Learning-PyTorch
Handy PyTorch implementation of a federated learning (especially for painless research)
daihuiao/gym_dockauv
Gym Environment for AUV docking procedure
Language:Python1
daihuiao/inac_pytorch
daihuiao/iris
Transformers are Sample Efficient World Models
Language:Python
daihuiao/Machine-Learning-In-Numpy
纯python实现机器学习算法,非套用sk-learn
daihuiao/maddpg_uncertainty
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
daihuiao/MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
daihuiao/mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
Language:Python
daihuiao/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
daihuiao/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
daihuiao/OfflineRL
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
daihuiao/OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
daihuiao/on-policy_uncertainty
This is the official implementation of Multi-Agent PPO (MAPPO).
daihuiao/online-dt
Online Decision Transformer
Language:Python
daihuiao/paper5
Language:Python
daihuiao/pytorch-
Tensors and Dynamic neural networks in Python with strong GPU acceleration