Pinned Repositories
aricraft_all
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
CORL_tobedeleted
CQL
Code for conservative Q-learning
CQL-1
Conservative Q Learning on top of SAC
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
openreview_summarizereviews_2024
Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview
stable-diffusion
daihuiao's Repositories
daihuiao/openreview_summarizereviews_2024
Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview
daihuiao/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
daihuiao/stable-diffusion
daihuiao/aricraft_all
daihuiao/BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
daihuiao/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
daihuiao/CORL_tobedeleted
daihuiao/CQL-1
Conservative Q Learning on top of SAC
daihuiao/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
daihuiao/deep_rl_zoo
A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
daihuiao/Diffusion-Policies-for-Offline-RL
daihuiao/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
daihuiao/ElegantRL_diffusion_policy
Cloud-native Deep Reinforcement Learning. 🔥
daihuiao/FederalLearning
通过阅读Communication-Efficient Learning of Deep Networks from Decentralized Data与Robust and Communication-Efficient Federated Learning from Non-IID Data两篇论文,复现FedAvg与STC算法,完成LSTM模型+ Shakespeare数据集的字符预测任务
daihuiao/Federated-Learning-PyTorch
Handy PyTorch implementation of a federated learning (especially for painless research)
daihuiao/gym_dockauv
Gym Environment for AUV docking procedure
daihuiao/inac_pytorch
daihuiao/iris
Transformers are Sample Efficient World Models
daihuiao/Machine-Learning-In-Numpy
纯python实现机器学习算法,非套用sk-learn
daihuiao/maddpg_uncertainty
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
daihuiao/MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
daihuiao/mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
daihuiao/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
daihuiao/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
daihuiao/OfflineRL
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
daihuiao/OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
daihuiao/on-policy_uncertainty
This is the official implementation of Multi-Agent PPO (MAPPO).
daihuiao/online-dt
Online Decision Transformer
daihuiao/paper5
daihuiao/pytorch-
Tensors and Dynamic neural networks in Python with strong GPU acceleration