louieworth's Stars
changgyhub/leetcode_101
LeetCode 101:力扣刷题指南
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
acm-clan/algorithm-stone
ACM/LeetCode算法竞赛路线图,最全的算法学习地图!
FenTechSolutions/CausalDiscoveryToolbox
Package for causal inference in graphs and in the pairwise settings. Tools for graph structure recovery and dependencies are included.
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
YixinChen-AI/CVAE-GAN-zoos-PyTorch-Beginner
For beginner, this will be the best start for VAEs, GANs, and CVAE-GAN. This contains AE, DAE, VAE, GAN, CGAN, DCGAN, WGAN, WGAN-GP, VAE-GAN, CVAE-GAN. All use PyTorch.
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
fulifeng/Causal_Reading_Group
We will keep updating the paper list about machine learning + causal theory. We also internally discuss related papers between NExT++ (NUS) and LDS (USTC) by week.
socialfoundations/whynot
A Python sandbox for decision making in dynamics
panxl6/cc150
《程序员面试金典》(cc150)
2019ChenGong/RL-Paper-notes
gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
AIR-DISCOVER/VIBUS
OPEN-AIR-SUN/SISC
Semi-supervised Implicit Scene Completion from Sparse LiDAR
deligentfool/dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
volotat/ARC-Game
The Abstraction and Reasoning Corpus made into a web game
OPEN-AIR-SUN/Cerberus
d3sm0/gym_pomdp
Gym-like extensions for POMDP
ryanxhr/POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
CausalRL/DRL
Deconfounding Reinforcement Learning in Observational Settings
Facebear-ljx/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
rik-helwegen/CEVAE_pytorch
ryanxhr/DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Facebear-ljx/SBAC
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
jakegrigsby/cc-afbc
Advantage-Filtered Behavioral Cloning for Offline Continuous Control