louieworth

stay hungry, stay healthy.

Tsinghua University · Shenzhen, China

louieworth's Stars

changgyhub/leetcode_101
LeetCode 101: A LeetCode Problem-Solving Guide
8.8k 147 901.2k
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language: Python · 6.8k 43 881.1k
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
Language: TeX · 4.7k 88 6731.1k
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python · 3.8k 52 263855
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language: Python · 2.4k 42 656326
acm-clan/algorithm-stone
ACM/LeetCode algorithm competition roadmap: the most complete algorithm learning map!
Language: C++ · 1.9k 25 7588
FenTechSolutions/CausalDiscoveryToolbox
Package for causal inference in graphs and in the pairwise settings. Tools for graph structure recovery and dependencies are included.
Language: Python · 1.1k 37 143200
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python · 1.1k 17 28136
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language: Python · 963 40 105132
YixinChen-AI/CVAE-GAN-zoos-PyTorch-Beginner
For beginners, this is a good starting point for VAEs, GANs, and CVAE-GAN. It contains AE, DAE, VAE, GAN, CGAN, DCGAN, WGAN, WGAN-GP, VAE-GAN, and CVAE-GAN, all implemented in PyTorch.
Language: Python · 726 2 12110
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language: Jupyter Notebook · 649 13 870
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
Language: Jupyter Notebook · 545 13 082
fulifeng/Causal_Reading_Group
We will keep updating the paper list about machine learning + causal theory. We also internally discuss related papers between NExT++ (NUS) and LDS (USTC) by week.
508 35 478
socialfoundations/whynot
A Python sandbox for decision making in dynamics
Language: Python · 418 44 1043
panxl6/cc150
Cracking the Coding Interview (cc150)
Language: Jupyter Notebook · 417 11 086
2019ChenGong/RL-Paper-notes
298 4 029
gxywy/rl-plotter
A plotter for reinforcement learning (RL)
Language: Python · 214 1 430
AIR-DISCOVER/VIBUS
Language: Python · 155 1 03
OPEN-AIR-SUN/SISC
Semi-supervised Implicit Scene Completion from Sparse LiDAR
Language: Python · 118 3 411
deligentfool/dqn_zoo
Implementations of various DQN reinforcement learning algorithms in PyTorch
Language: Python · 92 2 122
volotat/ARC-Game
The Abstraction and Reasoning Corpus made into a web game
Language: JavaScript · 86 1 86
OPEN-AIR-SUN/Cerberus
Language: Python · 64 3 17
d3sm0/gym_pomdp
Gym-like extensions for POMDP
Language: Python · 56 5 315
ryanxhr/POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Language: Python · 56 3 27
CausalRL/DRL
Deconfounding Reinforcement Learning in Observational Settings
Language: Python · 48 2 311
Facebear-ljx/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
Language: Python · 43 1 02
rik-helwegen/CEVAE_pytorch
Language: Python · 40 1 521
ryanxhr/DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Language: Python · 34 1 12
Facebear-ljx/SBAC
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
Language: Python · 11 1 03
jakegrigsby/cc-afbc
Advantage-Filtered Behavioral Cloning for Offline Continuous Control
Language: Python · 3 2 01
