Pinned Repositories
A-Variant-of-XGbooA-Variant-of-XGboost-with-Dynamic-Factorsst-with-Dynamic-Factors
The project paper of AI Class.
Adversarial-Policy-Imitation-with-LFA
Code for the ICML 2022 paper: Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation
An-Overview-Analysis-of-LASSO
Code-Reproduction-For-Paper
Code reproduction
Implementation-from-scratch-of-CNN-and-DQN
In this project, I build neural networks nearly from scratch for two different types of Atari games (one is 1D, the other is 2D).
MEX
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
RAFA_code
RL-for-Markov-Exchange-Economy
Code for the ICML 2022 paper: *Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy*.
RL_KIT
YSLIU627's Repositories
YSLIU627/RL-for-Markov-Exchange-Economy
Code for the ICML 2022 paper: *Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy*.
YSLIU627/Adversarial-Policy-Imitation-with-LFA
Code for the ICML 2022 paper: Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation
YSLIU627/RL_KIT
YSLIU627/Code-Reproduction-For-Paper
Code reproduction
YSLIU627/Implementation-from-scratch-of-CNN-and-DQN
In this project, I build neural networks nearly from scratch for two different types of Atari games (one is 1D, the other is 2D).
YSLIU627/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
YSLIU627/A-Variant-of-XGbooA-Variant-of-XGboost-with-Dynamic-Factorsst-with-Dynamic-Factors
The project paper of AI Class.
YSLIU627/An-Overview-Analysis-of-LASSO
YSLIU627/MEX
YSLIU627/RAFA_code
YSLIU627/code-reproduction
YSLIU627/d4rl
A benchmark for offline reinforcement learning.
YSLIU627/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
YSLIU627/ESPD
[arXiv] Evolutionary Stochastic Policy Distillation
YSLIU627/FinRL
FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
YSLIU627/gcsl
Code for "Learning to Reach Goals via Iterated Supervised Learning"
YSLIU627/impact-driven-exploration
impact-driven-exploration
YSLIU627/L0regularzation
YSLIU627/lm-evaluation-harness
A framework for few-shot evaluation of language models.
YSLIU627/ML-project-2020
The final project on reinforcement learning
YSLIU627/open-instruct
YSLIU627/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
YSLIU627/Rebuttal-for-RAFA
YSLIU627/Rebuttal-for-RAFA_theory
YSLIU627/rlkit
Collection of reinforcement learning algorithms
YSLIU627/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
YSLIU627/Time-Series-Project