tianxusky
I am Tian Xu. I am currently a CS Ph.D. student at Nanjing University in China. Research interest: reinforcement learning.
Nanjing UniversityNanjing
Pinned Repositories
123
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
CS234
homework for CS234 2017
CS234-1
My Solution to Assignments of CS234
dice_rl
google-research
Google AI Research
hello-world
Interview
Interview = 简历指南 + LeetCode + Kaggle
Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
tianxusky's Repositories
tianxusky/Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
tianxusky/123
tianxusky/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
tianxusky/CS234
homework for CS234 2017
tianxusky/CS234-1
My Solution to Assignments of CS234
tianxusky/dice_rl
tianxusky/google-research
Google AI Research
tianxusky/hello-world
tianxusky/Interview
Interview = 简历指南 + LeetCode + Kaggle
tianxusky/Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
tianxusky/mazelab
A customizable framework to create maze and gridworld environments
tianxusky/offline_IL
tianxusky/policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
tianxusky/ppo-dice
We propose a new way to make policy optimization more stable.
tianxusky/probabilitydistributiontoolbox
Folklore facts on probability distribution learning, testing, and whatever-ing
tianxusky/tabular-ail
tianxusky/tensorflow_tutorials
From the basics to slightly more interesting applications of Tensorflow