tianxusky

I am Tian Xu, currently a CS Ph.D. student at Nanjing University in China. My research interest is reinforcement learning.

Nanjing University, Nanjing

Pinned Repositories

123
Language: Python
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python
Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
Language: Python
CS234
Homework for CS234 (2017)
Language: Python
CS234-1
My solutions to the CS234 assignments
Language: Python
dice_rl
Language: Python
google-research
Google AI Research
Language: Jupyter Notebook
hello-world
Interview
Interview = Resume Guide + LeetCode + Kaggle
Language: Jupyter Notebook
Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
Language: Python

tianxusky/mazelab
A customizable framework to create maze and gridworld environments
Language: Python
tianxusky/offline_IL
tianxusky/policy_optimization
Code for the paper "Policy Optimization in RLHF: The Impact of Out-of-preference Data"
Language: Python
tianxusky/ppo-dice
We propose a new way to make policy optimization more stable.
Language: Python
tianxusky/probabilitydistributiontoolbox
Folklore facts on probability distribution learning, testing, and whatever-ing
Language: TeX
tianxusky/tabular-ail
Language: Python
tianxusky/tensorflow_tutorials
From the basics to slightly more interesting applications of TensorFlow
Language: Jupyter Notebook