aijunbai
Artificial Intelligence, Decision-Theoretical Planning, Reinforcement Learning, Deep Learning
UC Berkeley, San Francisco Bay Area
Pinned Repositories
bandit
Algorithms for multi-armed bandit (MAB) problems
hplanning
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
keepaway
Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway
markov-game
Stochastic Markov Games
pfs
A Particle Filtering over Sets Approach to Multi-Object Tracking
quadrotor_openrave
OpenRAVE based Quadrotor/Quadcopter Simulator with Task/Motion Planning
reversi
A client/server (C/S) framework for the game of Reversi with a well-developed AI player
taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
uct
UCT with different parallelization implementations
aijunbai's Repositories
aijunbai/taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
aijunbai/pfs
A Particle Filtering over Sets Approach to Multi-Object Tracking
aijunbai/thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
aijunbai/markov-game
Stochastic Markov Games
aijunbai/hplanning
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
aijunbai/keepaway
Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway
aijunbai/uct
UCT with different parallelization implementations
aijunbai/quadrotor_openrave
OpenRAVE based Quadrotor/Quadcopter Simulator with Task/Motion Planning
aijunbai/reversi
A client/server (C/S) framework for the game of Reversi with a well-developed AI player
aijunbai/bandit
Algorithms for multi-armed bandit (MAB) problems
aijunbai/rcg_player
RCG format log file player
aijunbai/aijunbai.github.io
Homepage
aijunbai/bert
TensorFlow code and pre-trained models for BERT
aijunbai/mono-vo
An OpenCV based implementation of Monocular Visual Odometry
aijunbai/notebooks
IPython Notebooks
aijunbai/pole
Reinforcement Learning algorithms for an inverted pendulum with a cart
aijunbai/programmable-reinforcement-learning
Reinforcement learning algorithms constrained by a partial program
aijunbai/quadrotor_moveit
Quadrotor/Quadcopter Motion Planning using MoveIt!
aijunbai/skipoominijool
A Compiler Front End for a Subset of the Java Language