Pinned Repositories
bandit-evaluator
Evaluate an index policy for a multi-armed bandit from a given initial state.
graphs
Implementation of some graph algorithms
kbsf
Implementation of the kernel-based reinforcement learning algorithm due to Barreto, Precup, and Pineau (2016).
leak-detection
mdp
Implementation of algorithms for MDPs and controlled discrete-time queues.
mdp-lp
LP formulations of MDPs in Pyomo
porteus
Randomly generate Markov chains using a technique due to Porteus (1981).
procon
Python implementation of the Markov decision process generator PROCON.
jefferh's Repositories
jefferh/kbsf
Implementation of the kernel-based reinforcement learning algorithm due to Barreto, Precup, and Pineau (2016).
jefferh/bandit-evaluator
Evaluate an index policy for a multi-armed bandit from a given initial state.
jefferh/graphs
Implementation of some graph algorithms
jefferh/leak-detection
jefferh/mdp
Implementation of algorithms for MDPs and controlled discrete-time queues.
jefferh/mdp-lp
LP formulations of MDPs in Pyomo
jefferh/porteus
Randomly generate Markov chains using a technique due to Porteus (1981).
jefferh/procon
Python implementation of the Markov decision process generator PROCON.