mewmiyu/MDP_HMM_Solvers
A comparison of RL algorithms (bandit, Q-learning, etc.) with similar algorithms that use inference, with Psi-Auto as an algorithm that automatically tunes the inverse temperature.
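For readers unfamiliar with the term, the inverse temperature is commonly the parameter that scales action values in a softmax (Boltzmann) policy. The sketch below illustrates only that general role; it is not taken from this repository and does not show how Psi-Auto tunes the parameter. The function name `softmax_policy` and the parameter name `beta` are assumptions made for illustration.

```python
import numpy as np

def softmax_policy(q_values, beta):
    """Action probabilities for a softmax policy (illustrative helper, not from the repo).

    beta is the inverse temperature: beta = 0 gives a uniform policy,
    and a large beta concentrates probability on the highest-valued action.
    """
    q = np.asarray(q_values, dtype=float)
    # Subtract the maximum before exponentiating for numerical stability.
    logits = beta * (q - q.max())
    weights = np.exp(logits)
    return weights / weights.sum()

q = [1.0, 2.0, 3.0]
p_low = softmax_policy(q, beta=0.0)    # pure exploration: all actions equally likely
p_high = softmax_policy(q, beta=50.0)  # close to greedy: favors the last action
```

A fixed `beta` has to be chosen by hand in a sketch like this, which is the kind of manual choice an automatic tuning scheme would replace.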
Language: Python
License: NOASSERTION