/MDP_HMM_Solvers

Comparison of RL algorithms (bandits, Q-learning, etc.) with similar algorithms that use inference. Includes Psi-Auto, an algorithm that automatically tunes the inverse temperature.
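
For readers unfamiliar with the term, the inverse temperature is the parameter that sets how greedily a softmax (Boltzmann) policy follows its value estimates. The sketch below only illustrates that role. It is not taken from this repository and is not the Psi-Auto method; the function name `softmax_policy` and the example values are hypothetical.

```python
import numpy as np

def softmax_policy(q_values, beta):
    """Return Boltzmann action probabilities for inverse temperature `beta`.

    Hypothetical helper for illustration; not part of this repository.
    """
    q = np.asarray(q_values, dtype=float)
    # Subtracting the max keeps exp() numerically stable and
    # leaves the resulting probabilities unchanged.
    logits = beta * (q - q.max())
    weights = np.exp(logits)
    return weights / weights.sum()

q = [1.0, 0.5, 0.0]                   # made-up action-value estimates
print(softmax_policy(q, beta=0.0))    # beta = 0: uniform, pure exploration
print(softmax_policy(q, beta=10.0))   # large beta: close to greedy
```

A small value of `beta` spreads probability evenly across actions, and a large value concentrates it on the highest-valued action. This is why choosing it by hand is a tuning problem that an automatic scheme would aim to remove.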

Primary language: Python
License: Other (NOASSERTION)
