mew-two-github/CS6700-Project
Implementations of REINFORCE for the OpenAI Gym Acrobot environment, epsilon-greedy Q-learning for the OpenAI Gym Taxi environment, and TD(0) for a custom game-show environment, KBC.
Python
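
As a rough illustration of the epsilon-greedy Q-learning component named in the description, here is a minimal tabular sketch. It is not taken from the repository: the function names `epsilon_greedy` and `q_learning_update`, the list-of-lists Q-table, and the default `alpha` and `gamma` values are all assumptions chosen for the example.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise a greedy one.

    q_row is the list of Q-values for the current state (hypothetical layout).
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    # Greedy choice; ties resolve to the lowest action index.
    return q_row.index(max(q_row))


def q_learning_update(q, state, action, reward, next_state, done,
                      alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning step and return the updated Q(s, a).

    The target bootstraps from the best next-state value unless the
    episode has ended, in which case it is just the reward.
    """
    target = reward if done else reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])
    return q[state][action]


# Tiny usage example on a made-up 3-state, 2-action table.
q = [[0.0, 0.0] for _ in range(3)]
rng = random.Random(0)
action = epsilon_greedy(q[0], epsilon=0.1, rng=rng)
q_learning_update(q, state=0, action=action, reward=1.0,
                  next_state=1, done=False)
```

In an environment such as Taxi, the same update would be applied once per step inside the episode loop, with `epsilon` typically decayed over training.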