/Q_learning_template

(Ready). A template for implementing Q-Learning.

Primary LanguagePython

Q-Learning template

A simple template for implementing Q-Learning using the classical update equation:

Q(state, action) = Q(state, action) + learn_rate * (reward + discount_factor * np.max(Q(state', :)) — Q(state, action))