Q_learning_template: A Python repository from tjards

Q-Learning template

A simple template for implementing Q-Learning using the classical update equation:

Q(state, action) = Q(state, action) + learn_rate * (reward + discount_factor * np.max(Q(state', :)) — Q(state, action))