
The future is independent of the past, given the present.

Primary language: Python. License: MIT.

Rock Paper Scissors

This is an AI simulation that plays Rock Paper Scissors (RPS). It was inspired by a Kaggle competition to create an RPS-playing agent capable of consistently beating humans. The model is implemented using Q-learning, a value-based reinforcement learning algorithm that updates the value function using the Bellman equation. The entries of the Q-table are filled in using the Markov property of the MDP (Markov Decision Process): the next state depends only on the current state and the action taken, not on earlier history.
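The Q-learning update described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the choice of state (the opponent's previous move), the `cycling_opponent` helper, and the values of `alpha`, `gamma`, `epsilon`, and `episodes` are all assumptions made for the example.

```python
import random

ACTIONS = ["rock", "paper", "scissors"]
BEATS = {"rock": "scissors", "paper": "rock", "scissors": "paper"}  # key beats value
BEATEN_BY = {loser: winner for winner, loser in BEATS.items()}


def reward(agent_move, opponent_move):
    """Return +1 for a win, -1 for a loss, 0 for a draw."""
    if agent_move == opponent_move:
        return 0
    return 1 if BEATS[agent_move] == opponent_move else -1


def q_update(q, state, action, r, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step based on the Bellman equation:
    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))."""
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (r + gamma * best_next - q[state][action])


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[state][a])


def cycling_opponent(previous_move):
    """Hypothetical opponent: always plays the move that beats its last move."""
    return BEATEN_BY[previous_move]


def train(opponent, episodes=20000, epsilon=0.2, seed=0):
    """Learn a Q-table where the state is the opponent's previous move (assumed)."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in ACTIONS}
    state = rng.choice(ACTIONS)
    for _ in range(episodes):
        action = choose_action(q, state, epsilon, rng)
        opponent_move = opponent(state)
        r = reward(action, opponent_move)
        # Markov property: the next state depends only on the current round.
        next_state = opponent_move
        q_update(q, state, action, r, next_state)
        state = next_state
    return q
```

Calling `train(cycling_opponent)` and taking the highest-valued action in each row of the returned table gives the learned greedy policy. Against a real human the state would likely need to capture more than a single previous move, so treat this only as a picture of how the table gets filled.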

References