Snake-RL

A snake that knows how to win

How to use

For training:

You can modify the model architecture in model.py and the model parameters in agent.py
Specify the trained model name in agent.py
Run

python agent.py train

For evaluating:

python agent.py eval

Actions (one-hot, relative to the snake's current heading):

[1,0,0] -> straight
[0,1,0] -> turn right
[0,0,1] -> turn left
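Because the three actions are relative to the current heading, the game has to translate them into an absolute direction before moving the snake. A minimal sketch of that translation follows. The direction names, the clockwise ordering, and the function name next_direction are assumptions made for illustration, not taken from agent.py.

```python
# Absolute headings in clockwise order, assuming screen coordinates
# (y grows downward). The names and ordering are illustrative assumptions.
CLOCKWISE = ["right", "down", "left", "up"]

def next_direction(current, action):
    """Map a one-hot relative action onto a new absolute heading."""
    idx = CLOCKWISE.index(current)
    if action == [1, 0, 0]:      # straight: keep the current heading
        return CLOCKWISE[idx]
    if action == [0, 1, 0]:      # turn right: one step clockwise
        return CLOCKWISE[(idx + 1) % 4]
    if action == [0, 0, 1]:      # turn left: one step counter-clockwise
        return CLOCKWISE[(idx - 1) % 4]
    raise ValueError(f"not a valid one-hot action: {action}")
```

For example, a snake heading up that receives [0,1,0] ends up heading right.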

Is danger close?

Which way is the snake facing?

Where is the food relative to the snake?
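One way the three questions above could be turned into an 11-value state is sketched below. The 3 + 4 + 4 split (danger straight/right/left, four heading flags, four food flags), the flag order, and the names build_state and is_collision are assumptions for illustration. The actual layout used in agent.py may differ.

```python
def build_state(head, food, direction, is_collision):
    """Return an 11-element 0/1 list: 3 danger, 4 heading, 4 food flags.

    head and food are (x, y) tuples; is_collision is a caller-supplied
    function taking an (x, y) point. All names here are hypothetical.
    """
    x, y = head
    # Unit step per heading in screen coordinates (y grows downward).
    step = {"right": (1, 0), "down": (0, 1), "left": (-1, 0), "up": (0, -1)}
    order = ["right", "down", "left", "up"]  # clockwise
    i = order.index(direction)
    ahead, right, left = order[i], order[(i + 1) % 4], order[(i - 1) % 4]

    def danger(d):
        dx, dy = step[d]
        return int(is_collision((x + dx, y + dy)))

    return [
        # Is danger close (straight, right, left of the heading)?
        danger(ahead), danger(right), danger(left),
        # Which way is the snake facing?
        int(direction == "left"), int(direction == "right"),
        int(direction == "up"), int(direction == "down"),
        # Where is the food relative to the head?
        int(food[0] < x), int(food[0] > x),
        int(food[1] < y), int(food[1] > y),
    ]
```

With a wall-only collision check on a 10x10 grid, a snake at (0, 5) heading up with food at (3, 2) has danger only on its left, so the first three flags are 0, 0, 1.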

State (11) --> Hidden (?) --> Action (3)
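The pipeline above is a small feed-forward network: 11 state values in, 3 action scores out. The sketch below shows one forward pass in plain Python so that it runs without any dependencies. HIDDEN_SIZE = 16, the ReLU activation, and the random weight range are placeholder assumptions, since the hidden size is left open above and the real model lives in model.py.

```python
import random

def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases for one dense layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def dense(x, layer):
    """Compute W @ x + b for a single input vector."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]

def forward(state, hidden_layer, output_layer):
    """State (11) -> Hidden (ReLU, assumed) -> one score per action (3)."""
    hidden = [max(0.0, h) for h in dense(state, hidden_layer)]
    return dense(hidden, output_layer)

HIDDEN_SIZE = 16  # hypothetical value, not taken from the project
rng = random.Random(0)
hidden_layer = init_layer(11, HIDDEN_SIZE, rng)
output_layer = init_layer(HIDDEN_SIZE, 3, rng)

state = [0, 0, 1, 0, 0, 1, 0, 0, 1, 1, 0]  # an example 11-value state
q_values = forward(state, hidden_layer, output_layer)

# The predicted action is the one-hot vector of the highest score.
best = q_values.index(max(q_values))
action = [int(i == best) for i in range(3)]
```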

Init with some Q values
Predict Action (or Random for exploration)
Perform Action
Measure Reward
Update the Q value and train with the following rules:
1. NewQ(S,a) = Q(S,a) + alpha * [R + gamma * max_a' Q(S',a') - Q(S,a)]
2. loss = (NewQ(S,a) - Q(S,a))^2
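The loop above can be sketched for a single transition as follows. choose_action covers the "predict or random" step with an epsilon-greedy rule, and q_update applies the two formulas as written. The function names, the epsilon parameter, and the done flag (which drops the future term when the game has ended) are assumptions for illustration, not the code in agent.py.

```python
import random

def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy: a random index with probability epsilon, else the argmax."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return q_values.index(max(q_values))

def q_update(q_sa, reward, next_q_values, alpha, gamma, done=False):
    """Return (new_q, loss) for one transition.

    new_q = Q(S,a) + alpha * [R + gamma * max_a' Q(S',a') - Q(S,a)]
    loss  = (new_q - Q(S,a))^2
    """
    # Assumption: no future reward is counted once the episode is over.
    future = 0.0 if done else gamma * max(next_q_values)
    new_q = q_sa + alpha * (reward + future - q_sa)
    loss = (new_q - q_sa) ** 2
    return new_q, loss

# Worked example with made-up numbers:
# target = 10 + 0.9 * 2.0 = 11.8, so new_q = 0.5 + 0.1 * (11.8 - 0.5) = 1.63
new_q, loss = q_update(q_sa=0.5, reward=10.0, next_q_values=[1.0, 2.0, 0.0],
                       alpha=0.1, gamma=0.9)
```

With epsilon set to 0 the agent always exploits its current estimate. A common choice is to start with a larger epsilon and reduce it as more games are played.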