Sparse Reward Grid: An AI(/RL) puzzle