Teaching an agent to play the Lunar Lander game from OpenAI Gym with the REINFORCE algorithm.
Primary language: Jupyter Notebook
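
The notebook itself is not shown here, so the following is only a minimal sketch of the core REINFORCE computation, not the repository's implementation. It assumes one finished episode whose per-step rewards and action log-probabilities have already been collected; the function names `discounted_returns` and `reinforce_loss`, the discount factor values, and the toy numbers are all hypothetical. The environment and the policy network are deliberately left out.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, rewards, gamma=0.99):
    """Surrogate loss -sum_t log pi(a_t | s_t) * G_t.

    Minimizing this with respect to the policy parameters follows the
    REINFORCE policy-gradient estimate for a single episode.
    """
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


# Toy episode (made-up numbers, not Lunar Lander data).
toy_rewards = [1.0, 0.0, 2.0]
toy_log_probs = [math.log(0.5), math.log(0.25), math.log(0.5)]

print(discounted_returns(toy_rewards, gamma=0.5))  # [1.5, 1.0, 2.0]
print(reinforce_loss(toy_log_probs, toy_rewards, gamma=0.5))
```

In a full training loop, the log-probabilities would come from the policy's output for the actions actually taken in the environment, and an automatic-differentiation framework would backpropagate through this loss once per episode.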