Welcome to the RL Fundamentals course! This repository contains all the materials, assignments, and examples you'll need to master Reinforcement Learning concepts.
- assignments/
- Bandits: bandits.md
- MDPs: mdps.md
- Dynamic Programming: dynamic_programming.md
- Monte Carlo: monte_carlo.md
- Temporal Difference: temporal_difference.md
- Planning: planning.md
- rl/: Contains the core reinforcement learning algorithms and environment implementations.
- exercises/: Additional exercises to reinforce learning.
- utils/: Utility scripts and helper functions.
Topic | K-Armed Bandits | Analytic solutions to MDPs | Dynamic Programming |
---|---|---|---|
Algorithms | Investigate epsilon-greedy and optimistic initial value solutions. | Expressing Bellman equations as simultaneous equations; using np.linalg to solve for state values |
Implement policy iteration and value iteration |
Environments | The "KAB Testbed" (Sutton & Barto) | Frozen Lake | Jack's Car Rental (Sutton & Barto) |
Topic | Monte Carlo | Temporal Difference | Planning |
---|---|---|---|
Algorithms | Implement first-visit MC prediction; MC control (exploring starts, on-policy, off-policy) | Implement Sarsa, Q-learning and Expected Sarsa | Implement Dyna-Q and Dyna-Q+ |
Environments | Blackjack | Cliff Walking | Planning Mazes (Sutton & Barto) |
-
Clone the Repository:
git clone https://github.com/<yourusername>/rl-fundamentals-assignments.git
-
Navigate to the Repository:
cd rl-fundamentals-assignments
-
Create a Virtual Environment:
Make sure you have Python installed. Then, create a virtual environment:
python -m venv venv
-
Activate the Virtual Environment:
-
On Windows:
venv\Scripts\activate
-
On macOS/Linux:
source venv/bin/activate
-
-
Install the Required Packages:
Once the virtual environment is activated, run:
pip install -r requirements.txt
-
Add the Project Directory to PYTHONPATH:
If you encounter an error like ModuleNotFoundError: No module named 'rl', it is likely that the rl folder is not part of your Python path. To resolve this, you can add the project directory to your Python path:
-
Temporary Solution:
Windows:
Open a terminal in the main directory of the course project (where the rl folder is located).
Run the following command:
set PYTHONPATH=%PYTHONPATH%;%CD%
Mac/Linux:
In a terminal, navigate to the project's main directory (where the rl folder is located).
Run the following command:
export PYTHONPATH="${PYTHONPATH}:$(pwd)"
-
Permanent Solution:
Windows: Add the directory path to your system environment variables in "System Properties" -> "Environment Variables".
Mac/Linux: Add the export
PYTHONPATH="${PYTHONPATH}:$(pwd)"
line to your shell configuration file (e.g., .bashrc, .zshrc). This will ensure that the path is set every time you open a terminal.
-
-
Explore Assignments:
Each assignment has its own markdown file within the
assignments/
directory. Navigate to the desired assignment folder and follow the guidelines to complete the tasks.
-
Books:
- The Bible! Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction (Second Edition).
- Most of the environments encountered in this course originate from this amazing and seminal book (e.g. Jack's Car Hire), and it has informed luminaries from Demis Hassabis and David Silver at DeepMind to Andrew Ng and Yann LeCun.
- If you are enjoying this course, I would encourage you to buy Reinforcement Learning: An Introduction as it is well worth the investment.
-
Documentation:
For any questions or issues, please contact Tom Walker.