
Lesson 2 tutorial: Versioning Data and Model

ML REPA School course: Machine Learning experiments reproducibility and engineering with DVC

1. Clone this repository

git clone https://github.com/mlrepa/dvc-2-data-versioning.git
cd dvc-2-data-versioning

2. Create and activate a virtual environment

Create a virtual environment named dvc-venv (you may use another name):

python3 -m venv dvc-venv
source dvc-venv/bin/activate
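
To confirm that activation worked, a quick check along the following lines may help. This is a minimal sketch: `venv_status` is a hypothetical helper name, not part of the course materials, and it relies on the `VIRTUAL_ENV` variable that the venv `activate` script exports.

```shell
# venv_status: report whether a virtual environment is active in this shell.
# Assumption: the environment was activated via its `activate` script,
# which exports VIRTUAL_ENV with the path to the environment.
venv_status() {
    if [ -n "${VIRTUAL_ENV:-}" ]; then
        echo "active: $(basename "$VIRTUAL_ENV")"
    else
        echo "inactive"
    fi
}

venv_status
```

If this reports `inactive`, re-run the `source` command from the directory that contains dvc-venv.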

3. Install Python libraries

pip install -r requirements.txt
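
After installation, it can be useful to confirm that the command-line tools are reachable from the active environment. The sketch below assumes that requirements.txt provides the `dvc` and `jupyter` commands (an assumption, since the file's contents are not shown here); `check_tool` is a hypothetical helper name.

```shell
# check_tool NAME: report whether the command NAME can be found on PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: missing"
    fi
}

# Assumption: these two commands are installed by requirements.txt.
check_tool dvc
check_tool jupyter
```

A `missing` result usually means the virtual environment is not active or the install step did not complete.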

4. Add the virtual environment to Jupyter Notebook as a kernel

python -m ipykernel install --user --name=dvc-venv
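
To verify that the kernel was registered, the output of `jupyter kernelspec list` can be searched for the kernel name. The following is a sketch under two assumptions: that the command prints one kernel per line as a name followed by its path, and that the kernel name is dvc-venv as above. `has_kernel` is a hypothetical helper name.

```shell
# has_kernel NAME: read `jupyter kernelspec list` output on stdin and
# succeed if a line starts with the kernel name NAME.
has_kernel() {
    grep -Eq "^[[:space:]]*$1[[:space:]]"
}

if command -v jupyter >/dev/null 2>&1; then
    if jupyter kernelspec list | has_kernel dvc-venv; then
        echo "kernel dvc-venv is registered"
    else
        echo "kernel dvc-venv not found"
    fi
else
    echo "jupyter is not on PATH"
fi
```

If you chose a different name in the `--name` option, pass that name to `has_kernel` instead.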

5. Configure the table of contents (ToC) extension for Jupyter Notebook (optional)

jupyter contrib nbextension install --user
jupyter nbextension enable toc2/main

6. Start Jupyter Notebook, then open dvc-2-data-versioning.ipynb and follow its instructions:

jupyter notebook