ML Project DSTI

Installing environment

Open conda terminal
Change your directory to location of project : cd ...
Use this command : conda env create -f environment.yml
Now you can activate it : conda activate dsti_project_ml or use it in IDE (restart computer to see it as kernel in VSCode)

You can use requirements.txt to create env (with venv or poetry)

The data folder contains:

OpenLibrary data is retrieved through the following notebooks:

This part regroups:

and is achieved through data_preprocessing.ipynb.

The preprocessed data is then saved into books_preprocessed.csv inside the data_preprocessed folder.