Mining massive datasets

Homework and course project for Mining massive datasets subject in Ukrainian Catholic University.

Prerequisites

First launch

Create virtual environment.

python3 -m venv .venv 

Activate virtual environment.

source .venv/bin/activate

Install dependencies

pip install requirements.txt

Create jupyter kernel, credit

ipython kernel install --user --name=statistics-ucu

Other launches

source .venv/bin/activate

HOWTO

Save Colab notebook to github

  1. (Once) Tools -> Settings -> GitHub -> Access private repositories and organizations
  2. File -> Save a copy in GitHub. Choose repo, branch, file path, commit message. ATTENTION: be careful with paths. If the path will be different from previously saved, it will create new file.