This project predicts the best features of wines that relate to wine quality. This MLOps pipeline uses a self-managed Digital Ocean Github action-runner, Data Version Control to manage data changes across pipeline stages, and Continuous Machine Learning (CML) to support continual integration metrics analysis of models across stages, experiments, and branches.
- On windows, WSL2, install using the WSL2 instructions, I've got the Ubuntu WSL2 environment installed.
- Python3.x and Pip3 are installed, I have a note in here to alias pip to pip3.
- Docker
- VS Code
Modelling a Kaggle dataset of red wine properties and quality ratings.
Original project https://github.com/andronovhopf/wine
Edit this to the end of ~/.bashrc, save.
alias pip="pip3"
source ~/.bashrc
pip install virtualenv
virtualenv --version
mkdir ~/.virtualenvs
virtualenv ~/.virtualenvs/PredictWindFeatures
source ~/.virtualenvs/PredictWindFeatures/bin/activate
python3 -m virtualenv .virtualenv
A dialog window will appear in the bottom right corner of VSCode saying the new virtual folder is detected, press "Yes" to use it. VS Code, install the extension, SonarLint, it will require JRE, let it download and install it. SonarLint will ask to restart VSCode, click "Restart Now".
Now you can run pip installs and they are only installed for your project.
Install from a requirements.txt document with:
pip install -r requirements.txt
You can also run "pip freeze" to output the requirements for your python project or to pipe into a requirements.txt to freeze dependency versions.
pip freeze