This project is part of a two-day practical workshop on Digital Image Processing, focusing on OCR (Optical Character Recognition) with an emphasis on MLOps practices using DVC (Data Version Control) and Guild AI. The workshop covers setting up an OCR pipeline, versioning datasets, running and tracking experiments, hyperparameter tuning, and analyzing results.
Before starting, ensure you have the following installed:
- Python 3.11
- pip
- Git
Install required Python packages:
pip install -r requirements
Verify installations:
dvc doctor