The homework for Cinnamon AI Bootcamp Week 1. The project is a OCR Script that can read all text lines from documents
- Convert [
.png
,.tiff
,.pdf
,.docx
,.heic
] to images. - Extract text from images using OCR.
- (Optional) Upload result to cloud system.
- Linux, MacOS, Windows (WSL):
curl -sSL https://install.python-poetry.org | python3 -
- Windows: create
venv
, thenpip install poetry
- Use
poetry install
-
For MacOS:
brew install poppler
-
For Ubuntu/Linux:
sudo apt update
sudo apt install poppler-utils
-
For Windows: https://poppler.freedesktop.org
- To run the script:
poetry run python your_script.py
- To run the tests:
poetry run pytest
- Add new dependency / install new library:
poetry add <package-name>
- Update dependencies and
poetry.lock
file:poetry update <package-name>
- Remove dependencies:
poetry remove <package-name>
- Activate the environment:
poetry shell
- Deactivate the environement:
exit
- To learn more, go to this link