- Python 3
- Scipy
- OpenCV >= 4
- Tesseract (see below)
- Install
tesseract
4.0 - Download
jpn_vert.traineddata
here - Copy
jpn_vert.traineddata
in/usr/share/tessdata
- Check with
tesseract --list-langs
thatjpn_vert
correctly appears
sudo apt-get install -y tesseract-ocr tesseract-ocr-jpn-vert
python3 -m venv env
source env/bin/activate
# Upgrade pip per opencv-python FAQ
python -m pip install --upgrade pip
pip install -r requirements.txt
./main.py examples/sample_page.jpg
Followed by:
./main.py examples/sample_page.jpg --ocr