ryoju-ohata/marukoshi-deluxe-menu-texts

Analyze texts from pdf of Marukoshi's deluxe menu.

JavaScriptMIT

Marukoshi Deluxe Menu Texts

Get texts by date using image segmentation and OCR from PDF of Marukoshi's Deluxe Menu.

Converted example 2019-10

Converted example 2019-11

Installation

brew install imagemagick graphicsmagick ghostscript tesseract tesseract-lang
npm install

Conversion

Add "deluxe1.pdf" to "./resources" directory.
The following command will convert it.

npm start

"output.json" is created to "./resources" directory.

Raw

https://raw.githubusercontent.com/passionate-engineer/marukoshi-deluxe-menu-texts/master/json/201910.json

https://raw.githubusercontent.com/passionate-engineer/marukoshi-deluxe-menu-texts/master/json/201911.json