Get texts by date using image segmentation and OCR from PDF of Marukoshi's Deluxe Menu.
brew install imagemagick graphicsmagick ghostscript tesseract tesseract-lang
npm install
- Add "deluxe1.pdf" to "./resources" directory.
- The following command will convert it.
npm start
- "output.json" is created to "./resources" directory.