/pdf-to-text

PDF OCR in pure JavaScript, using the tesseract.js API


PDF-TO-TEXT

PDF-to-Text is a mobile-ready OCR tool, written in pure JavaScript on top of the tesseract.js API, that converts image-based text in PDF files to plain text.
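
The conversion described above can be pictured as a per-page loop: each PDF page is rendered to an image, the image is passed to the OCR engine, and the recognized text is joined. The sketch below is only an illustration of that flow, not the project's actual code. The names `pagesToText`, `pageImages` and `recognize` are hypothetical; `recognize` stands in for whatever function wraps the tesseract.js call.

```javascript
// Sketch only: `recognize` is a hypothetical callback, not part of tesseract.js.
// It receives one rendered page image and resolves to that page's text.
async function pagesToText(pageImages, recognize) {
  const texts = [];
  // Process pages one at a time to keep memory use low on mobile devices.
  for (const image of pageImages) {
    const text = await recognize(image);
    texts.push(String(text).trim());
  }
  // Separate the pages with a blank line.
  return texts.join('\n\n');
}
```

In the browser, `recognize` could wrap `Tesseract.recognize(image, 'eng')`. The shape of the result object differs between tesseract.js versions, so check the version bundled with this project before relying on a particular field.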

Tech

PDF-to-Text uses a number of open source projects to work properly:

  • [JavaScript] - awesome!
  • [HTML] - HTML enhanced for web apps!
  • [CSS] - Fence!
  • [Magic] - that's nice!

Installation

PDF-to-Text requires Node.js v4+ or any other server environment to run.

Install http-server globally, then start it from the project folder.

$ npm install http-server -g
$ cd pdf-to-text-master
$ http-server