A PDF toolkit that runs in Node.js or in a Web Worker
- Import/export annotations
- Extract general PDF file information
- Extract full text
- Extract PDF file text structure that can be fed into recognizer-server
Build
Clone the repository:
git clone https://github.com/zotero/pdf-worker --recursive
Run npm run build and look at the examples.
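
The build steps above can be collected into one short script. This is a sketch, not an official procedure: the build_pdf_worker function name and the REPO_DIR variable are hypothetical, and the npm install step is an assumption (the steps above do not mention it, but a fresh clone usually needs its dependencies installed before a build script can run).

```shell
# Sketch of the build steps. REPO_DIR and build_pdf_worker are
# illustrative names, not part of the project.
REPO_URL="https://github.com/zotero/pdf-worker"
REPO_DIR="pdf-worker"   # directory name git derives from the URL by default

build_pdf_worker() {
  # --recursive also fetches the submodules the repository references
  git clone --recursive "$REPO_URL" "$REPO_DIR" &&
  cd "$REPO_DIR" &&
  # Assumption: dependencies have to be installed before building
  npm install &&
  npm run build
}

# Uncomment to run the steps (needs network access, git, and npm):
# build_pdf_worker
```

Chaining the commands with && stops at the first failing step, so a failed clone does not lead to npm running in the wrong directory.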