/pdf2html-http-parser

This tool allows to parse pdf into html files via http requests to gevent server with WSGI.

Primary LanguagePythonOtherNOASSERTION

PDF2HTML Service

Transforms pdf into html files via http request handled by gevent server.

Setup

  1. Setup virtual env
python3 -m venv pdf2html
  1. Setup dependencies
pip install -r requirements.txt

Run it

  1. Run server
python pdf_2_html_service
  1. Build docker image
docker build -t pdf2html .
  1. Run docker container
docker run -itd -p 9088:9088 pdf2html

Usage

  1. Test with curl
curl --form file='@/path/to/your/pdf' http://0.0.0.0:9088/parse