/pdf2docx

Primary LanguagePython

PDF to DOCX Converter Web App 🚀

Hey there! This is a super cool web app that converts PDF files to DOCX files. It's built using Python, Flask, and some awesome libraries like pdfplumber and python-docx.

Features 🌟

  • Extracts text and tables from PDFs
  • Handles OCR for scanned PDFs using Tesseract
  • Keeps the formatting and structure of the original PDF
  • Easy-to-use web interface

How to run it locally 💻

  1. Clone this repo:
git clone https://github.com/Awis13/pdf2docx.git
  1. Install the required packages:
pip install -r requirements.txt
  1. Run the Flask app:
python app.py
  1. Open your browser and go to http://localhost:5000/.

Deploying to Heroku 🌐

  1. Create a new Heroku app.
  2. Connect your GitHub repo to the Heroku app.
  3. Deploy the main branch to Heroku.
  4. Access your app using the provided Heroku URL.