pypdf

There are 111 repositories under pypdf topic.

  • xhtml2pdf

    xhtml2pdf/xhtml2pdf

    A library for converting HTML into PDFs using ReportLab

    Language:Python2.3k72445656
  • genieincodebottle/parsemypdf

    Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

    Language:Python1343128
  • hoehermann/pypdf_strreplace

    Search and replace text in PDF files with PyPDF.

    Language:Python46294
  • shine-jayakumar/Extract-Data-From-PDF-In-Python

    Batch-convert pdf to text, extract data from pdf in python

    Language:Python301012
  • lukefire5156/PPTs_TO_PDFs_AND_Merger

    A script to convert MS Office PPT/PPTX files to PDF files and then merge all the PDF files to a single PDF file.

    Language:Python11105
  • SaurabhSSB/PDFMergerCLI

    A lightweight Python CLI tool to merge multiple PDF files into one. Built using the pypdf library, this script prompts users for input and merges selected PDFs into a single output file with a configurable name.

    Language:Python10
  • xreedev/Research-Asist-Tool

    This project aims to simplify and summarize scientific data , convert it to a audio format as a podcast , and create a power point presentation from the paper. This helps researchers, academics and students altogether.

    Language:JavaScript10100
  • pyDF-Bot

    nuhmanpk/pyDF-Bot

    Pydf - Pyrogram Document File Bot, a modular Telegram Bot which provides Pdf Tools Works using Pypdf2

    Language:Python90018
  • nanxstats/pdf-word-extraction

    Extract meaningful words from a collection of PDF documents and count their frequencies

    Language:Python7201
  • peinan/pdfchat

    Gradio demo of LLM chatbot using RAGs

    Language:Python61111
  • Tobi208/pypdf-cli

    A Python-based CLI that allows for comfortable every-day PDF manipulation with pypdf.

    Language:Python6141
  • ClubCedille/rapport_eirik

    Remplissage automatique des demandes de remboursement pour les clubs étudiants de l'ÉTS.

    Language:Python35121
  • farvath/Resume-Parser-and-Analysis

    This application is built for employers looking for candidates against a particular job description .

    Language:Python3202
  • Ranjan2104/Create-Audio-Book-from-pdf

    A Pure-Python library built as a PDF toolkit. It is capable of: extracting document information (title, author, …) splitting documents page by page merging documents page by page cropping pages merging multiple pages into a single page encrypting and decrypting PDF files and more! By being Pure-Python, it should run on any Python platform without any dependencies on external libraries. It can also work entirely on StringIO objects rather than file streams, allowing for PDF manipulation in memory. It is therefore a useful tool for websites that manage or manipulate PDFs. Project description pyttsx3 is a text-to-speech conversion library in Python. Unlike alternative libraries, it works offline, and is compatible with both Python 2 and 3.

    Language:Python3101
  • theshobhitsingh/PDFVoice

    A Python script project that converts PDF text into an MP3 audio file using text-to-speech technology.

    Language:Python310
  • aeksco/jupyter-tabula

    Docker container image built with Jupyter Notebook and Tabula for PDF scraping

    Language:Jupyter Notebook2201
  • MathScore

    AsrtoMichi/MathScore

    Software for counting points in team mathematics competitions

    Language:Python2110
  • AVIPAGHADAR1729/pdf-merge

    Project For PDF-Merge API Build with Flask and PyPDF

    Language:Python2100
  • AWeirdDev/crapdf

    🦀 Extract text from PDF files.

    Language:Python210
  • Bushra-Butt-17/BudgetBuddy-Finance-Chatbot

    Budget Buddy is a finance chatbot built using Chainlit and the LLaMA language model. It analyzes PDF documents, such as bank statements and budget reports, to provide personalized financial advice and insights. The chatbot is integrated with Hugging Face for model management, offering an interactive way to manage personal finances.

    Language:Python2110
  • leomarkcastro/Puzzle-Maker

    A puzzle generator / game and pdf maker. Pretty advance stuff for me. Uses Sciter for UI, pygame for puzzle game and puzzle images.

    Language:Python2101
  • nlutala/pdf-merger

    A script that combines multiple pdfs together to make 1 merged pdf

    Language:Python2100
  • orange2moon/nautilus-scripts

    Graphics automation scripts for the Nautilus file browser (GNOME's file browser).

    Language:Python2100
  • sankethsj/newspaper-bot

    E-Paper Bot is a tool designed to automate the process of downloading the Kannada Prabha newspaper from the official source.

    Language:Python20
  • akhnasj/Automotive-Chatbot

    An automotive customer-support chatbot using LangChain, Pinecone, and Microsoft's Phi-2 with RAG for fast, resource-efficient customer support.

    Language:Jupyter Notebook1
  • anthonymalumbe/llm_products

    This end-to-end assistant leverages the full architecture and implementation details of FastAPI, Gemini, Langchain, ChromaDB, and PyPDF.

    Language:Python1
  • armanjscript/Fusion-RAG

    A powerful web-based application designed to answer questions based on the content of uploaded PDF documents. This project leverages the **Fusion-in-Decoder (FiD)** approach for **Retrieval-Augmented Generation (RAG)**, combining semantic similarity, technical term relevance, and recency to deliver accurate and contextually relevant responses

    Language:Python1
  • codewithdark-git/PDF_web_App

    This Flask web application simplifies PDF tasks. Merge multiple PDFs, convert text to PDF, and split PDFs into downloadable ZIP files. User-friendly, efficient, and built with Python Flask and PyPDF2. Enhance document management and formatting with ease.

    Language:HTML1100
  • d-kavinraja/AI-Powered-PDF-Context-Retrieval-Chatbot-RAG

    The backend that ingests any PDF, indexes . it for semantic search, and answers queries via a RAG pipeline using FastAPI.

    Language:Python1
  • IiroP/pdf-booklet-generator

    Convert PDF to booklet for printing

    Language:Python1100
  • jordiba90/PyBcn_-_Meetup_2024.01.24_-_PyPDF

    Mastering PDF Form-Filling with PyPDF

    Language:Python1100
  • martinheinrich2/PDF-Tool

    Simple PDF-Reader and tools to rotate, merge, split, remove, extract pages.

    Language:Python1100
  • muhammadadilnaeem/Exploding-Population-Myths-1995-using-Google-Gemma-Model

    Welcome to the Exploding Population Myths 1995 repository! This project leverages the Google Gemma model to analyze and debunk common population myths from 1995 book, providing valuable insights into historical population trends.

    Language:Python1100
  • paolpal/PDFWizard

    Toolkit for pdf editing.

    Language:Python1100
  • patel-anshuman/medimate

    MediMate is a friendly health assistant chatbot designed to provide comprehensive health related support. From scheduling doctor appointments, extracting prescription details from PDFs, and offering emergency assistance, to dispensing health tips and home remedies, MediMate is your reliable and friendly companion for all your health-related needs.

    Language:Python1160
  • shadwoods2942/pdf-merger

    A Python utility for merging multiple PDFs and images into a single PDF file. This tool maintains aspect ratios, centers content on custom-sized pages (default A4), and supports recursive directory processing. Perfect for organizing documents and creating cohesive PDF compilations.

    Language:Python1