pypdf2
There are 444 repositories under the pypdf2 topic.
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
pikepdf/pikepdf
A Python library for reading and writing PDF, powered by QPDF
py-pdf/benchmarks
Benchmarking PDF libraries
ahmedkhemiri95/PDFs-TextExtract
Text extraction from multiple and large PDF documents.
MicheleCotrufo/pdf2doi
A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.
chazeon/PDF-Watermark-Remover
Remove PDF watermarks from academic papers using pypdf
doolieSoft/PdfPasswordRemover
A tool for removing passwords from PDF files
gaborvecsei/Pdf-Split-Merge
A simple tool for splitting and merging PDF files
santiago9631/PDF-summarizer-chatbot-using-LLaMa2
A user-friendly application that allows users to upload PDF documents and receive concise summaries generated using advanced Large Language Models (LLMs).
py-pdf/PyPDF-Builder
A cross-platform utility to join, split, stamp, and rotate PDFs written in Python. Yes, Python!
TheWatcherMultiversal/pdfgui_tools
pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.
jiandandaoxingfu/pdfdo
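A PDF file processing tool, including: PDF cropping, PDF rotation, PDF merging, PDF splitting, adding page numbers to PDFs, PDF-to-image conversion, Word-to-PDF conversion, and more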
amitgupta4407/All_About_PDF
A complete website where you can chat with a PDF and extract metadata, text, links, images, and a lot more. Check my blog for more details: https://medium.com/@amit.2503719/allaboutpdf-tool-for-data-extraction-and-talking-to-pdf-using-chatpdf-feature-f2daea15a59c
shine-jayakumar/Extract-Data-From-PDF-In-Python
Batch-convert pdf to text, extract data from pdf in python
Deba951/Resume-ATS-Tracking-LLM-Project
Smart ATS evaluates resumes against job descriptions, providing match percentage, missing keywords, and improvement suggestions.
sfneal/pdfconduit
Prepare documents for distribution
crispyzingy/PDFExcelWordParser
Parse PDF, Word, and Excel documents. Read, create, merge/combine, and extract data from office documents.
nikhilkumarsingh/PDF_AUDIO_READER
A simple and offline PDF audio reader
viveksb007/camscanner_watermark_remover
A Flask app that removes the CamScanner watermark from scanned PDFs.
asepmaulanaismail/pdf-to-txt-python
Simple PDF-to-text conversion in Python using PDFtk and PyPDF2
PranjalGupta2199/OFFLINE-ERP
A desktop application which helps students to choose Disciplinary and Open Electives wisely.
Parathantl/tesseract_gui
Simple Python GUI Tool for Tesseract4
gugli28/LocalEPaper
Get local e-papers (Dainik Jagran and Hindustan)
jayeshbhandarkar/AgriGenius
AgriGenius, an AI-powered agriculture chatbot, is a Python web application designed to make information more accessible to farmers. It uses a Retrieval-Augmented Generation model to answer farmers' agricultural queries precisely.
Azazel0203/MCQ_GENERATOR
The "MCQ Generator with Streamlit" web app utilizes OpenAI's language models to create multiple-choice questions (MCQs) from uploaded PDF or text files. Users can customize question parameters like quantity, subject, and tone. The app offers real-time complexity feedback and presents MCQs in an easy-to-read tabular format.
IAmMaulik/Audiobook_Maker
Convert a PDF to an MP3 file using this Python code
lukefire5156/PPTs_TO_PDFs_AND_Merger
A script to convert MS Office PPT/PPTX files to PDF files and then merge all the PDF files to a single PDF file.
ajinux/KCT-Academic-Calendar-Converter
A Python script that converts the KCT (Kumaraguru College of Technology) academic calendar PDF into a CSV file and syncs the events with Google Calendar.
nigelreign/pdf-generator
A script that generates a PDF file. You can create a new PDF from an HTML file or write on top of an existing PDF.
nuhmanpk/pyDF-Bot
Pydf - Pyrogram Document File Bot, a modular Telegram bot that provides PDF tools using PyPDF2
r96ahularya/PDF-Player
Simple python utilities to play around with PDF Files
triposat/Python_Beginner_Level_Projects
This repository contains several beginner-level Python projects.
ZeusSama0001/RAG-chatbot
RAG chatbot using Llama 2, chainlit and Faiss
Karthik-02/plagiarism-detection
Provides a comprehensive solution for detecting plagiarism and finding similarities between text documents
kezb90/PDF_To_Word
A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.
Sweatnessstrong/pdf-to-word-converter
This Python script converts a PDF file to Word format using OCR (Optical Character Recognition). It extracts text from each page of the PDF, converts the pages to images, performs OCR on the images, and saves the extracted text to text files.