pdf2text
There are 26 repositories under pdf2text topic.
modesty/pdf2json
converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.
seinecle/nocodefunctions-web-app
The code base of the front-end of nocodefunctions.com
yakovypg/Ypdf
We present Ypdf, a PDF document processing application that combines the best features of existing solutions and provides the most popular and requested functionality for free to its users.
TheLime1/CheatoMate
A collection of scripts to "help" you with your programming exams and assignments.
chiraag-kakar/PyAutomation
Simple and Useful Automation Tools built with the help of modules available with Python published at PyPI.
andrealenzi11/py-poppleract
Python library and Web service based on Poppler Pdftotext utility and Tesseract OCR for extracting text from PDF documents
worldbank/wb-nlp-tools
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
StephanyBatista/ExtractOcrApi
A API in .Net Core to extract documents OCR with many libs linux
AzozzALFiras/Pdf-OCR
A simple, free tool for extracting text from scanned PDFs and images using OCR, and converting images to PDFs. It processes files locally in the browser, ensuring privacy and security while enabling users to effortlessly convert documents and images into editable text or PDF format.
TanishqChamoli/Newspaper_Mining
Newspaper mining and the analysis of the results using python. Cleaning the text using OCR.
imesut/PdfReg
PdfReg is a web tool, which gets text at selected regions of pdf document.
DrMcCoy/pdftextorizer
Interactively extract text from multi-column PDFs
views63/pdf2text
pdf to text
FastPDFTeam/pdf-to-word-converter
Fast PDF to Word Converter is the Fastest Batch PDF Converter easily converting PDF to fully editable Office Word,Text,RTF,HTML and more
Isaccseven/pdf2text
Extract text from pdf using ocr
johbar/go-poppler
Limited, yet memory-leak-free Go wrapper for a Poppler PDF library
sahil352005/ChatWithPdf-Images
A Streamlit-based app that allows users to upload PDFs or images, extract text, and engage in interactive Q&A. Using Google Generative AI, this app enables insightful conversations based on document contents. Ideal for those seeking quick answers from their files in a simple, intuitive interface.
seinecle/nocodefunctions-io
io for nocodefunctions: csv, txt, pdf, and xlsx so far
senavs/pdfto
:heavy_check_mark: A Python Flask API to manage PDF files.
BinhQuocLy/Pdf2Quiz
A Pdf2Quiz NLP model.
ChrisCraddock/DC-Advanced-Walkthrough
Data Center Advanced Walkthrough. Insert data from a PDF file into MySQL database
SeeligA/OCRstream
Building an OCR pipeline for PDF to TXT
1994nikunj/textify-pdf
Textify-PDF: Extracting Text from PDF Files
davibusanello/pdf2txt
A simple CLI to to convert PDF files into TXT using OCR
fer-aguirre/pdf-2-ner
Web application for information extraction and named entity recognition for PDF files (work-in-progress).
zhangshi0512/DevTools
A lightweight Python-based Software Package for daily use