pdf-processing

There are 210 repositories under pdf-processing topic.

dissorial/doc-chatbot
Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.
Language:TypeScript866 12 41145
allenai/papermage
library supporting NLP and CV research on scientific papers
Language:Python785 8 3663
Tele-AI/doc-ops-mcp
MCP server for seamless document format conversion and processing
Language:TypeScript132 1 23
ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
Language:Python131 7 165
postralai/masquerade
The Privacy Firewall for LLMs
Language:Python72 12 121
aws-samples/document-processing-pipeline-for-regulated-industries
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Language:Python65 6 014
PSPDFKit/nutrient-dws-client-python
Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion
Language:Python53 0 16
PSPDFKit-labs/nutrient-dws-client-typescript
This library provides a type-safe and ergonomic interface for document processing operations including conversion, merging, compression, watermarking, and text extraction using Nutrient DWS Processor API.
Language:TypeScript35 0 0
autollama/autollama
Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.
Language:HTML25 1 280
Govind-S-B/pdf-to-text-chroma-search
Python scripts that converts PDF files to text, splits them into chunks, and stores their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma DB for similarity search based on user input.
Language:Python23 1 07
tetratensor/ML-powered_resume_analyser
Local, privacy-friendly resume analysis: convert, classify, and get advice using TF‑IDF, Logistic Regression, and sentence-transformer embeddings.
Language:Python18
enesmanan/paper-bold
AI-powered RAG-based tool for summarizing, extracting insights, and answering questions about research papers with high accuracy
Language:HTML171
ManasMadan/pdf-actions
A NPM Package built on top of pdf-lib that provides functonalities like merge, rotate, split,download pdf to disk and many more...
Language:JavaScript14 2 28
ranguy9304/LangGraphRAG
LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP researchers and developers working on advanced conversational AI and information retrieval systems.
Language:Python14 2 02
Remy2404/Polymind
A powerful, multi-modal Telegram bot leveraging cutting-edge AI technologies including Gemini, DeepSeek, OpenRouter, and 50+ AI models for comprehensive conversational assistance, media processing, and collaborative features with MCP (Model Context Protocol) integration.
Language:Python100
DioCrafts/ai-book-summarizer
📚 AI-Powered Book EPUB Knowledge Extractor & Summarizer Transform your PDF books into structured knowledge effortlessly! This tool leverages AI to analyze books page by page, extracting key insights, definitions, and concepts, and organizes them into Markdown summaries for easier study
Language:Python9 1 02
ManasMadan/PDFActions
Built with pdf-actions NPM package.
Language:JavaScript7 1 15
Alijanloo/Pdf2Table
A Python library for extracting tables from PDF documents using computer vision and image processing techniques. It converts PDF pages to images, detects tables, recognizes their structure, and outputs clean data in JSON format.
Language:Python60
allanninal/document-summarizer
The Document Summarizer leverages Hugging Face’s facebook/bart-large-cnn model to transform lengthy documents into concise summaries. Built with ReactJS (Vite) for the frontend and Flask for the backend, it supports PDF and text files, offering real-time summarization for researchers, students, and professionals.
Language:JavaScript6 1 03
Inc44/MaTools
An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.
Language:Python6 1 00
AkshayG999/MistralOCR---AI-Powered-Document-Extraction
MistralOCR is an open-source application that transforms documents into structured data using Mistral AI's OCR capabilities. Built with FastAPI and Streamlit, it provides an intuitive interface for extracting and processing text from PDFs and images, making document digitization effortless and accurate.
Language:Python4 1 0
noorjotk/local-rag-engine
Local RAG app with zero-config Docker setup. FastAPI + Streamlit + Qdrant + Ollama. Just run `docker-compose up --build`! 🚀
Language:Python4
Aleptonic/PdfSnipper
PdfSnipper is a lightweight and efficient Python package designed to simplify the management of PDF files, pages, and their conversions during various NLP, Computer Vision (CV), or other data processing tasks. The package eliminates the need for repetitive code by providing intuitive, ready-to-use functions for common PDF-related operations.
Language:Python3 1 00
arsath-eng/RAG1-NVIDIA-GENAI
A powerful Retrieval Augmented Generation (RAG) application built with NVIDIA AI endpoints and Streamlit. This solution enables intelligent document analysis and question-answering using state-of-the-art language models, featuring multi-PDF processing, FAISS vector store integration, and advanced prompt engineering.
Language:Python3 1 01
gwyndolin75/Document-QA-System
A Streamlit-based app for asking questions directly from uploaded documents using Gemini embeddings and a language model. Supports PDF, TXT, and DOCX files. Fast, simple, and powerful document-based QA.
Language:Jupyter Notebook3 1 03
Rayyan9477/ocr-app
State-of-the-art Optical Character Recognition (OCR) with Vision Language Model (VLM) integration for enhanced accuracy and optimal document processing.
Language:TypeScript3 1 1
thinhuos0913/python_useful_mini_projects
This is some useful mini projects that I had worked for self-learning Python programming.
Language:Python3 1 01
umur957/Custodian
An intelligent, enterprise-grade document management system that automatically sorts, renames, and archives digital documents using state-of-the-art OCR and AI technology.
Language:Python3
wesellis/TECH-Adobe-Enterprise-Automation-PowerShell-Python-Portfolio
[100% Complete] 🎉 Production-ready Adobe CC automation suite. 5,750+ lines: PowerShell + Python. User provisioning, ML license optimization, PDF workflows, compliance auditing. Docker/K8s/Terraform ready.
Language:JavaScript3
Yardenrsk/PsychometryReceiverCV
A side project to easily get and annotate questions and answers to the PsychometryBot project DB using computer vision and pdf parsing
Language:Python3 1 00
Assem-ElQersh/Multi-AI-Consultation
Terminal-based platform where specialized AI experts (Legal, Tech, Business) engage in real-time debates and collaborative problem-solving to provide multi-perspective analysis for complex decisions.
Language:Python2
jonathanfavorite/RAGamuffin
A lightweight, cross-platform .NET library for building RAG (Retrieval-Augmented Generation) pipelines with local embedding models and SQLite vector storage. Perfect for developers who need privacy-focused, offline-capable document search and AI-powered question answering without external API dependencies.
Language:C#20
masfaatanveer/Lease-Summarization-Model-NLP
This project uses OCR and a BART-based NLP pipeline to extract and summarize landlord, tenant, property, and contract details from scanned lease agreements. It combines Tesseract OCR, pdf2image, and HuggingFace Transformers to deliver structured legal summaries in JSON format.
Language:Python2
rlwadh/markitdown-desktop
Professional document converter with Desktop & Web versions. Unlimited PDF processing, multi-file support. Supports kindergarten project.
Language:Python2
Siddharthsinghkumar/auto-job-match-pipeline
AI-powered job search assistant that reads newspapers daily, finds jobs matching your resume using GPT, and alerts you via Telegram. 2025
Language:Python2 0 0
UjjwalSaini07/OllamaMulti-RAG
OllamaMulti-RAG 🚀 is a multimodal AI chat app combining Whisper AI for audio, LLaVA for images, and Chroma DB for PDFs, enhanced with Ollama and OpenAI API. 📄 Built for AI enthusiasts, it welcomes contributions—features, bug fixes, or optimizations—to advance practical multimodal AI research and development collaboratively.
Language:Python21

pdf-processing

dissorial/doc-chatbot

allenai/papermage

Tele-AI/doc-ops-mcp

ahmedkhemiri95/PDFs-TextExtract

postralai/masquerade

aws-samples/document-processing-pipeline-for-regulated-industries

PSPDFKit/nutrient-dws-client-python

PSPDFKit-labs/nutrient-dws-client-typescript

autollama/autollama

Govind-S-B/pdf-to-text-chroma-search

tetratensor/ML-powered_resume_analyser

enesmanan/paper-bold

ManasMadan/pdf-actions

ranguy9304/LangGraphRAG

Remy2404/Polymind

DioCrafts/ai-book-summarizer

ManasMadan/PDFActions

Alijanloo/Pdf2Table

allanninal/document-summarizer

Inc44/MaTools

AkshayG999/MistralOCR---AI-Powered-Document-Extraction

noorjotk/local-rag-engine

Aleptonic/PdfSnipper

arsath-eng/RAG1-NVIDIA-GENAI

gwyndolin75/Document-QA-System

Rayyan9477/ocr-app

thinhuos0913/python_useful_mini_projects

umur957/Custodian

wesellis/TECH-Adobe-Enterprise-Automation-PowerShell-Python-Portfolio

Yardenrsk/PsychometryReceiverCV

Assem-ElQersh/Multi-AI-Consultation

jonathanfavorite/RAGamuffin

masfaatanveer/Lease-Summarization-Model-NLP

rlwadh/markitdown-desktop

Siddharthsinghkumar/auto-job-match-pipeline

UjjwalSaini07/OllamaMulti-RAG