pdf-processing

There are 69 repositories under pdf-processing topic.

  • dissorial/doc-chatbot

    Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

    Language:TypeScript8641240145
  • allenai/papermage

    library supporting NLP and CV research on scientific papers

    Language:Python757103661
  • PDFs-TextExtract

    ahmedkhemiri95/PDFs-TextExtract

    Multiple and Large PDF Documents Text Extraction.

    Language:Python1317165
  • document-processing-pipeline-for-regulated-industries

    aws-samples/document-processing-pipeline-for-regulated-industries

    A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.

    Language:Python646014
  • Govind-S-B/pdf-to-text-chroma-search

    Python scripts that converts PDF files to text, splits them into chunks, and stores their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma DB for similarity search based on user input.

    Language:Python23107
  • ManasMadan/pdf-actions

    A NPM Package built on top of pdf-lib that provides functonalities like merge, rotate, split,download pdf to disk and many more...

    Language:JavaScript14228
  • DioCrafts/ai-book-summarizer

    📚 AI-Powered Book PDF Knowledge Extractor & Summarizer Transform your PDF books into structured knowledge effortlessly! This tool leverages AI to analyze books page by page, extracting key insights, definitions, and concepts, and organizes them into Markdown summaries for easier study

    Language:Python8102
  • ManasMadan/PDFActions

    Built with pdf-actions NPM package.

    Language:JavaScript7115
  • ranguy9304/LangGraphRAG

    LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP researchers and developers working on advanced conversational AI and information retrieval systems.

    Language:Python7300
  • enesmanan/paper-bold

    AI-powered RAG-based tool for summarizing, extracting insights, and answering questions about research papers with high accuracy

    Language:HTML6
  • Inc44/MaTools

    An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.

    Language:Python6100
  • AkshayG999/MistralOCR---AI-Powered-Document-Extraction

    MistralOCR is an open-source application that transforms documents into structured data using Mistral AI's OCR capabilities. Built with FastAPI and Streamlit, it provides an intuitive interface for extracting and processing text from PDFs and images, making document digitization effortless and accurate.

    Language:Python310
  • Aleptonic/PdfSnipper

    PdfSnipper is a lightweight and efficient Python package designed to simplify the management of PDF files, pages, and their conversions during various NLP, Computer Vision (CV), or other data processing tasks. The package eliminates the need for repetitive code by providing intuitive, ready-to-use functions for common PDF-related operations.

    Language:Python3100
  • allanninal/document-summarizer

    The Document Summarizer leverages Hugging Face’s facebook/bart-large-cnn model to transform lengthy documents into concise summaries. Built with ReactJS (Vite) for the frontend and Flask for the backend, it supports PDF and text files, offering real-time summarization for researchers, students, and professionals.

    Language:JavaScript30
  • thinhuos0913/python_useful_mini_projects

    This is some useful mini projects that I had worked for self-learning Python programming.

    Language:Python3101
  • Yardenrsk/PsychometryReceiverCV

    A side project to easily get and annotate questions and answers to the PsychometryBot project DB using computer vision and pdf parsing

    Language:Python3100
  • 9-5/Chromium-Intelligence

    A powerful Chromium extension that leverages the multiple AI APIs to assist with various text operations, image analysis, and PDF processing.

    Language:JavaScript2101
  • Al-shwaib/Book-Preparation-for-Printing

    A web application for preparing books and magazines for offset printing. Automatically arranges PDF pages for commercial A3 printing, supporting both Arabic (RTL) and English (LTR) books. تطبيق ويب لتحضير الكتب والمجلات للطباعة على مطابع الأوفست. يقوم تلقائياً بترتيب صفحات PDF للطباعة التجارية على ورق A3، مع دعم الكتب العربية والإنجليزية.

    Language:Python210
  • arsath-eng/RAG1-NVIDIA-GENAI

    A powerful Retrieval Augmented Generation (RAG) application built with NVIDIA AI endpoints and Streamlit. This solution enables intelligent document analysis and question-answering using state-of-the-art language models, featuring multi-PDF processing, FAISS vector store integration, and advanced prompt engineering.

    Language:Python2101
  • dsckiet/covid-tracker-android-app

    A statistical data display and notifier app for Covid-19 pandemic.

    Language:Kotlin21152
  • Farhaj499/RAG_with_Weaviate_DB

    This project implements a Retrieval Augmented Generation (RAG) system that answers questions based on the PDF document. It utilizes Weaviate as a vector database for efficient retrieval of relevant information and Gemini to generate natural language responses.

    Language:Jupyter Notebook2100
  • rithulkamesh/docproc

    Opinionated and Sophisticated Document Region Analyzer.

    Language:Python2190
  • setuc/pdf-annotation-with-azure-doc-intel

    Azure Document Intelligence Result Processor: A toolset for annotating PDFs based on Azure Document Intelligence analysis results, featuring a React web application and a standalone Python script for processing and visualizing extracted data with confidence indicators.

    Language:JavaScript2100
  • ydvrahul19/Invoice-Manager

    A modern, intelligent invoice processing system with advanced multi-format data extraction capabilities. Process invoices from PDFs, Excel files, and images with smart data recognition.

    Language:JavaScript2100
  • akshatpunia26/berrylit_pdf_chat

    Berrylit is a simple chatbot interface that allows users to upload a PDF file and ask a question related to its contents. The chatbot uses the Berri API for processing.

    Language:Python1101
  • Aumlo123/pdfdoom

    DOOM in a PDF (as ascii art)

  • eddieyg/freedomfile

    Freedom to use PDF, DOC and other document processing

    Language:TypeScript1110
  • Francesco-Sovrano/Swiss-G2C-User-Guide-Analysis

    Extensive analysis of user guides in Swiss government-to-citizen software, correlating guide features with canton socio-economic factors.

    Language:Python110
  • FurqanHun/textnomnom-py

    Extract text from PDFs, PPTs, & URLs (with OCR support). Converts PPT to PDF & handles files or folders. 🦍

    Language:Python1110
  • gs-ai/PDFProfessor

    PDF Professor 2.0 extracts and processes PDF text, analyzed by Ollama for summarization, data extraction, and insights. More coming soon!

    Language:Python1100
  • HemantM29/Multimodal-Document-Analysis-and-Query-Retrieval

    This project performs multimodal document analysis and query retrieval by downloading PDFs, converting pages to images, indexing them for semantic search, and analyzing retrieved images using visual-language models like Qwen2VL and Blip2.

    Language:Jupyter Notebook110
  • king04aman/PDF-Extractor-API

    PDF Extractor API is a FastAPI project for extracting information from PDFs. It includes user authentication, PDF uploading, and text extraction. The API supports secure PDF uploads, keyword-based extraction, and rate limiting.

    Language:Python1102
  • Mateusz2734/pdf-cli

    CLI tool to merge, compress, extract or delete pages from PDF

    Language:Python1100
  • mohamedelareeg/ImageAutomaticCroppingWatcher

    Image Automatic Cropping Watcher: A tool that automatically detects PDF files, converts them to images, corrects perspective distortion, and compiles them back into PDFs.

    Language:C#110
  • RajnishProgrammer/PDF-Info-Processing-Service

    A PDF processing project with backend integration using Python-Flask 🚀

    Language:HTML1
  • Remisu/GajyunETL

    The goal of this project is to eliminate the need for paper by digitizing the process of handling client passport information.

    Language:C#10