docling
There are 61 repositories under the docling topic.
shoryasethia/markdrop
A Python package that converts PDFs to Markdown, extracts images and tables, and generates text descriptions of the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
genieincodebottle/parsemypdf
Collection of PDF parsing libraries and tools, including AI-based Docling, Claude, OpenAI, Gemini, Meta's Llama Vision, and unstructured-io, as well as pdfminer, PyMuPDF, and pdfplumber, for efficient snapshot, text, table, and metadata extraction.
fahdmirza/doclingwithollama
Docling with Ollama - RAG on Local Files with Local Models
garyzava/chat-to-database-chatbot
Chat to your Database GenAI Chatbot
ghodsizadeh/pdf2csv
A Python library and CLI tool to convert PDF files to CSV files.
versionHQ/multi-agent-system
Autonomous agent networks for task automation that requires multi-step reasoning
docling-project/docling4j
Docling4j brings Docling's document-understanding functionality to Java® projects
Slayer412/docling-bedrock-plugin
Integrates AWS Bedrock's multimodal capabilities (Claude 3) into the Docling framework for generating image descriptions within document processing pipelines.
ya0002/obsidian-assist
Make Zettelkasten-style note-taking the foundation of interactions with Large Language Models (LLMs).
felixdittrich92/docling-OCR-OnnxTR
OnnxTR OCR plugin for Docling
quarkiverse/quarkus-docling
Docling simplifies document processing, parsing diverse formats (including advanced PDF understanding) and providing seamless integrations with the gen AI ecosystem
HaileyTQuach/docchat-docling
DocChat is an AI-powered Multi-Agent RAG system using Docling for structured document parsing and BM25 + vector search retrievers to retrieve fact-checked answers from PDFs, DOCX, and text files, preventing hallucinations. 🚀
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.
aspose-cells-python/aspose-cells-python
High-performance Python Excel processing library with advanced conversion capabilities
bisonbet/open-health
OpenHealth, AI Health Assistant | Powered by Your Data
NotYuSheng/OmniPDF
OmniPDF is a PDF analyzer capable of translation, summarization, captioning, and conversational interaction through Retrieval-Augmented Generation (RAG).
Rishang/deep-research
Python SDK for Deep-Research
thevladdo/rag-backend
Retrieval-Augmented Generation server with Pinecone and OpenAI
ibm-granite-community/docling-workshop
Source code for Docling Workshop
ramona1999/Contract-Risk-Assessment
This project is an AI-powered Contract Risk Assessment and Legal Assistant designed to analyze legal documents, extract key clauses, assess risks, and provide actionable recommendations. Additionally, a fine-tuned conversational chatbot is integrated for interactive legal Q&A based on contract-specific knowledge.
serkanyasr/ntt_rag_project
Scalable Agentic RAG system using Pydantic AI, FastAPI & pgvector. Modular, production-ready foundation for document-based AI apps
TM9657/docling-binary
Docling Binary Server.
workloads/pathfinder-prism
Pathfinder Prism - AI-powered Knowledge Base
amirkiarafiei/docling-processor
A Docling extension for superior PDF/DOCX to Markdown conversion, featuring smart image understanding with Gemini VLM.
Danitilahun/Document-processing-Pdf-Structured-Data-Extractor
This project demonstrates how to extract structured information from PDF documents using a combination of LangChain, OpenAI models, and the Docling library. It provides a framework for parsing PDFs and leveraging LLMs to identify and format key data points.
kwame-mintah/python-langchain-chainlit-qdrant-ollama-stack-template
📄 A project template for creating a Chainlit application, using a locally run model via Ollama and a Qdrant vector database for document retrieval.
ParthaPRay/docling_RAG_langchain_colab
This repo contains code for RAG using Docling in a Colab notebook with LangChain, Milvus, a Hugging Face embedding model, and an LLM
ParthaPRay/gradio_docling_rag_langchain
This repo provides RAG using Docling, LangChain, Milvus, Sentence Transformers, and Hugging Face LLMs
shrimantasatpati/Document_Parser_using_AI
Parse documents using AI - any document converted to markdown suitable for RAG applications
AirtonLira/rag_rerank_n8n_minio
This is a complete stack for workflow automation, document processing, and semantic search using open-source tools. The project integrates n8n for automation, Supabase as a backend with vector support, MinIO for object storage, and Ollama for local embedding generation.
hemanthkt/impactoverse-AI-mentor
Developed an intelligent AI chatbot utilizing the DeepSeek LLM, designed for efficient interaction with large documents such as textbooks and study materials. Integrated Docling for parsing and processing large files, and implemented a Retrieval-Augmented Generation (RAG) pipeline using FAISS and Sentence Transformers to optimize context retrieval
Jarus77/markdrop
A Python package that converts PDFs to Markdown, extracts images and tables, and generates text descriptions of the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
Mil100057/Myr-Ag-system
Local RAG system with Docling, LEANN, and Ollama - 97% storage efficient
PiMaV/Hybrid_Data_RAGfinery
AS-IS hybrid RAG reference (ArangoDB + Qdrant + Docling + n8n + openwebui). Requires HCL Notes → JSON export. Archived; no maintenance.
shijincai/fast360
The industry's first "Open Source OCR Arena," a free, no-login utility for one-click benchmarking of 7 top-tier models (Marker, MinerU, MonkeyOCR, Docling, Dolphin, OCRFlux, PP-StructureV3) on your PDF/image files, specializing in PDF-to-Markdown conversion.
vane/audiobook
CLI tool to convert PDF documents to audiobooks