pdf-converter
There are 1034 repositories under pdf-converter topic.
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
docling-project/docling
Get your documents ready for gen AI
wmjordan/PDFPatcher
PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等
gotenberg/gotenberg
A developer-friendly API for converting numerous document formats into PDF files, and more!
C4illin/ConvertX
💾 Self-hosted online file converter. Supports 1000+ formats ⚙️
bytedance/Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
marcbachmann/node-html-pdf
This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.
borb-pdf/borb
borb is a library for reading, creating and manipulating PDF files in python.
ArtifexSoftware/pdf2docx
Open source Python library for converting PDF to DOCX.
xhtml2pdf/xhtml2pdf
A library for converting HTML into PDFs using ReportLab
arachnys/athenapdf
Drop-in replacement for wkhtmltopdf built on Go, Electron and Docker
modesty/pdf2json
converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.
sajari/docconv
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text
MarkPDFdown/markpdfdown
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
jzillmann/pdf-to-markdown
A PDF to Markdown converter
DocumindHQ/documind
Open-source platform for extracting structured data from documents using AI.
zelon88/HRConvert2
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
Swati4star/Images-to-PDF
An app to convert images to PDF file!
rdvojmoc/DinkToPdf
C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
spatie/pdf-to-text
Extract text from a pdf
booktype/Booktype
Booktype is a free, open source platform that produces beautiful, engaging books formatted for print, Amazon, iBooks and almost any ereader within minutes.
adithya-s-k/marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
DDULDDUCK/every-pdf
✍️ A powerful, all-in-one desktop PDF toolkit to edit, convert, merge, and secure your documents. Built with Electron, Next.js, and Python.
elliotblackburn/mdpdf
Markdown to PDF command line app with support for stylesheets
explosion/spacy-layout
📚 Process PDFs, Word documents and more with spaCy
adrienjoly/npm-pdfreader
🚜 Parse text and tables from PDF files.
drmingler/docling-api
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
lampnick/doctron
html转pdf , html转图片 , Docker-powered html convert to pdf(html2pdf), html to image(html2image like jpeg,png),which using chrome(golang) kernel.
avidLearnerInProgress/python-automation-scripts
Simple yet powerful automation stuffs.
GowenGit/docnet
DocNET is as fast PDF editing and reading library for modern .NET applications
vladholubiev/serverless-libreoffice
Run LibreOffice in AWS Lambda to create PDFs & convert documents
bitcrowd/chromic_pdf
Convenient HTML to PDF/A rendering library for Elixir based on Chrome & Ghostscript
abarker/pdfCropMargins
pdfCropMargins -- a program to crop the margins of PDF files
erayakartuna/pdf-flipbook
Browse PDF document like a book turning its pages