pdf

There are 10844 repositories under pdf topic.

  • justjavac/free-programming-books-zh_CN

    :books: 免费的计算机编程类中文书籍,欢迎投稿

  • microsoft/markitdown

    Python tool for converting files and office documents to Markdown.

    Language:Python74.3k2553294.1k
  • Stirling-Tools/Stirling-PDF

    #1 Locally hosted web application that allows you to perform various operations on PDF files

    Language:Java67.3k2051.4k5.7k
  • opendatalab/MinerU

    A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

    Language:Python43.9k1491.3k3.6k
  • docling

    docling-project/docling

    Get your documents ready for gen AI

    Language:Python38.7k1661.2k2.7k
  • siyuan

    siyuan-note/siyuan

    A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

    Language:TypeScript37.2k15915.1k2.3k
  • paperless-ngx/paperless-ngx

    A community-supported supercharged document management system: scan, index and archive all your documents

    Language:Python32.1k1211.9k2k
  • OCRmyPDF

    ocrmypdf/OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Language:Python31.2k1921.3k2.2k
  • PDFMathTranslate

    Byaidu/PDFMathTranslate

    PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

    Language:Python27.4k796692.4k
  • hehonghui/awesome-english-ebooks

    经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新

    Language:CSS25.7k65702.1k
  • Awesome-CV

    posquit0/Awesome-CV

    :page_facing_up: Awesome CV is LaTeX template for your outstanding job application

    Language:TeX25.3k2162895.1k
  • forthespada/CS-Books

    🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~

  • koodo-reader

    koodo-reader/koodo-reader

    A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web

    Language:JavaScript23.8k1211.2k1.8k
  • koreader/koreader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices

    Language:Lua23.2k3196.7k1.5k
  • ether/etherpad-lite

    Etherpad: A modern really-real-time collaborative document editor.

    Language:TypeScript17.7k3523.2k3k
  • salomonelli/best-resume-ever

    :necktie: :briefcase: Build fast :rocket: and easy multiple beautiful resumes and create your best CV ever! Made with Vue and LESS.

    Language:Vue16.4k3131402.3k
  • diegomura/react-pdf

    📄 Create PDF files using React

    Language:TypeScript16k992k1.3k
  • mayooear/ai-pdf-chatbot-langchain

    AI PDF chatbot agent built with LangChain & LangGraph

    Language:TypeScript16k1523003.2k
  • sumatrapdfreader/sumatrapdf

    SumatraPDF reader

    Language:C15.4k3163.6k1.8k
  • janishar/mit-deep-learning-book-pdf

    MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

    Language:Java13.6k411182.8k
  • xournalpp/xournalpp

    Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.

    Language:C++13.4k1073.8k936
  • kekingcn/kkFileView

    Universal File Online Preview Project based on Spring-Boot

    Language:Java13.3k653853.1k
  • QuestPDF/QuestPDF

    Generate and edit PDF documents in your .NET applications using the open-source QuestPDF library and its C# Fluent API. Build invoices, reports and data exports with ease.

    Language:C#13.3k96722708
  • Unstructured-IO/unstructured

    Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

    Language:HTML12.7k681.2k1k
  • readest

    readest/readest

    Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

    Language:TypeScript12.5k22318670
  • h2oai/h2ogpt

    Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

    Language:Python11.9k1571.2k1.3k
  • Zettlr

    Zettlr/Zettlr

    Your One-Stop Publication Workbench

    Language:TypeScript11.8k893.4k726
  • getomni-ai/zerox

    OCR & Document Extraction using vision models

    Language:TypeScript11.8k5382804
  • documenso

    documenso/documenso

    The Open Source DocuSign Alternative.

    Language:TypeScript11.6k425822k
  • hmemcpy/milewski-ctfp-pdf

    Bartosz Milewski's 'Category Theory for Programmers' unofficial PDF and LaTeX source

    Language:TeX11.3k239170619
  • libvips/libvips

    A fast image processing library with low memory needs.

    Language:C10.7k1392.4k715
  • 0voice/expert_readed_books

    2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,**类,数学类,人物传记书籍

  • wojtekmaj/react-pdf

    Display PDFs in your React app as easily as if they were images.

    Language:TypeScript10.5k581.2k957
  • wmjordan/PDFPatcher

    PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等

    Language:C#10.3k982021.3k
  • rnote

    flxzt/rnote

    Sketch and take handwritten notes.

    Language:Rust10.3k56816393
  • gotenberg/gotenberg

    A developer-friendly API for converting numerous document formats into PDF files, and more!

    Language:Go10k70682673