scanned-documents

There are 48 repositories under the scanned-documents topic.

  • papermerge

    ciur/papermerge

    Open Source Document Management System for Digital Archives (Scanned Documents)

    Language:Python2.6k51502269
  • 4lex4/scantailor-advanced

    ScanTailor Advanced merges the features of ScanTailor Featured and ScanTailor Enhanced, and adds new features and fixes.

    Language:C++1.2k65180130
  • Udayraj123/OMRChecker

    Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.

    Language:Python79226107329
  • ahmetozlu/signature_extractor

    A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

    Language:Python4631510142
  • ad-si/awesome-scanning

    A curated list of awesome projects to simplify and improve paper and document scanning.

  • susam/tucl

    The first-ever paper on the Unix shell, written by Ken Thompson in 1976, scanned, transcribed, and redistributed with permission

    Language:Makefile36112121
  • papermerge/papermerge-core

    Papermerge DMS core backend, REST API server, and frontend UI

    Language:Python313116962
  • brakmic/OpenCV

    📷 Computer-Vision Demos

    Language:C#26621055
  • ispras/dedoc

    Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

    Language:Python197122222
  • atgreen/paperless

    Emacs-assisted PDF document filing

    Language:Emacs Lisp13491511
  • karolzak/boxdetect

    BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.

    Language:Python10592320
  • apurvmishra99/pdf-to-scan

    Make your PDFs look like they were scanned

    Language:Python84327
  • beast/react-native-scan-doc

    A document scanner that automatically trims the edge with perspective transform

    Language:Java40649
  • ApryseSDK/pdftron-android-ocr-scanner-sample

    Android Scanner with OCR support using PDFTron

    Language:Kotlin35808
  • maxim2266/go-ocr

    A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

    Language:Go34418
  • baltpeter/scanprep

    Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.

    Language:Python323311
  • NjoyimPeguy/ICDAR-2019-RRC-SROIE

    ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

    Language:Python324410
  • papermerge/documentation

    Documentation for Papermerge DMS - Installation, Help, User Manual, REST API

    Language:HTML14456
  • goodday451999/Character-Segmentation-of-Scanned-Text

    Segmentation of Scanned Text up to Character Level

    Language:Python11006
  • AdroitAnandAI/Multilingual-Text-Inversion-Detection-of-Scanned-Images

    Efficient Text Localization Algorithm, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and Traditional Computer Vision.

    Language:Python8200
  • skconan/Scanned-Document-Rotation-Correction

    The project creates the models and service API for predicting the angle of scanned document images, ranging from -90° to 90° from the vertical.

    Language:Python7101
  • dsabarinathan/DocumentTableSeg

    Implementation of scanned document table segmentation with U-net

    Language:Python6102
  • timberger/Searchable-Image-PDF-Creat-O-Mat

    This batch script creates a searchable PDF from a PDF with one or more scanned pages that contain images.

    Language:Batchfile6200
  • imakashsahu/Images-or-Scanned-Documents-into-Searchable-PDFs

    A Flask-based project to convert images, scanned documents, or multi-page PDFs into searchable PDFs

    Language:CSS5101
  • papermerge/papermerge-cli

    Papermerge DMS command line utility

    Language:Python5183
  • svitlana1209/OCR-search

    Searching for a text using OCR, detection and recognition of tables in scanned documents.

    Language:Python5101
  • vijayengineer/PDFTextSpeechConverter

    Converts scanned documents and ordinary documents into speech mp3 using Amazon Polly

    Language:Python5101
  • binDebug3/scanner_automation

    A program to automate simple and repetitive tasks while scanning documents by Dallin Stewart

    Language:Python4200
  • deckerego/docmag

    The web UI for Facile Search. Together with DocIndex, this UI can help you search the myriad of scanned documents you have been accumulating over the years. Using the power of Docker & Elasticsearch you can run a powerful search engine that lets you convert scanned (image-based) PDFs to searchable text, group documents by letterhead, run fuzzy searches by date and view document metadata.

    Language:Groovy42150
  • milahu/document-photo-auto-threshold

    Auto-correct contrast and brightness of photographed documents

    Language:Python330
  • scan

    hacker-or-id/scan

    {{scan|tools|software|headware|progress|open|template|log|log|log|softwaretool|}}{[[:wikt:Scan|log scan]]}. #[[:wikt:log scan|log copyright]]. *[[:wikt:log is log|log]]. *[[:wikt:log scan|txt]]. *[[:wikt:log scan|png]]. *[[:wikt:log scan|image image image/category user/category is /category talkname/category username/category done/category in progress/category open]].

  • Hawk453/OCR_FOR_PDFS

    Optical Character Recognition for Scanned Documents

    Language:Python2100
  • hnjm/papermerge

    Open Source Document Management System for Digital Archives (Scanned Documents)

    Language:Python210
  • MaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

    This repository contains automation solutions that efficiently extract text from scanned PDF documents with consistent layouts. Using the Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.

  • paulveillard/cybersecurity-internet-scanning

    An ongoing, curated collection of awesome software best practices and techniques, libraries and frameworks, e-books and videos, websites, blog posts, links to GitHub repositories, technical guidelines, and important resources about Internet Scanning in Cybersecurity

  • PkuCuipy/scanned-doc-vectorizer

    Vectorize scanned documents into PDFs with a data-driven approach.

    Language:Python1100