scanned-documents

There are 48 repositories under scanned-documents topic.

ciur/papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Language:Python2.6k 51 502269
4lex4/scantailor-advanced
ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
Language:C++1.2k 65 180130
Udayraj123/OMRChecker
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
Language:Python792 26 107329
ahmetozlu/signature_extractor
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Language:Python463 15 10142
ad-si/awesome-scanning
A curated list of awesome projects to simplify and improve paper and document scanning.
417 14 3023
susam/tucl
The first-ever paper on the Unix shell written by Ken Thompson in 1976 scanned, transcribed, and redistributed with permission
Language:Makefile361 12 121
papermerge/papermerge-core
Papermerge DMS core backend, REST API server, and frontend UI
Language:Python313 11 6962
brakmic/OpenCV
:camera: Computer-Vision Demos
Language:C#266 21 055
ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Language:Python197 12 2222
atgreen/paperless
Emacs-assisted PDF document filing
Language:Emacs Lisp134 9 1511
karolzak/boxdetect
BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.
Language:Python105 9 2320
apurvmishra99/pdf-to-scan
Make your PDFs look like they were scanned
Language:Python84 3 27
beast/react-native-scan-doc
A document scanner that automatically trims the edge with perspective transform
Language:Java40 6 49
ApryseSDK/pdftron-android-ocr-scanner-sample
Android Scanner with OCR support using PDFTron
Language:Kotlin35 8 08
maxim2266/go-ocr
A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.
Language:Go34 4 18
baltpeter/scanprep
Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.
Language:Python32 3 311
NjoyimPeguy/ICDAR-2019-RRC-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Language:Python32 4 410
papermerge/documentation
Documentation for Papermerge DMS - Installation, Help, User Manual, REST API
Language:HTML14 4 56
goodday451999/Character-Segmentation-of-Scanned-Text
Segmentation of Scanned Text upto Character Level
Language:Python11 0 06
AdroitAnandAI/Multilingual-Text-Inversion-Detection-of-Scanned-Images
Efficient Text Localization Algorithm, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and Traditional Computer Vision.
Language:Python8 2 00
skconan/Scanned-Document-Rotation-Correction
The project creates the models and service API for predicting scanned document images' angles ranging between -90° to 90° from the vertical.
Language:Python7 1 01
dsabarinathan/DocumentTableSeg
Implementation of scanned document table segmentation with U-net
Language:Python6 1 02
timberger/Searchable-Image-PDF-Creat-O-Mat
This batch script creates a searchable PDF of a PDF with one or more scanned pages which contain images.
Language:Batchfile6 2 00
imakashsahu/Images-or-Scanned-Documents-into-Searchable-PDFs
This is a Flask Based Project to convert Images, Scanned Documents or Multiple Page PDF into Searchable PDF
Language:CSS5 1 01
papermerge/papermerge-cli
Papermerge DMS command line utility
Language:Python5 1 83
svitlana1209/OCR-search
Searching for a text using OCR, detection and recognition of tables in scanned documents.
Language:Python5 1 01
vijayengineer/PDFTextSpeechConverter
Converts scanned documents and ordinary documents into speech mp3 using Amazon Polly
Language:Python5 1 01
binDebug3/scanner_automation
A program to automate simple and repetitive tasks while scanning documents by Dallin Stewart
Language:Python4 2 00
deckerego/docmag
The web UI for Facile Search. Together with DocIndex, this UI can help you search the myriad of scanned documents you have been accumulating over the years. Using the power of Docker & Elasticsearch you can run a powerful search engine that lets you convert scanned (image-based) PDFs to searchable text, group documents by letterhead, run fuzzy searches by date and view document metadata.
Language:Groovy4 2 150
milahu/document-photo-auto-threshold
auto-correct contrast and brightness of photographed document
Language:Python3 3 0
hacker-or-id/scan
{{scan|tools|software|headware|progress|open|template|log|log|log|softwaretool|}}{[[:wikt:Scan|log scan]]}. #[[:wikt:log scan|log copyright]]. *[[:wikt:log is log|log]]. *[[:wikt:log scan|txt]]. *[[:wikt:log scan|png]]. *[[:wikt:log scan|image image image/category user/category is /category talkname/category username/category done/category in progress/category open]]. -------------------------------------------------------------------------------------------------------------
2 1 00
Hawk453/OCR_FOR_PDFS
Optical Character Recognition for Scanned Documents
Language:Python2 1 00
hnjm/papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Language:Python2 1 0
MaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA
This repository contains automation solutions that efficiently extracts text from scanned PDF documents with consistent layouts. Utilizing Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.
2 1 01
paulveillard/cybersecurity-internet-scanning
An ongoing & curated collection of awesome software best practices and techniques, libraries and frameworks, E-books and videos, websites, blog posts, links to github Repositories, technical guidelines and important resources about Internet Scanning in Cybersecurity
1 2 0
PkuCuipy/scanned-doc-vectorizer
Vectorize scanned documents into PDFs with a data-driven approach.
Language:Python1 1 00

scanned-documents

ciur/papermerge

4lex4/scantailor-advanced

Udayraj123/OMRChecker

ahmetozlu/signature_extractor

ad-si/awesome-scanning

susam/tucl

papermerge/papermerge-core

brakmic/OpenCV

ispras/dedoc

atgreen/paperless

karolzak/boxdetect

apurvmishra99/pdf-to-scan

beast/react-native-scan-doc

ApryseSDK/pdftron-android-ocr-scanner-sample

maxim2266/go-ocr

baltpeter/scanprep

NjoyimPeguy/ICDAR-2019-RRC-SROIE

papermerge/documentation

goodday451999/Character-Segmentation-of-Scanned-Text

AdroitAnandAI/Multilingual-Text-Inversion-Detection-of-Scanned-Images

skconan/Scanned-Document-Rotation-Correction

dsabarinathan/DocumentTableSeg

timberger/Searchable-Image-PDF-Creat-O-Mat

imakashsahu/Images-or-Scanned-Documents-into-Searchable-PDFs

papermerge/papermerge-cli

svitlana1209/OCR-search

vijayengineer/PDFTextSpeechConverter

binDebug3/scanner_automation

deckerego/docmag

milahu/document-photo-auto-threshold

hacker-or-id/scan

Hawk453/OCR_FOR_PDFS

hnjm/papermerge

MaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

paulveillard/cybersecurity-internet-scanning

PkuCuipy/scanned-doc-vectorizer