aidanielson's Stars
Dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
paperless-ngx/paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
RapidAI/RapidOCR
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
hiroi-sora/Umi-OCR
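Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual libraries.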
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
IllinoisLegalAidOnline/docassemble-USCISApplications
A docassemble extension.
jpagh/vscode-docassemble
VS Code Syntax Highlighting for Docassemble YAML (incl. Python, Mako, and Jinja)
reteps/redfin
A Python wrapper around redfin's unofficial API.
SharadKumar97/OSINT-SPY
Performs an OSINT scan on an email, domain, IP address, or organization using OSINT-SPY. It can be used by data miners, infosec researchers, penetration testers, and cybercrime investigators to find in-depth information about their target. If you want to ask something, please feel free to reach out to me at robotcoder@protonmail.com
skconan/Scanned-Document-Rotation-Correction
The project creates the models and service API for predicting the rotation angle of scanned document images, ranging from -90° to 90° from the vertical.
TheAlgorithms/Python
All Algorithms implemented in Python
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
scrapinghub/dateparser
Python parser for human-readable dates
seekr-osint/seekr
A multi-purpose OSINT toolkit with a neat web-interface.
deanmalmgren/textract
extract text from any document. no muss. no fuss.
SuffolkLITLab/docassemble-AssemblyLine
Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-built questions to meet your needs.
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
HazyResearch/legalbench
An open science effort to benchmark legal reasoning in foundation models
jhpyle/docassemble-profileme
A universal user profile that can be reduced to JSON and shared across interviews and platforms.
chrislim2888/IP2Location-Python
This module is a Python Library that enables the user to find the country, region, city, coordinates, zip code, ISP, domain name, timezone, connection speed, IDD code, area code, weather station code, weather station name, mobile, usage types, address type and IAB category that any IP address or host name originates from.
houfu/docassemble-googleTTS
A docassemble interview that performs text to speech with Google Cloud
4lex4/scantailor-advanced
ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions and adds new features and fixes.
freelawproject/juriscraper
An API to scrape American court websites for metadata.
jhpyle/docassemble
A free, open-source expert system for guided interviews and document assembly, based on Python, YAML, and Markdown.
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Lucksi/Mr.Holmes
A Complete OSINT Tool 🔍
mscarey/justopinion
Download client for legal opinions