textract
There are 92 repositories under textract topic.
srcecde/aws-tutorial-code
AWS tutorial code.
danthelion/doc2audiobook
Convert text documents to high fidelity audio(books).
aeksco/aws-pdf-textract-pipeline
:mag: Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
likerRr/code4goal-resume-parser
Solution for Code4Goal challenge
simonw/s3-ocr
Tools for running OCR against files stored in S3
NanoNets/ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
mylukin/Textractor
一个高效的从HTML中提取正文的类库。An efficient class library for extracting text from HTML.
fourdigits/wagtail_textract
Text extraction for Wagtail document search
sergiocorreia/quipucamayoc
dev repo for article
Mkranj/PapersCited
List all unique citations in your document
muhimasri/aws-textract-helper
Aws Textract Helper
AvinashDalvi89/list-of-AWS-kickstart-projects
Learn AWS by Doing: Project Ideas
shanuhalli/Project-Resume-Classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
aws-samples/mask-words-in-image
A tool that can mask words that match regular expression, keywords or PII (Personally Identifiable Information) in an image file.
onify/blueprint-aws-textract-pdf-to-form
Onify Blueprint: Amazon AWS Textract - PDF to form example
RocktimRajkumar/ATS
:trophy: An applicant tracking system (ATS) is a software application that enables the electronic handling of recruitment and hiring needs. Corporate recruiters or hiring managers can then search and sort through the resumes in a number of ways, depending on the needs
t04glovern/aws-textract-adoption-forms
Using Serverless to consume and processing WA Animals adoption forms using Amazon Textract and placing that data in DynamoDB
Bmitch44/textract-demo
This repository is a demo for using AWS Textract to get data from scanned pdf files
slub/textract2page
Convert AWS Textract JSON to PRImA PAGE XML
AWS-HumanInTheLoop/TabularDocumentDigitization
Human Reviewed Tabular Document Digitization with Amazon Textract and Amazon A2I
aws-samples/aws-textract-e2e-processing
This repo contains all the code required to do an IDP solution on AWS from document splitting, classification to extraction.
aws-samples/winform-amazon-bedrock-document-bot
A conversational document bot Windows Forms desktop application that allows users to upload PDF or Word files and ask questions about their content, with the bot keeping track of the conversation history and providing contextual responses based on the whole conversation.
build-on-aws/aiml-like-api-in-your-app
Sample code for adding AI/ML services to your app
edelgm6/ledger
Personal accounting tool with Django backend, HTMX+Alpine frontend, and AWS Textract
hupe1980/go-textractor
📄 Amazon textract response parser written in go.
manuel-lang/Autonomous-Semantic-Search-Engine
Submission for HackDataKIBots 2018 - Web crawler combined with document analysis
muhimasri/aws-textract-app
Convert an image to an HTML form using Amazon Textract and NodeJS
jurest82/Captcha
This repository contains a Python implementation to solve captchas using AWS Textract
RodrigoRVieira/theHunterCOTWCompanion
This repository mantains the Visual Studio solution used to build the COTWOCRConsole application that works as companion to track harvests during theHunter COTW game :)
sakshi360/Medi-Scanner
This is the repo for submission in AWS Health AI Hackathon hosted on Devpost.
simonkeng/pdf_parser
Textual & numeric data extraction with Python using textract, easily shareable with Docker.
Devanshu-17/HackScript-Hackathon
AI-powered Invoice and Form Label-Fields Extraction for Document Management using OpenAI & Hugging Face Transformers
MoinDalvs/Resume_Classification
Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention
rauanisanfelice/aws-textract
:robot: Ferramenta que lê os arquivos PDFs, realiza OCR e salva em JSON.
rbsathish/amazon_textract
Extracting text,form,table using Textract