document-ai
There are 29 repositories under the document-ai topic.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
deepdoctection/deepdoctection
A Repo For Document AI
tstanislawek/awesome-document-understanding
A curated list of resources on the topic of Document Understanding (DU)
jpWang/LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
SCUT-DLVCLab/Document-AI-Recommendations
Algorithms, papers, datasets, and performance comparisons for Document AI. Continuously updated.
clovaai/webvicob
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
doc-analysis/ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
nttmdlab-nlp/SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI 2023)
ZeningLin/ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
googleapis/python-documentai-toolbox
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
DunnBC22/Vision_Audio_and_Multimodal_Projects
This repository includes all computer vision, audio, document AI, and multimodal projects.
Unstructured-IO/community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
whn09/table_structure_recognition
Table detection and table structure recognition using YOLOv5
chenxn2020/GOSE
Code for the EMNLP 2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
SCUT-DLVCLab/RFUND
Official release of RFUND introduced in the paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction" (arXiv:2401.03472).
NirmalNagaraj/DocGPT
A chatbot for document analysis.
dhorvay/document-understanding-ebook
(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨
bwnyasse/dart-documentai-samples
A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's DocumentAI.
bhadreshpsavani/SmartOCR-with-LayoutLM
Exploring LayoutLM for Smart OCR Capabilities
wintermi/ocr-runner
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
ajaycode/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
masoudshab/Doc2Edi
Extracting data from PDF documents and converting it to EDI 211 files using GCP and Google Document AI
Purushothaman-natarajan/Custom-NER-Model-using-Spacy-Fine-Tuning
spaCy for key-value pairs
samkenxstream/SamKenX_documents-ai
SamKenX applications and Document AI, the end-to-end document processing platform on Cloud Storage warehouse.
conditionedstimulus/DocumentClassifier
FastAPI application for document classification using a multimodal LayoutLM model, designed to classify PDF documents into RVL-CDIP categories.
marcusmonteirodesouza/google-cloud-document-ai-rest-api-demo
Create an Identity Auto-Filler API with Google Cloud Document AI
OleksiiLatypov/Google_Cloud
AI & Data, Google Cloud Skills Boost
ricardolsmendes/gcp-documentai-custom-extractors
Custom data extractors that use Google Cloud's Document AI