hadican's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
jgm/pandoc
Universal markup converter
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
keycloak/keycloak
Open Source Identity and Access Management For Modern Applications and Services
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
authelia/authelia
The Single Sign-On Multi-Factor portal for web apps
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
goauthentik/authentik
The authentication glue you need.
supertokens/supertokens-core
Open source alternative to Auth0 / Firebase Auth / AWS Cognito
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
ory/kratos
The most scalable and customizable identity server on the market. Replace your Homegrown, Auth0, Okta, Firebase with better UX and DX. Has all the tablestakes: Passkeys, Social Sign In, Multi-Factor Auth, SMS, SAML, TOTP, and more. Written in Go, cloud native, headless, API-first. Available as a service on Ory Network and for self-hosters.
apereo/cas
Apereo CAS - Identity & Single Sign On for all earthlings and beyond.
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
ory/keto
The most scalable and customizable permission server on the market. Fix your slow or broken permission system with Google's proven "Zanzibar" approach. Supports ACL, RBAC, and more. Written in Go, cloud native, headless, API-first. Available as a service on Ory Network and for self-hosters.
deepdoctection/deepdoctection
A Repo For Document AI
kdeldycke/awesome-iam
👤 Identity and Access Management knowledge for cloud platforms
ibm-aur-nlp/PubLayNet
poloclub/unitable
UniTable: Towards a Unified Table Foundation Model