huzhiguan's Stars
xingshaocheng/architect-awesome
Technology map for backend architects
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
starship/starship
☄🌌️ The minimal, blazing-fast, and infinitely customizable prompt for any shell!
v2ray/v2ray-core
A platform for building proxies to bypass network restrictions.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
imarvinle/awesome-cs-books
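🔥 A comprehensive collection of classic programming books, covering computer systems and networking, system architecture, algorithms and data structures, front-end development, back-end development, mobile development, databases, testing, projects and teams, professional development for programmers, job interviews, and more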
apache/arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
ruanyf/free-books
Free books available on the internet
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
bloomrpc/bloomrpc
Former GUI client for gRPC services. No longer maintained.
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
elsa-workflows/elsa-core
A .NET workflows library
NMAC427/SwiftOCR
Fast and simple OCR library written in Swift
Xabaril/AspNetCore.Diagnostics.HealthChecks
Enterprise HealthChecks for ASP.NET Core Diagnostics Package
kermitt2/grobid
A machine learning software for extracting information from scholarly documents
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
atlanhq/camelot
Camelot: PDF Table Extraction for Humans
camelot-dev/camelot
A Python library to extract tabular data from PDFs
luizdepra/hugo-coder
A minimalist blog theme for hugo.
WenmuZhou/OCR_DataSet
A collection of OCR datasets, organized with a unified annotation format for use in experiments
hikopensource/DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
lidangzzz/Best-Practice-for-Building-A-Startup-in-Delaware-with-Tech-Tools
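A white paper on quickly handling company formation, banking, taxes, and investment for Chinese tech entrepreneurs in the United States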
allenai/pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
xLightsSequencer/xLights
xLights is a sequencer for lights with USB and E1.31 drivers. You can create sequences in this object-oriented program, build playlists, schedule them, test your hardware, and convert between different sequencers.
GowenGit/docnet
DocNET is a fast PDF editing and reading library for modern .NET applications
openzipkin/zipkin4net
A .NET client library for Zipkin
elifesciences/sciencebeam-parser
A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document.
probcomp/PClean
A domain-specific probabilistic programming language for scalable Bayesian data cleaning
xylcbd/ocr-open-dataset
A list of open datasets for OCR.
xigt/freki
Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)