zhujun5164's Stars
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
SpursGoZmy/Tabular-LLM
本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。
catqaq/OpenTextClassification
OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全面的开源文本分类项目,支持中英双语、多种模型、多种任务。
wang-zhix/seal_generate
Gmgge/TrOCR-Seal-Recognition
基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
jacket230/damai
大麦抢票damai,piao,qiangpiao 余票监控,逆向破解,加密算法,frida,hook,https加解密,app端请求,演唱会,演出,猫眼,票星球pxq,纷玩岛fwd,周杰伦jay,林俊杰 JJ,王嘉尔,伍佰,邓紫棋,杭州,北京,上海,泉州 薛之谦,刘德华,千人q群即将满员,不设二群。
taishan1994/awesome-relation-extraction
关系抽取
gxh27954/damai_requests
大麦网H5、小程序、APP抢票解决
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
zhujun5164/DB_torchvision-deform
fork from github https://github.com/MhLiao/DB. Change the deformconv to torchvision verison, and fix some problom
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
zhanlaoban/EDA_NLP_for_Chinese
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
IDEA-Research/awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
jozhang97/DETA
Detection Transformers with Assignment
HDETR/H-Deformable-DETR
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
subex/STDW
Subex Table Detection Code and Dataset
GXYM/TextPMs
Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022
buptlihang/CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
AILab-UniFI/GNN-TableExtraction
Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"
PaddlePaddle/PaddleHub
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
lucasjinreal/yolovn
Just another yolo variant.