wmingkai's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
lencx/ChatGPT
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
jgraph/drawio
draw.io is a JavaScript, client-side editor for general diagramming.
ultralytics/ultralytics
Ultralytics YOLO11 🚀
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image
HarisIqbal88/PlotNeuralNet
LaTeX code for making neural network diagrams
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
judasn/Linux-Tutorial
Linux Through the Eyes of a Java Programmer
open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
aerkalov/ebooklib
Python E-book library for handling books in EPUB2/EPUB3 format
ibm-aur-nlp/PubLayNet
universal-ie/UIE
Unified Structure Generation for Universal Information Extraction
hikopensource/DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
wenwenyu/PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
baaivision/tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
nyu-dl/dl4marco-bert
BDBC-KG-NLP/IE-Survey
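A survey of the information extraction field by the natural language processing research team at Beihang University's Advanced Innovation Center for Big Data. It covers subtasks such as entity recognition, relation extraction, and attribute extraction, and surveys each subtask across both academia and industry.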
clovaai/cord
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
DS4SD/DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
hrwleo/dwnlpinterview
Datawhale NLP interview notes
ayanban011/SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
iscc/mobi
Python-based software to unpack kindlegen-generated ebooks
xrr233/Webformer
SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval
kailigo/cddod
Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"
johnson-magic/Awesome-Document-Layout-Analysis
A curated list of resources dedicated to document layout analysis