shudct's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
AppFlowy-IO/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Tencent/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
doccano/doccano
Open source annotation tool for machine learning practitioners.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
Clouditera/SecGPT
SecGPT网络安全大模型
DevashishPrasad/CascadeTabNet
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
liucongg/NLPDataSet
记录本人整理的一些数据集
lmmlzn/Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
xbdcc/GrabRedEnvelope
微信抢红包Android APP
DS4SD/DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
buptlihang/CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
IBM/SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
RapidAI/RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
xuewenyuan/TGRNet
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
FutureRising007/Table_Structure_Recognition
Table Structure Recognition
PaddlePaddle/EasyData
ZZR8066/SEMv2
X-LANCE/Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
SWHL/ChineseDocumentPDF
中文论文、证券类、财报类PDF数据