steptian's Stars
google/magika
Detect file content types with deep learning
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
ymcui/Chinese-Mixtral
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)
ml-explore/mlx-data
Efficient framework-agnostic data loading
ml-explore/mlx-examples
Examples in the MLX framework
ml-explore/mlx
MLX: An array framework for Apple silicon
fetchai/uAgents
A fast and lightweight framework for creating decentralized agents with ease.
dot-agent/nextpy
🤖Self-Modifying Framework from the Future 🔮 World's First AMS
mindsdb/mindsdb
AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
langchain-ai/kork
Natural Language Interfaces Powered by LLMs
Stability-AI/StableCascade
Official Code for Stable Cascade
breezedeus/CnSTD
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
cv-small-snails/Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
ibaiGorordo/ONNX-HAWP-Line-Detection
Python scripts for performing line detection using the HAWP model in ONNX.
dpiresearch/OCR_craft
clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
faustomorales/keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
autonise/CRAFT-Remade
Implementation of CRAFT Text Detection
FreedomIntelligence/Medical_NLP
Medical NLP Competition, dataset, large models, paper
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
tianchiguaixia/medical_ocr_streamlit
该项目主要是为了识别图片里面的表格数据,并将表格数据抽取处理,导出成csv的文件。整个项目会使用streamlit进行部署和展示。使用的技术:paddleocr,PPStructure,streamlit
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Zecat/cardscan
Extract black bordered frame in an image, adjust perspective, crop and rotate.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
xavctn/img2table
img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
TransformerOptimus/SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI