Pinned Repositories
OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page content merging.
llava-test
texture-analysis
The repository includes texture representation, recognition, segmentation and others of texture analysis.
ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
TableGeneration
通过浏览器渲染生成表格图像
LRHstudy's Repositories
LRHstudy/llava-test
LRHstudy/texture-analysis
The repository includes texture representation, recognition, segmentation and others of texture analysis.