mingyangAbc's Stars
photoprism/photoprism
AI-Powered Photos App for the Decentralized Web 🌈💎✨
opendatalab/MinerU
A high-quality, one-stop, open-source data extraction tool for converting PDF to Markdown and JSON.
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
unclecode/crawl4ai
🚀🤖 Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
fishaudio/fish-speech
SOTA Open Source TTS
DS4SD/docling
Get your documents ready for gen AI
jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing. Also supports speech recognition and transcription, speech synthesis, and subtitle translation.
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
getomni-ai/zerox
PDF to Markdown with vision models
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
microsoft/OmniParser
A simple screen parsing tool for pure vision-based GUI agents
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
severian42/GraphRAG-Local-UI
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
itsOwen/CyberScraper-2077
A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama
tomasonjo/blogs
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
memfreeme/memfree
MemFree - Hybrid AI Search Engine & AI Page Generator
VikParuchuri/texify
Math OCR model that outputs LaTeX and markdown
opendatalab/labelU
Data annotation toolbox that supports image, audio, and video data.
OpenSPG/openspg
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constrained knowledge modeling, 2) facts and logic fused representation, 3) natively support KAG...
kijai/ComfyUI-Florence2
Run inference with the Microsoft Florence2 VLM
THUDM/AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
gotonote/Autopilot-Notes
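Autonomous driving notes that break down the key concepts of each module and bring together leading industry solutions, written to help the author and any readers who need them. Covers deep learning, driverless vehicles, BEV, Transformer, ADAS, CVPR, Tesla AI Day, large models, ChatGPT, and more.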
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
opendatalab/magic-doc
ranpox/awesome-computer-use
This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.
mayubo2333/MMLongBench-Doc
Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
CycloneBoy/pdf_table
A Unified Toolkit for Deep Learning-Based Table Extraction