Kouuh

Kouuh's Stars

opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。
Language:Python22.2k 116 7821.6k
Byaidu/PDFMathTranslate
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/Docker
Language:Python11.7k 52 280850
onyx-dot-app/onyx
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
Language:Python11.3k 105 5851.4k
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Language:Python5.9k 36 81475
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
4k 120 63639
johnlui/PPHC
📙《高并发的哲学原理》开源图书（CC BY-NC-ND）https://pphc.lvwenhan.com
Language:Rust3.8k 38 14343
huggingface/smol-course
A course on aligning smol models.
Language:Jupyter Notebook3.6k 27 281.1k
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Jupyter Notebook2.6k 34 50211
tencentmusic/supersonic
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
Language:Java2.6k 31 598444
rio-labs/rio
WebApps in pure Python. No JavaScript, HTML and CSS needed
Language:Python2.1k 27 16274
datawhalechina/hugging-multi-agent
A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基于MetaGPT的多智能体入门与开发教程
Language:CSS1.4k 139 7199
BinNong/meet-libai
李白 :bust_in_silhouette: 作为唐代杰出诗人，其诗歌作品在**文学史上具有重要地位。近年来，随着数字技术和人工智能的快速发展，传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入，但在数字化、智能化普及方面仍存在不足。因此，本项目旨在通过构建李白知识图谱，结合大模型训练出专业的AI智能体，以生成式对话应用的形式，推动李白文化的普及与推广。
Language:Python1.3k 12 8164
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
Language:Python1.2k 14 2665
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python985 13 1574
multimodal-art-projection/MAP-NEO
Language:Python892 11 3482
datawhalechina/dive-into-cv-pytorch
动手学CV-Pytorch版
Language:Python867 20 11181
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Language:Python699 11 14975
shibing624/ChatPilot
ChatPilot: Chat Agent Web UI，实现Chat对话前端，支持Google搜索、文件网址对话（RAG）、代码解释器功能，复现了Kimi Chat(文件，拖进来；网址，发出来)。
Language:Svelte522 6 1651
cognitivetech/ollama-ebook-summary
LLM for Long Text Summary (Comprehensive Bulleted Notes)
Language:Python451 10 1033
simbianai/taskgen
Task-based Agentic Framework using StrictJSON as the core
Language:Jupyter Notebook441 9 945
cjinhuo/text-search-engine
A text search engine that supports mixed Chinese and English fuzzy search.
Language:TypeScript435 3 516
weavel-ai/Ape
Your first AI prompt engineer
Language:Python352 2 714
wenge-research/YAYI-UIE
雅意信息抽取大模型：在百万级人工构造的高质量信息抽取数据上进行指令微调，由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
278 4 813
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
Language:Python269 6 1020
hzeyuan/x-cards
Easy share X anywhere,in any format
Language:TypeScript208 2 620
IDEA-Research/ChatRex
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Language:Python118 3 63
iOPENCap/awesome-remote-image-captioning
A list of awesome remote sensing image captioning resources
Language:Python95 3 01
shawnh2/QA-CivilAviationKG
基于民航业知识图谱的自动问答系统
Language:Python90 1 523
xverse-ai/XVERSE-V-13B
Language:Python78 4 64
WangHelin1997/Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition
Language:Python31 4 04