Kouuh's Stars
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Byaidu/PDFMathTranslate
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
onyx-dot-app/onyx
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
johnlui/PPHC
📙《高并发的哲学原理》开源图书(CC BY-NC-ND)https://pphc.lvwenhan.com
huggingface/smol-course
A course on aligning smol models.
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
tencentmusic/supersonic
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
rio-labs/rio
WebApps in pure Python. No JavaScript, HTML and CSS needed
datawhalechina/hugging-multi-agent
A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基于MetaGPT的多智能体入门与开发教程
BinNong/meet-libai
李白 :bust_in_silhouette: 作为唐代杰出诗人,其诗歌作品在**文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在数字化、智能化普及方面仍存在不足。因此,本项目旨在通过构建李白知识图谱,结合大模型训练出专业的AI智能体,以生成式对话应用的形式,推动李白文化的普及与推广。
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
multimodal-art-projection/MAP-NEO
datawhalechina/dive-into-cv-pytorch
动手学CV-Pytorch版
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
shibing624/ChatPilot
ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。
cognitivetech/ollama-ebook-summary
LLM for Long Text Summary (Comprehensive Bulleted Notes)
simbianai/taskgen
Task-based Agentic Framework using StrictJSON as the core
cjinhuo/text-search-engine
A text search engine that supports mixed Chinese and English fuzzy search.
weavel-ai/Ape
Your first AI prompt engineer
wenge-research/YAYI-UIE
雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
hzeyuan/x-cards
Easy share X anywhere,in any format
IDEA-Research/ChatRex
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
iOPENCap/awesome-remote-image-captioning
A list of awesome remote sensing image captioning resources
shawnh2/QA-CivilAviationKG
基于民航业知识图谱的自动问答系统
xverse-ai/XVERSE-V-13B
WangHelin1997/Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition