xiaokn's Stars
jiegec/blender-scripts
Some useful Blender scripts
Alibaba-NLP/CoFE-RAG
rhymes-ai/Aria
Codebase for Aria - an Open Multimodal Native MoE
GAIR-NLP/MathPile
[NeurlPS D&B 2024] Generative AI for Math: MathPile
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
yh-hust/PDF-Wukong
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
threedle/text2mesh
3D mesh stylization driven by a text input in PyTorch
buaacyw/MeshAnythingV2
From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
unclecode/crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
Kunhao-Liu/StyleRF
[CVPR 2023] StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields
Kunhao-Liu/StyleGaussian
[SIGGRAPH Asia 2024] StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
CaraJ7/MMSearch
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
OpenSPG/openspg
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constrained knowledge modeling, 2) facts and logic fused representation, 3) KAG will be natively supported soon, so please stay tuned...
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
yash9439/Detectron-Layout-Parser
This code performs PDF layout analysis and optical character recognition (OCR) using the layoutparser library and Tesseract OCR Engine. It detects the layout of a PDF document and extracts text from specific regions. The code is divided into several sections, each serving a specific purpose.
huridocs/pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.
tickstep/aliyunpan
阿里云盘命令行客户端,支持JavaScript插件,支持同步备份功能。
tkfy920/PythonQuantitativeFinance
专注于分享Python在金融领域的应用,欢迎关注微信公众号: Python金融量化 (id:tkfy920)
netease-youdao/QAnything
Question and Answer based on Anything.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
LLM-Red-Team/kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。