hanliu9574's Stars
deepseek-ai/awesome-deepseek-integration
SamuelSchmidgall/AgentLaboratory
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.gg/jM3Z6M9uMq
sgl-project/sgl-learning-materials
Materials for learning SGLang
Open-Trader/opentrader
🤖 Open-source crypto trading bot | 📈 DCA & GRID strategies | ✨ UI | ⭐ Star to support the project!
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
zilliztech/VectorDBBench
VectorDBBench is a benchmark designed to compare the performance and cost-effectiveness of popular vector databases.
microsoft/RAG_Hack
Hack Together: RAG Hack | Register, Learn, Hack
showlab/computer_use_ootb
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
SylphAI-Inc/LLM-engineer-handbook
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
rapidsai/cuvs
cuVS - a library for vector search and clustering on the GPU
rapidsai/cudf
cuDF - GPU DataFrame Library
rapidsai/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
RussWong/LLM-engineering
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
karpathy/llm.c
LLM training in simple, raw C/CUDA
SUSYUSTC/MathTranslate
translate scientific papers in latex, especially arxiv papers
chonkie-ai/chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
DS4SD/docling
Get your documents ready for gen AI
mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
microsoft/AI-System
System for AI Education Resource.
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
gpu-mode/awesomeMLSys
An ML Systems Onboarding list