raphaelsty's Stars
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
microsoft/pyright
Static Type Checker for Python
Jiayi-Pan/TinyZero
Minimal reproduction of DeepSeek R1-Zero
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
lightpanda-io/browser
Lightpanda: the headless browser designed for AI and automation
arcee-ai/mergekit
Tools for merging pretrained large language models.
gusye1234/nano-graphrag
A simple, easy-to-hack GraphRAG implementation
willccbb/verifiers
Verifiers for LLM Reinforcement Learning
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
Goldziher/kreuzberg
Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
WLiK/LLM4Rec-Awesome-Papers
A list of awesome papers and resources of recommender system on large language model (LLM).
MinishLab/model2vec
Fast State-of-the-Art Static Embeddings
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
beam-cloud/beta9
Ultrafast serverless GPU inference, sandboxes, and background jobs
goodreasonai/ScrapeServ
A self-hosted API that takes a URL and returns a file with browser screenshots.
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
kiwix/kiwix-apple
Kiwix for iOS & macOS
pirate/wikipedia-mirror
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
coree/awesome-rag
A curated list of retrieval-augmented generation (RAG) in large language models
567-labs/systematically-improving-rag
oxideai/mlx-rs
Unofficial Rust bindings to Apple's mlx framework
ivanleomk/kura
Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddings recursively. This helps us understand user behaviour on a higher level without sacrificing PII.
openzim/python-libzim
Libzim binding for Python: read/write ZIM files in Python
mistralai/mistral-evals
KGrewal1/candle-optimisers
A collection of optimisers for use with candle
tomsanbear/candle-einops
nphdang/FS-BBT
Black-box Few-shot Knowledge Distillation
lintool/history-of-open-source-ir-systems
History of Open-Source IR Systems