cmcmaster1's Stars
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
Aider-AI/aider
aider is AI pair programming in your terminal
phidatahq/phidata
Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
supermemoryai/supermemory
Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the Chrome extension.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
google/mesop
Rapidly build AI apps in Python
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
PatrickJS/awesome-cursorrules
📄 A curated list of awesome .cursorrules files
cohere-ai/cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
nus-apr/auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench lite and 46.2% of tasks (pass@1) in SWE-bench verified, with each task costing less than $0.7.
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
codelion/optillm
Optimizing inference proxy for LLMs
qnguyen3/chat-with-mlx
An all-in-one LLM chat UI for Apple Silicon Macs using the MLX framework.
johnmai-dev/ChatMLX
🤖✨ ChatMLX is a modern, open-source, high-performance chat application for macOS based on large language models.
vgel/repeng
A library for making RepE control vectors
microsoft/TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
arcee-ai/DistillKit
An Open Source Toolkit For LLM Distillation
EleutherAI/sae
Sparse autoencoders
armbues/SiLLM
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
willccbb/mlx_parallm
Fast parallel LLM inference for MLX
DataformerAI/dataformer
Solving data for LLMs - Create quality synthetic datasets!
Xalp/ECHO
Official homepage for "Self-Harmonized Chain of Thought"
sumo43/moondream-mlx
andrewgph/streaming-whisper
Techniques for faster streaming Whisper ASR in MLX