eramax's Stars
Mozer/talk-llama-fast
Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip
electric-sql/postgres-wasm
electric-sql/pglite
Lightweight WASM Postgres with real-time, reactive bindings.
OpenCodeInterpreter/OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
HelgeSverre/ollama-gui
A Web Interface for chatting with your local LLMs via the ollama API
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
rustdesk/rustdesk
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
vgel/repeng
A library for making RepE control vectors
huggingface/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
cognitivetech/ollama-ebook-summary
LLM for Long Text Summary (Comprehensive Bulleted Notes)
euclaise/supertrainer2000
ContextualAI/gritlm
Generative Representational Instruction Tuning
multiplexerai/Complex-to-Simple-RAG
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
csurfer/pyheat
pprofile + matplotlib = Python program profiled as an awesome heatmap!
google/magika
Detect file content types with deep learning
Vahe1994/AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression https://arxiv.org/abs/2405.14852
theroyallab/tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
mvdan/sh
A shell parser, formatter, and interpreter with bash support; includes shfmt
stas00/ml-engineering
Machine Learning Engineering Open Book
Nutlope/notesGPT
Record voice notes & transcribe, summarize, and get tasks
silphendio/sliced_llama
Simple LLM inference server
r-three/phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
leptonai/leptonai
A Pythonic framework to simplify AI service building
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
vosen/ZLUDA
CUDA on non-NVIDIA GPUs
Vaibhavs10/fast-llm.rs