llStringll
Agents that can 'verb'-like humans | Studying loss landscape | Causal inference for learning agents
llStringll's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
ggerganov/llama.cpp
LLM inference in C/C++
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
karpathy/llm.c
LLM training in simple, raw C/CUDA
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
HigherOrderCO/Bend
A massively parallel, high-level programming language
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
abetlen/llama-cpp-python
Python bindings for llama.cpp
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for ML & LLM systems
Netflix/maestro
Maestro: Netflix’s Workflow Orchestrator
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
unslothai/hyperlearn
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Agenta-AI/agenta
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
AudDMusic/RedditBot
Music recognition bot for Reddit powered by audd.io
salykova/matmul.c
Fast, Multi-threaded Matrix Multiplication in C
johnBuffer/Pendulum-NEAT
encord-team/text-to-image-eval
Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.