will-thompson-k's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
pgvector/pgvector
Open-source vector similarity search for Postgres
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Lightning-AI/litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
openai/transformer-debugger
yandex/YaLM-100B
Pretrained language model with 100B parameters
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
openai/weak-to-strong
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
pytorch/torchtitan
A native PyTorch library for large model training
EdinburghNLP/awesome-hallucination-detection
List of papers on hallucination detection in LLMs.
google/paxml
Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry-leading model FLOP utilization rates.
kingoflolz/swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer-parallel transformer language models on unreliable, heterogeneous nodes
lucidrains/triton-transformer
Implementation of a Transformer, but completely in Triton
Mihaiii/llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
lucidrains/speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
teknium1/LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
XZhang97666/AlpaCare
hwchase17/chain-of-verification
yisding/litllm