AIApprentice101's Stars
meta-llama/llama
Inference code for Llama models
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
ray-project/ray-llm
RayLLM - LLMs on Ray
persimmon-ai-labs/adept-inference
Inference code for Persimmon-8B