Pinned Repositories
AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications. It primarily provides two frameworks: task-solving and simulation.
Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
ContrastiveDecoding
Contrastive decoding
DeLAMA
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
MATRIX
Implementation of the MATRIX framework (ICML 2024)
MATRIX-Gen
O-LoRA
OpenFedLLM
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ShuoTang123's Repositories
ShuoTang123/MATRIX
Implementation of the MATRIX framework (ICML 2024)
ShuoTang123/MATRIX-Gen
ShuoTang123/DeLAMA
ShuoTang123/O-LoRA
ShuoTang123/ContrastiveDecoding
Contrastive decoding
ShuoTang123/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
ShuoTang123/OpenFedLLM
ShuoTang123/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ShuoTang123/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications. It primarily provides two frameworks: task-solving and simulation.
ShuoTang123/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
ShuoTang123/dclm_shuo
DataComp for Language Models
ShuoTang123/ilya-sutskever-recommended-reading
It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning.
ShuoTang123/llm-continual-learning-survey
ShuoTang123/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
ShuoTang123/MoA
ShuoTang123/MOSS-RLHF
MOSS-RLHF
ShuoTang123/pal
PaL: Program-Aided Language Models (ICML 2023)
ShuoTang123/self-speculative-decoding
Code associated with the paper "Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding"
ShuoTang123/social-media-profile-scrapers
Fetch a user's data across social media platforms