PoHaoYen's Stars
Jikai0Wang/OPT-Tree
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
goliaro/specinfer-ae
NJUNLP/MCSD
Multi-Candidate Speculative Decoding
hemingkx/SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
romsto/Speculative-Decoding
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
lucidrains/speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
uw-mad-dash/decoding-speculative-decoding
lol0963332320/ICLAB
zhijs/-Reinforcement-Learning-five-in-a-row-
Human-vs-computer Gomoku (five-in-a-row) based on DQN
zhiyiYo/Alpha-Gobang-Zero
A gobang robot based on reinforcement learning.
ultralytics/ultralytics
Ultralytics YOLO11 🚀
Entropy-xcy/bitnet158
Beomi/BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
joey00072/ohara
Collection of autoregressive model implementations
cnclabs/smore
SMORe: Modularize Graph Embedding for Recommendation