tokenizer-decode's Stars
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
superagent-ai/superagent
🥷 Run AI-agents with an API
a-real-ai/pywinassistant
The first open-source Large Action Model generalist Artificial Narrow Intelligence that fully controls human user interfaces using only natural language. PyWinAssistant builds on "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models."
leloykun/flash-hyperbolic-attention-minimal
Flash Hyperbolic Attention in ~[...] lines of CUDA
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Antlera/nanoGPT-moe
Enable MoE (Mixture of Experts) layers in nanoGPT.
karpathy/llm.c
LLM training in simple, raw C/CUDA
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
moritztng/fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, XL, switch, feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (PPO, DQN), CapsNet, distillation, ... 🧠
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in a single PyTorch file.
xue160709/Local-LLM-User-Guideline
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
stas00/ml-engineering
Machine Learning Engineering Open Book
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
numba/numba
NumPy aware dynamic Python compiler using LLVM