llm-optimization
There are 5 repositories under the llm-optimization topic.
jxzhangjhu/Awesome-LLM-Prompt-Optimization
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods for Large Language Models
villagecomputing/superpipe
Superpipe - optimized LLM pipelines for structured data
romsto/Speculative-Decoding
Implementation of the paper "Fast Inference from Transformers via Speculative Decoding" (Leviathan et al., 2023).
doramirdor/Nadir
Nadir is a Python package designed to dynamically choose the best LLM for your prompt by balancing complexity, cost, and response time.
BjornMelin/llm-gpu-optimization
🚄 Advanced LLM optimization techniques using CUDA. Features efficient attention mechanisms, custom CUDA kernels for transformers, and memory-efficient training strategies. ⚡