llm-optimization

There are 5 repositories under llm-optimization topic.

  • jxzhangjhu/Awesome-LLM-Prompt-Optimization

    Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

  • villagecomputing/superpipe

    Superpipe - optimized LLM pipelines for structured data

    Language:Python108123
  • romsto/Speculative-Decoding

    Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.

    Language:Python78228
  • doramirdor/Nadir

    Nadir is a Python package designed to dynamically choose the best llm for your prompt by balancing complexity and cost and response time.

    Language:Python40222
  • BjornMelin/llm-gpu-optimization

    🚄 Advanced LLM optimization techniques using CUDA. Features efficient attention mechanisms, custom CUDA kernels for transformers, and memory-efficient training strategies. ⚡