Pinned Repositories
falcontune
Tune any FALCON in 4-bit
fiddler
Fast Inference of MoE Models with CPU-GPU Orchestration
landmark-attention
llm-autoeval
Automatically evaluate your LLMs in Google Colab
lm-evaluation-harness
A framework for few-shot evaluation of language models.
mpttune
Tune MPTs
rmihaylov's Repositories
rmihaylov/falcontune
Tune any FALCON in 4-bit
rmihaylov/mpttune
Tune MPTs
rmihaylov/fiddler
Fast Inference of MoE Models with CPU-GPU Orchestration
rmihaylov/landmark-attention
rmihaylov/llm-autoeval
Automatically evaluate your LLMs in Google Colab
rmihaylov/lm-evaluation-harness
A framework for few-shot evaluation of language models.