anarchy-ai/llm-speed-benchmark

Benchmarking tool for assessing LLM performance across different hardware

Python · MIT

Issues

question
#7 opened 8 months ago by geraldstanje1
0
Add option to change dtype
#4 opened a year ago by MehmetMHY
1
4bit/Llama-2-7b-chat-hf failed to run
#3 opened a year ago by MehmetMHY
1