Issues
Error while using pip
#70 opened by avi7611 - 0
Contributions guidelines
#69 opened by miloszwatroba - 5
Divide by zero: request_metrics[common_metrics.REQ_OUTPUT_THROUGHPUT] = num_output_tokens / request_metrics[common_metrics.E2E_LAT]
#55 opened by yaronr - 0
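The assignment quoted in this issue title can raise a ZeroDivisionError whenever the measured end-to-end latency is zero (for example, a failed request). A minimal guard, sketched with hypothetical names that mirror the snippet rather than llmperf's actual internals:

```python
# Hypothetical guard against a zero end-to-end latency. The function name and
# parameters are assumptions mirroring the issue title, not llmperf's real API.
def output_throughput(num_output_tokens: int, e2e_latency_s: float) -> float:
    """Return output tokens per second, or 0.0 when latency is zero
    (e.g. a failed or instantaneous request) instead of dividing by zero."""
    if e2e_latency_s <= 0:
        return 0.0
    return num_output_tokens / e2e_latency_s
```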
Azure OpenAI Endpoint Support
#67 opened by gujju62 - 1
Since it has neither a 'setup.py' nor a 'setup.cfg', it cannot be installed in editable mode (PEP 660)
#47 opened by focusunsink - 0
When max-num-completed-requests is not divisible by num-concurrent-requests, an error will occur.
#63 opened by huangdi614 - 1
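A common fix for the divisibility error described above is to spread the remainder across workers instead of assuming an even split. A sketch under that assumption, with hypothetical names (this is not llmperf's actual scheduling code):

```python
def requests_per_worker(max_completed: int, num_concurrent: int) -> list[int]:
    """Distribute max_completed requests over num_concurrent workers.

    The first (max_completed % num_concurrent) workers each take one extra
    request, so the total is exact even when the counts do not divide evenly.
    """
    base, rem = divmod(max_completed, num_concurrent)
    return [base + (1 if i < rem else 0) for i in range(num_concurrent)]
```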
Subsequent requests cannot be sent until 'num_concurrent_requests' requests have all finished
#56 opened by llsj14 - 1
Sagemaker client issue
#53 opened by SuchethaChintha - 0
Vertex AI API needs to be updated.
#48 opened by Durga2Dash - 1
unable to get the benchmark result
#44 opened by dipshirajput - 0
HUGGINGFACE set
#38 opened by capyun - 0
Bug in counting output tokens
#35 opened by irasin - 0
llmperf not working for concurrent users
#34 opened by nkanike07 - 0
Bug: Hugging Face TGI not working
#33 opened by ptrmayer - 0
Concurrency level is not handled properly
#32 opened by alexeykudinkin - 0
Add memory bandwidth utilization metric
#31 opened by mmcclean-aws - 0
Usage for local models
#29 opened by Akash08naik - 0
Basic usage issue
#24 opened by wangxingjun778 - 5
Are benchmark results released somewhere?
#9 opened by ogencoglu - 0
litellm serializable issue?
#21 opened by ishaan-jaff - 0
Does llmperf support measuring local disk models? What's the meaning of framework in llmperf.py line 355?
#11 opened by zhangjiawei5911