premAI-io/benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
ShellMIT
Issues
- 5
Incorrect results reported for TensorRT-LLM
#187 opened by lopuhin - 0
Max Engine by Mojo
#188 opened by Anindyadeep - 1
- 0
aphrodite-engine
#186 opened by Anindyadeep - 1
- 1
For quality checks reference should be taken from actual PyTorch version of model
#162 opened by Anindyadeep - 1
- 1
Improvements on logging and storing results
#157 opened by Anindyadeep - 1
- 1
- 1
- 0
Add model loading time for each benchmarks
#181 opened by Anindyadeep - 0
Add `torch.inference_session` on runner function
#180 opened by Anindyadeep - 1
Complete ML Engines Table
#149 opened by nsosio - 1
Change Main readme
#176 opened by Anindyadeep - 1
Setup new md files from templates
#179 opened by Anindyadeep - 2
Comments on output quality on main README
#166 opened by Anindyadeep - 3
- 3
ONNX benchmark is not running.
#100 opened by Anindyadeep - 2
Additional performance benchmarks metric to give a overall picture of choosing a backend / framework.
#107 opened by Anindyadeep - 1
An Evaluation Dataset for quality benchmarking of different inference engine implementation.
#116 opened by Anindyadeep - 2
- 2
JAX
#81 opened by Anindyadeep - 2
Check if flash attention is supported on the issue or not, and accordingly update on the benchmark specific readme
#83 opened by Anindyadeep - 0
LM Deploy
#147 opened by Anindyadeep - 0
- 1
Check for FP-8 format also for Optimum Nvidia
#115 opened by Anindyadeep - 0
PowerInfer
#105 opened by filopedraz - 1
An optional docker container creation for each.
#103 opened by Anindyadeep - 0
- 0
AirLLM
#87 opened by Anindyadeep - 0
- 0
Mojo
#82 opened by Anindyadeep - 2
- 1
TinyGrad benchmark not working.
#101 opened by Anindyadeep - 0
MLX
#89 opened by Anindyadeep - 0
New engines
#156 opened by ogencoglu - 0
ML Engines
#148 opened by nsosio - 1
Add a readme under each benchmark.
#88 opened by Anindyadeep - 0
- 0
Latest benchmarks not updating.
#124 opened by Anindyadeep - 0
- 1
Include results in main README
#97 opened by filopedraz - 0
Optimum With Nvidia
#92 opened by Anindyadeep - 0
- 0
DeepSpeed FastGen
#110 opened by Anindyadeep - 0
Lightning AI
#80 opened by Anindyadeep - 0
- 1
- 1