triton-inference-server/model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Server models.
PythonApache-2.0
Stargazers
- alphapibeta@sair-lab
- anubhav4sachanSamsung
- aroraakshitNVIDIA
- bbshocking
- C5YS
- CS-savvyBlackstraw
- day253Baidu
- dcyoung@scanse @MoffettData @detectlabs
- deadeyegoodwin
- dkozlov
- fabitoHalter
- fimm-lundin
- fly51flyPRIS
- ganlerUniversity of Illinois Urbana-Champaign
- gaocegege@TensorChord
- HINATASDK
- HuaizhengZhanghttps://breezeml.ai/
- huangyz0918Los Angeles, CA
- igobypenn
- ishantbansalHyderabad
- jfsantos@NVIDIA
- jiangplusshenzhen, china
- jishminorArm
- lbin
- onriv
- ossdev-somewhere
- philipp-schmidtIsarsoft GmbH
- shernshiouBasel, Switzerland
- sunhailin-LeoYunLu Tech - J&T Express
- t13m
- ukclivecoxCambridge, UK
- whoisjNVIDIA
- wolegechuStepFUN
- wrchen-voxel
- XiaoPengZongjiaxunfeihong
- xieydd@Tensorchord