aws/sagemaker-inference-toolkit
Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.
Python · Apache-2.0
Issues
Be able to change SageMaker endpoint log level
#70 opened by oonisim - 0
Is streaming supported?
#137 opened by inf3rnus - 2
Warning: "Calling MMS with mxnet-model-server. Please move to multi-model-server."
#69 opened by mb-dev - 5
Config parsing can be improved
#16 opened by ericangelokim - 0
Add support for downloading inference script and code (entry_point) from s3
#136 opened by urirosenberg - 1
Default output function encodes results to JSON and that seems to add to response latency.
#84 opened by vdantu - 0
Triton container documentation implies py38
#134 opened by david-waterworth - 5
psutil 5.9.6 seems to be throwing ZombieProcess when retrieving the mms process
#132 opened by charlietruong-wk - 0
Tensorflow Inference toolkit
#128 opened by vincentvic - 0
Support for parquet encoder and decoder
#127 opened by lorenzwalthert - 1
OOM errors creating an endpoint for LLMs
#124 opened by jsleight - 0
Support user-defined batch inference logic
#123 opened by dgcnz - 0
stop_server function
#116 opened by Duncan-Haywood - 0
Launch MMS without repackaging model contents
#102 opened by fm1ch4 - 0
Custom `model_fn` function not found when extending the PyTorch inference container
#86 opened by e13h - 0
Long Model Loading times in Multimodel Server
#113 opened by AlexRaschl - 0
Add (and test) Support for Python 3.8
#107 opened by ddluke - 0
Include "Requires-Python" in source distributions
#106 opened by ddluke - 0
JVM detect the CPU count as 1 when more CPUs are available for the container.
#82 opened by amaharek - 5
In a multi-model scenario pass model name as argument to input_fn() and output_fn()
#77 opened by RajeshRamchander - 0
Documentation for creating multi model end point for unsupported algorithms is not clear
#78 opened by raj-vuram - 2
Allow model_fn to get more arguments
#65 opened by ehsanmok - 0
Enhance UX for inference
#64 opened by ehsanmok - 2
logging in JSON format
#59 opened by alext234 - 1
Local development WorkerLifeCycle skip
#41 opened by danielmapar - 3
Ability to define custom log4j properties
#52 opened by arvarik - 0
Update README example
#46 opened by ajaykarpur - 15