IBM/text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
PythonApache-2.0
Issues
- 0
Problem loading granite-3b in small MIG partitions
#104 opened by ccamacho - 2
Official container image
#102 opened by josephrocca - 0
- 0
Generation does not terminate with EOS token for the vinai/PhoGPT-4B-Chat model
#91 opened by tjohnson31415 - 2
deepseek-coder-33b-instruct model on tgis fails with flash attention and generates wrong output without flash attention
#92 opened by maxdebayser - 0
- 0
Doc Request: PREFIX_STORE_PATH in README
#19 opened by gabe-l-hart - 0
- 7
TGIS build requires PyTorch nightly
#6 opened by Xaenalt - 4
- 2
- 2
- 1
- 1
- 1