AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

PythonApache-2.0

Issues

Does Dataflow work with JetStream?
#146 opened 19 days ago by salaki
0
Try Google Opinion Rewards باربری یزد 09137236592
#145 opened 20 days ago by barbaryyazd09133545880
0
Ads.google.com
#144 opened 20 days ago by barbaryyazd09133545880
0
Question: `prometheus_port` flag for pytorch server
#143 opened a month ago by JeffLuoo
0
باربری یزد ۰۹۱۳۳۵۴۵۸۸۰
#142 opened a month ago by barbaryyazd09133545880
0
باربری یزد 09133545880
#141 opened a month ago by barbaryyazd09133545880
0
Support using models from HuggingFace directly
#140 opened a month ago by samos123
0
باربری نیسان یزد 09133545880
#139 opened a month ago by barbaryyazd09133545880
0
Understanding the intuition behind `request-rate`
#137 opened 2 months ago by hosseinsarshar
0
Support completions API
#135 opened 2 months ago by nstogner
0
Clean up Model Conversion Script
#131 opened 3 months ago by yeandy
2
when to support gpu?
#120 opened 4 months ago by Mddct
1
Remove jax dependencies in JetStream
#88 opened 6 months ago by FanhaiLu1
0
Add np padding support
#55 opened 6 months ago by FanhaiLu1
1
Support I/O with text and token ids
#79 opened 6 months ago by JoeZijunZhou
2
Refactor jestream to allow different tokenizers
#45 opened 7 months ago by qihqi
1
Detokenize error
#64 opened 6 months ago by yeandy
2
Benchmark serving: Failed to connect to remote host
#58 opened 6 months ago by yeandy
1
float division by zero in benchmark
#61 opened 7 months ago by FanhaiLu1
2
Support on Huggingface transformers
#44 opened 7 months ago by ImKeTT
2
Error with mutable list value in dataclass
#57 opened 7 months ago by yeandy
1
CogVLM support
#46 opened 7 months ago by BitPhinix
1
Feature request: improve documentation
#14 opened 8 months ago by OhadRubin
5