AI-Hypercomputer/JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (GPU support is planned for the future; PRs welcome).

Language: Python
License: Apache-2.0

27 Issues
258 Stargazers
19 Watchers

