Pinned Repositories
maxtext
A simple, performant and scalable Jax LLM!
ai-on-gke
JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
maxdiffusion
maxtext
A simple, performant and scalable Jax LLM!
test-infra
Test infrastructure for the Kubernetes project.
wg-serving
k8sgateway
The Cloud-Native API Gateway and AI Gateway
test-infra
Test infrastructure for the Kubernetes project.
Bslabe123's Repositories
Bslabe123/ai-on-gke
Bslabe123/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Bslabe123/lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Bslabe123/maxdiffusion
Bslabe123/maxtext
A simple, performant and scalable Jax LLM!
Bslabe123/test-infra
Test infrastructure for the Kubernetes project.
Bslabe123/wg-serving