Pinned Repositories
vertex-ai-nas
With Vertex AI NAS, you can search for optimal neural architectures in terms of accuracy, latency, memory, a combination of these, or a custom metric.
vertex-ai-samples
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vertex-ai-samples
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
xiangxu-google's Repositories
xiangxu-google/vertex-ai-samples
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
xiangxu-google/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
xiangxu-google/xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)