Pinned Repositories
CS516_DataPipeline
docker-python
Kaggle Python docker image
Duke-Tsinghua-MLSS-2017
Duke-Tsinghua Machine Learning Summer School 2017
JetStream
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
llama
Inference code for LLaMA models
lsy323.github.io
GitHub Pages test
lsy323's Repositories
lsy323/CS516_DataPipeline
lsy323/docker-python
Kaggle Python docker image
lsy323/Duke-Tsinghua-MLSS-2017
Duke-Tsinghua Machine Learning Summer School 2017
lsy323/JetStream
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
lsy323/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
lsy323/llama
Inference code for LLaMA models
lsy323/lsy323.github.io
GitHub Pages test
lsy323/ml-auto-solutions
A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across different frameworks.
lsy323/ml-testing-accelerators
Testing framework for deep learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
lsy323/onnx
Open standard for machine learning interoperability
lsy323/openxla-xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
lsy323/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
lsy323/tensorflow
An Open Source Machine Learning Framework for Everyone
lsy323/stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
lsy323/tpu_debug
lsy323/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
lsy323/xla
Enabling PyTorch on Google TPU