tpu
There are 175 repositories under the tpu topic.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
tensorflow/adanet
Fast and flexible AutoML with learning guarantees.
hollance/neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
imcaspar/gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 multilingual support, with a 1.5-billion-parameter Chinese pretrained model
PygmalionAI/aphrodite-engine
Large-scale LLM inference engine
ayaka14732/tpu-starter
Everything you want to know about Google Cloud TPU
chrisbutner/ChessCoach
Neural network-based chess engine capable of natural language commentary
jofrfu/tinyTPU
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
tumaer/JAXFLUIDS
Differentiable Fluid Dynamics Package
magic-blue-smoke/Dual-Edge-TPU-Adapter
Adapter for using the Dual Edge TPU on a system with a single PCIe port on an M.2 A/B/E/M slot
embedeep/Free-TPU
Free TPU for FPGA with a compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor that uses a Xilinx FPGA to solve image classification, detection, and segmentation problems.
AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
JuliaGPU/XLA.jl
Julia on TPUs
Kohulan/DECIMER-Image_Transformer
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
hhk7734/tensorflow-yolov4
YOLOv4 Implemented in Tensorflow 2.
nyx-ai/stylegan2-flax-tpu
🖼 Training StyleGAN2 on TPUs in JAX
rwightman/efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
HomebrewNLP/revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
cameronshinn/tiny-tpu
Small-scale Tensor Processing Unit built on an FPGA
yapay-ogrenme/googlecodelabs
Train Your Neural Networks Much Faster with TPUs
cea-wind/SimpleTPU
An FPGA-based CNN accelerator, following Google's TPU V1.
embedeep/FREE-TPU-V3plus-for-FPGA
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
koshian2/OctConv-TFKeras
Unofficial implementation of Octave Convolutions (OctConv) in TensorFlow / Keras.
sayakpaul/FunMatch-Distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
wmcnally/evopose2d
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
PINTO0309/TPU-MobilenetSSD
Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC
AI-Hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
captain-pool/GSOC
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
GoogleCloudPlatform/ml-testing-accelerators
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
rickiepark/deep-learning-with-python-2nd
Code repository for the book "Deep Learning with Python, 2nd Edition" (Korean edition, 케라스 창시자에게 배우는 딥러닝 2판)
gsarti/t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
andreped/GradientAccumulator
🎯 Accumulated Gradients for TensorFlow 2
instadeepai/sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX