tpu
There are 206 repositories under the tpu topic.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
skypilot-org/skypilot
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
tensorflow/adanet
Fast and flexible AutoML with learning guarantees.
hollance/neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
imcaspar/gpt2-ml
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support with a 1.5-billion-parameter Chinese pretrained model.
aphrodite-engine/aphrodite-engine
Large-scale LLM inference engine
sophgo/tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
LuxDL/Lux.jl
Elegant and Performant Deep Learning
ayaka14732/tpu-starter
Everything you want to know about Google Cloud TPU
jofrfu/tinyTPU
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
chrisbutner/ChessCoach
Neural network-based chess engine capable of natural language commentary
tumaer/JAXFLUIDS
Differentiable Fluid Dynamics Package
AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
magic-blue-smoke/Dual-Edge-TPU-Adapter
Dual Edge TPU adapter for using it on a system with a single PCIe port via an M.2 A/B/E/M slot
Kohulan/DECIMER-Image_Transformer
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into SMILES strings, enabling the digitization of chemical data from scanned documents, literature, and patents.
embedeep/Free-TPU
Free TPU for FPGA with a compiler supporting PyTorch/Caffe/Darknet/NCNN. An AI processor that uses Xilinx FPGAs to solve image classification, detection, and segmentation problems.
JuliaGPU/XLA.jl
Julia on TPUs
cameronshinn/tiny-tpu
Small-scale Tensor Processing Unit built on an FPGA
robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
cea-wind/SimpleTPU
An FPGA-based CNN accelerator, following Google's TPU v1.
embedeep/FREE-TPU-V3plus-for-FPGA
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for deep learning edge inference
AI-Hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
hhk7734/tensorflow-yolov4
YOLOv4 implemented in TensorFlow 2.
nyx-ai/stylegan2-flax-tpu
🖼 Training StyleGAN2 at scale on TPUs
HomebrewML/revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
rwightman/efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
yapay-ogrenme/googlecodelabs
Train your artificial neural networks much faster with TPUs
koshian2/OctConv-TFKeras
Unofficial implementation of Octave Convolutions (OctConv) in TensorFlow / Keras.
sayakpaul/FunMatch-Distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
wmcnally/evopose2d
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
PINTO0309/TPU-MobilenetSSD
Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC
rickiepark/deep-learning-with-python-2nd
Code repository for the Korean edition of the book Deep Learning with Python, 2nd Edition (케라스 창시자에게 배우는 딥러닝 2판)
AI-Hypercomputer/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
captain-pool/GSOC
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
GoogleCloudPlatform/ml-testing-accelerators
Testing framework for deep learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)