/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Primary LanguagePythonMIT LicenseMIT

A tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on a GPU.

See Tensile Wiki for documentation.