Stretching GPU performance for GEMMs and tensor contractions.
Primary language: C++. License: MIT.
See the Tensile Wiki for documentation.