Issues
- 17
[Bug]: Rocblas errors out on fp16 gemm operations
#2044 opened by IMbackK - 2
[Question] How does tensile avoid bank conflict for matrix core instructions?
#2043 opened by LeiWang1999 - 1
[Feature]: More sample benchmark config.yaml needed to support bf16 and I8II on RDNA arches
#1955 opened by likelovewant - 2
[Issue]: Should Tensile/Tests/disabled be removed?
#2038 opened by littlewu2508 - 1
[Issue]: Calling min(size_t, int) in ContractionSolution.cpp causes ambiguity
#1977 opened by yxsamliu - 2
how to enable t_debug?
#1722 opened by jdgh000 - 2
[Issue]: How is a decision tree type YAML trained?
#1975 opened by lty-qd - 6
[Feature]: Generalize linux distro handling
#1969 opened by trixirt - 2
- 9
- 1
[Feature]: Offload compression
#2031 opened by LunNova - 5
[Feature]: Optional use of joblib
#1989 opened by trixirt - 4
undefined symbol: Tensile::TypedContractionInputs<Tensile::Half>::TypedContractionInputs()
#1833 opened by xiakubaobaore - 28
[Bug]: rocBLAS error: Cannot read TensileLibrary.dat: No such file or directory
#1936 opened by slipperyslipped - 7
Help enabling wmma instructions
#1943 opened by spayne - 6
hipblasdgemm not getting close to peak
#1705 opened by JorgeG94 - 2
Is Tensile adapted to RDNA2 ?
#1579 opened by v01dXYZ - 1
[Feature]: Restructure the code to build a wheel and use importlib to embed non-python files
#1874 opened by bioinfornatics - 8
TypeError: sequence item 0: expected str instance, NoneType found for compilerArgs
#1982 opened by waheedi - 2
kernel.cpp without assembly kernel implement
#1832 opened by DoubleClark - 4
[Feature]: support for gfx1103
#1922 opened by NeoChen1024 - 1
Kernels source code are not generated
#1769 opened by mabdallah89 - 7
[Feature]: about 7900xtx benchmark.
#1935 opened by Axl-zhang - 2
[Feature]: Further FP32 GEMM optimization for gfx11
#1715 opened by littlewu2508 - 3
Out of date tuning documentation
#1310 opened by kahmed10 - 0
[Feature]: Support for gfx1036
#1907 opened by bitozoid - 33
Tensile won't produce backend libraries for archs without optimized logic files when using --separate-architectures
#1757 opened by ulyssesrr - 1
The solution ends with an error.
#1413 opened by vasslavich - 0
- 1
- 0
- 5
build fails with rocm4.1
#1323 opened by gggh000 - 1
How to point an ISA in yaml (example)?
#1412 opened by vasslavich - 1
GlobalReadCoalesceGroupA[B] leads to wrong results
#1426 opened by vasslavich - 1
Set constStrideC0 to 0 instead of 1
#1515 opened by NevesLucas - 2
How to determine the input size to be tested?
#1558 opened by zeroMrCc - 1
Rounding error with Gfx90aFp16altSupport
#1583 opened by sumin-hong - 4
- 3
Enabling UnrollLoopEfficiencyEnable leads to crash during kernel generation
#1500 opened by NevesLucas - 2
`getKernel` should return `hipErrorNotFound` if no module
#1494 opened by v01dXYZ - 11
Trying RX 6700XT
#1410 opened by littlewu2508 - 2
Why MT not equal to WG*TT?
#1482 opened by lingjiew93 - 2
mixed precision causes failure to benchmark
#1490 opened by NevesLucas - 5
‘JoinParameters’ is no longer supported
#1479 opened by flint-stone - 12
Trying to compile with upstream llvm-13
#1455 opened by Maxzor - 2
rocblas installation fails during Tensile run with os.fork(): OSError: [Errno 12] Cannot allocate memory
#1415 opened by UweSauter - 1
No such file or directory: '/root/workspace/ROCm/Tensile/repo/Tensile/Source/SolutionMapper.h'
#1414 opened by vasslavich - 3
tensile_client throwing std::out_of_range
#1341 opened by elliottbinder - 6
- 5
How to specify several architectures to be generated
#1395 opened by littlewu2508