New vector-times-matrix-transposed benchmark fails to run on Nvidia GPUs..
oscarbg opened this issue · 0 comments
oscarbg commented
Hi,
running on Nvidia 4070 I get:
uVkCompute/build/benchmarks/vmt
./vmt_rdna3
2023-11-07T17:08:45+01:00
Running ./vmt_rdna3
Run on (32 X 5881 MHz CPU s)
CPU Caches:
L1 Data 32 KiB (x16)
L1 Instruction 32 KiB (x16)
L2 Unified 1024 KiB (x16)
L3 Unified 32768 KiB (x2)
Load Average: 8.08, 5.68, 2.31
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
***WARNING*** Library was built as DEBUG. Timings may be affected.
uVkCompute/benchmarks/vmt/vmt_main.cc:123: check error: destination buffer element (0) has incorrect value: expected to be 1404 but found -1
^ In shader: Tile[1x16], i8->i32
Abortado (`core' generado)