The buffer_size
variable is currently hard-coded in npu.cpp. It needs to be configurable on a per-benchmark basis, either through the source code (?) or a command-line flag.
Quoth @andreolb:
For inversek2j this should be set to 64. For jmeint it should be 576.