Issues on evaluating latency using TVM.

Question

Issues on evaluating latency using TVM.

Closed this issue 7 months ago · 3 comments

Hi I’m currently working on compiling I-ViT using TVM. On this project, The error appears.

Check failed: value < 1LL << (dtype.bits() - 1) (192 vs. 128) : ValueError: Literal value 192 exceeds maximum of int8

by changing value 192 lower than 128 on build_model.py seems to sove the issue.
if name == 'deit_tiny_patch16_224': #embed_dim = 192 embed_dim = 92 num_heads = 3
But, strictly speaking, 'But this method' involves arbitrarily modifying the model's structure, so it is not an appropriate solution.
should changing TVM's version solve this issue?

Thanks always.

Answer 1 · 2024-01-07T04:17:54.000Z

@zkkli , @rkdgmlqja
I have a same issue.
if embed_dim is higher then 128(int8 range), ValueError occurs. But in buid_model.py file, every embed_dim is higher then 128. So it occurs error inevitably.
How to solve this problem?

Answer 2 · 2024-05-28T05:53:07.000Z

The Error was caused by quantized layer norm from layers.py

def quantized_layernorm(data, 
                    bias_int):
    data = relay.cast(data, 'int32')
    mean = relay.mean(data, axis=2, keepdims=True)
    data = data - mean
    data_sq = data * data

This should fix it

Answer 3 · 2024-09-19T01:06:30.000Z

hi, @rkdgmlqja . did you successfully run tvm deployment(evaluate_accuracy.py, evaluate_latency.py)?
I got stuck with it.
I've tried it in the same environment as the author guided(tvm 0.9dev0 from source build, timm 0.4.2).
but the inference results(top 5) was totally different from torch.
also, I always failed in the auto tuning progress with a unknown sudden error.
what version of tvm, timm or some other settings did you use? is there some more things I should change to run it correctly?
if possible please help me~!