Loading the model with quantized weights twice corrupts the model


To reproduce:

Call the load weights function twice and run the model; you get NaNs. This does not happen with normal fp16/32 weights.


graph.openStore(sdxl_model_path) {
  $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
}

graph.openStore(sdxl_model_path) {
  $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
}

What could be the possible problem and solution?
Thanks

liuliu commented

Probably because, unlike normal weights, which we allocate on the nnc side and just read the blob into, jit weights are allocated on the s4nnc side: https://github.com/liuliu/s4nnc/blob/main/nnc/Store.swift#L2053

A workaround would be to create a new model whenever you need to load the weights; otherwise we need to look into why this behavior (possible memory corruption) happens and how to fix it.
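
A minimal sketch of that workaround, assuming the same DynamicGraph/Model setup as in the snippet above; buildUNet() is a hypothetical factory standing in for however the model is actually constructed in the caller's code:

// Workaround sketch: construct a fresh model before each quantized load so the
// jit-decoded weights always land in newly allocated storage.
// buildUNet() is hypothetical; substitute the real model constructor.
func loadQuantizedUNet(graph: DynamicGraph, path: String) -> Model {
  let unet = buildUNet()
  graph.openStore(path) {
    $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
  }
  return unet
}

// Each call returns an independently constructed and loaded model instead of
// re-reading into the same instance.
let unet1 = loadQuantizedUNet(graph: graph, path: sdxl_model_path)
let unet2 = loadQuantizedUNet(graph: graph, path: sdxl_model_path)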

Okay thanks

liuliu commented

The limited case is fixed in 53f737c

The reason it is limited is that if a weight of the same name is quantized differently (for example, once in q6p and another time in q8p), it will still produce NaNs.
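
To spell out what that leaves unsafe, a hedged sketch (other_model_path below is purely illustrative): re-reading the same weight from the same store is now handled, but reading the same weight name from a store where it is quantized differently still is not, so rebuild the model before doing that.

// Covered by the fix above: re-reading the same weight from the same store,
// where it is quantized the same way each time.
graph.openStore(sdxl_model_path) {
  $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
}
graph.openStore(sdxl_model_path) {
  $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
}

// Still risky per the comment above: the same weight name read from a store
// where it is quantized differently (say q8p instead of q6p). Construct a
// fresh model before a read like this. other_model_path is an illustrative name.
graph.openStore(other_model_path) {
  $0.read("unet", model: unet, codec: [.q6p, .q8p, .jit, .ezm7])
}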

Thanks