apple/ml-stable-diffusion

Unable to run model on iPhone 14 Pro with error: "failed to load ANE model" — works fine from CLI

jverkoey opened this issue · 7 comments

When I run:

let config = MLModelConfiguration()
config.computeUnits = .all
let pipeline = try! StableDiffusionPipeline(resourcesAt: modelUrl, configuration: config)

I get the following error:

[espresso] [Espresso::handle_ex_plan] exception=ANECF error: failed to load ANE model. Error=ANECCompile(/var/mobile/Library/Caches/com.apple.aned/tmp/com.featherless.MLDemo/CD3F6A18321CD0468900D511BF6E116C1AC2F5D1DB1D65F480343B1E5551B8A8/7204A653B1634F14166A639585DE3E3EDCFE052221F97F3476ECE9475CD8A5DE/) FAILED: err=(
    CompilationFailure
)
[coreml] Error plan build: -1.
[client] doUnloadModel:options:qos:error:: nil _ANEModel

The model is a converted Stable Diffusion model. I converted it with the following command-line invocation:

python3 -m python_coreml_stable_diffusion.torch2coreml --convert-unet \
  --convert-text-encoder --convert-vae-decoder --convert-safety-checker \
  -o /Users/featherless/MLDemo/new-model   --model-version featherless/test-model \
  --chunk-unet --bundle-resources-for-swift-cli

The same model runs fine when invoked via command line:

swift run StableDiffusionSample "a digital portrait of an astronaut riding a horse, futuristic, highly detailed, HDR, 4k, illustration" \
  --resource-path /Users/featherless/MLDemo/new-model/Resources  \
  --seed=1235 --output-path /Users/featherless/MLDemo/output

Environment

Xcode Version 14.1 (14B47b)
Apple M1 Max, Ventura 13.1
iPhone 14 Pro, iOS 16.1.2

Ah, I tried rebuilding the model, and this time I'm getting a "Terminated due to memory issue" error on the device.

Switching to config.computeUnits = .cpuAndNeuralEngine, as recommended in the README, fixed the memory failure, but now I'm back to the "failed to load ANE model" error :(

Using config.computeUnits = .cpuOnly fixes the compilation error, but now I'm running out of memory again. It looks like memory is exhausted when the Image Decoder is loaded, so I wonder if it would be possible to load that model lazily...
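
To sketch what I mean by lazy loading (purely illustrative, not part of the pipeline API; LazyDecoder is a hypothetical wrapper, and I'm assuming the decoder lives in the bundled resources as VAEDecoder.mlmodelc, which may not match what the converter actually emits):

import CoreML

// Hypothetical wrapper: defer loading the image decoder until the first
// time an image actually needs to be decoded, and allow unloading it after.
final class LazyDecoder {
    private let modelURL: URL
    private let configuration: MLModelConfiguration
    private var model: MLModel?

    init(resourcesAt baseURL: URL, configuration: MLModelConfiguration) {
        // Assumed resource name; adjust to whatever torch2coreml produces.
        self.modelURL = baseURL.appendingPathComponent("VAEDecoder.mlmodelc")
        self.configuration = configuration
    }

    // Load on first use, so the denoising loop can run before this memory is paid for.
    func loadedModel() throws -> MLModel {
        if let model { return model }
        let loaded = try MLModel(contentsOf: modelURL, configuration: configuration)
        model = loaded
        return loaded
    }

    // Drop the decoder again once the image has been produced.
    func unload() { model = nil }
}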

Ah interesting, thank you! Unfortunately it looks like "Personal development teams do not support the Extended Virtual Addressing capability"; did you find that it was required?

I'm not entirely sure that's accurate (I'm able to enable the Extended Virtual Addressing capability on a personal development team, though perhaps you saw something indicating an app would be rejected from the App Store if it uses it). Nonetheless, to get this running on an iPhone 14 Pro Max for me without the memory termination, I have to create the pipeline with the .cpuAndGPU compute units, like so:

let config = MLModelConfiguration()
config.modelDisplayName = model.name
config.computeUnits = .cpuAndGPU
let pipeline = try StableDiffusionPipeline(resourcesAt: resourceURL, configuration: config, disableSafety: false)
...

Further, I also had to enable the "Increased Memory Limit" entitlement. I followed this post from Quinn at Apple Developer Support on the Apple Developer Forums, and the link it provides, to enable "Increased Memory Limit" (which, unlike Extended Virtual Addressing, doesn't show up as a capability in Xcode's Signing & Capabilities). Between enabling "Increased Memory Limit" and "Extended Virtual Addressing" and building the pipeline with .cpuAndGPU, I'm able to generate images on-device with Stable Diffusion 2, for example.
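
As an aside, a quick way to sanity-check that the entitlement actually took effect on the device (not from the repo; I'm assuming os_proc_available_memory() is callable from Swift via the os module, and the MB conversion is just for readability):

import os

// Reports how much memory this process can still allocate before hitting its
// limit; the value should be noticeably larger once the
// com.apple.developer.kernel.increased-memory-limit entitlement is applied.
let availableBytes = os_proc_available_memory()
print("Available to this process: \(availableBytes / 1_048_576) MB")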

(Most of this is from the comment that @outcoldman referenced in the link above).