Can DKM run on CPU only?
ducha-aiki opened this issue · 9 comments
I think this issue was brought up previously. The code currently has a bunch of .cuda() calls, but in principle (at a low enough resolution) it should work.
Like LoFTR and other works, DKM is quite computationally heavy at high resolution, but if you use (384, 512) or similar with fp16 you could probably get a reasonable inference time on CPU.
@Parskatt Thanks. My concern is not the .cuda() calls, but the usage of cupy. Is there a fallback implementation of local correlation?
Aha, the cupy calls are actually there so we can run PDCNet internally in our framework, and I kind of forgot to remove that import. Our implementation just uses native PyTorch operations.
I should probably push a fix removing that dependency to cause less confusion...
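For anyone curious what a purely native-PyTorch local correlation looks like: here is a minimal sketch (not DKM's actual implementation; `local_correlation` and the normalization are my own illustration) that correlates each position in one feature map with a local window in the other using only `F.unfold` and broadcasting, so it runs on any device PyTorch supports.

```python
import torch
import torch.nn.functional as F

def local_correlation(feat_a, feat_b, radius=4):
    """Correlate each position of feat_a with a (2*radius+1)^2
    neighborhood of feat_b, using only native PyTorch ops.

    feat_a, feat_b: (B, C, H, W) feature maps.
    Returns: (B, (2*radius+1)**2, H, W) correlation volume.
    """
    b, c, h, w = feat_a.shape
    k = 2 * radius + 1
    # Extract the k*k neighborhood of feat_b around every position.
    neighbors = F.unfold(feat_b, kernel_size=k, padding=radius)  # (B, C*k*k, H*W)
    neighbors = neighbors.view(b, c, k * k, h, w)
    # Dot product over the channel dimension, scaled by sqrt(C).
    corr = (feat_a.unsqueeze(2) * neighbors).sum(dim=1)          # (B, k*k, H, W)
    return corr / c ** 0.5

feats_a = torch.randn(1, 64, 48, 64)
feats_b = torch.randn(1, 64, 48, 64)
corr = local_correlation(feats_a, feats_b, radius=4)
print(corr.shape)  # torch.Size([1, 81, 48, 64])
```

Note that `unfold` materializes the full neighborhood tensor, so memory grows with the window size; that is the usual trade-off versus a custom CUDA/cupy kernel.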
@ducha-aiki I've added a pull request for it; however, I'm very busy at the moment, so I can't rule out that I broke something. I'll look into cleaning up the codebase in the weeks to come. Sorry for the mess.
@Parskatt thank you, that's great news! In particular, I am interested in integrating DKM into kornia, alongside LoFTR :)
https://github.com/kornia/kornia
And going back to the CPU/CUDA question: I would probably do a PR soon allowing DKM to run on the Apple M1 GPU (torch.device('mps')), if local correlation is not required.
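Once the hard-coded .cuda() calls are gone, picking the backend can be a small helper like the sketch below (a hypothetical `pick_device` for illustration, not part of DKM), falling back from CUDA to MPS to CPU.

```python
import torch

def pick_device() -> torch.device:
    """Return the best available backend: CUDA, then Apple MPS, then CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    # torch.backends.mps only exists in PyTorch builds with MPS support.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

device = pick_device()
print(device)
```

A model moved with `model.to(pick_device())` then runs unchanged on whichever backend is present, as long as every op it uses (e.g. the native local correlation) is supported there.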
Sounds great. I think we can provide a "speedy" model as well, running with AMP at a lower resolution; that should be nice for more real-time applications.
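The AMP-at-low-resolution idea sketched above amounts to wrapping the forward pass in an autocast context. A minimal illustration (using a stand-in `Conv2d` module rather than the real DKM model, and bfloat16 so it also runs on CPU):

```python
import torch

# Stand-in for the matcher; the real model takes a pair of images.
model = torch.nn.Conv2d(3, 8, kernel_size=3, padding=1).eval()
img = torch.randn(1, 3, 384, 512)  # the lower resolution discussed above

# Autocast runs eligible ops (e.g. convolutions) in reduced precision;
# inference_mode skips autograd bookkeeping for extra speed.
with torch.inference_mode(), torch.autocast("cpu", dtype=torch.bfloat16):
    out = model(img)

print(out.shape, out.dtype)
```

On a CUDA device you would use `torch.autocast("cuda")` (fp16) instead; the call sites stay the same.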
I have cleaned up the devices here: #26
After the merge, we can do the kornia integration.