Issues about training on caffe

Question

leochli opened this issue 6 years ago · 1 comments

Hi Austing,
You said there is not an optimized Depthwise conv layer in caffe.
I found there're 2 dw conv implementations:

Are they not good enough during your experiment? Or is there any other reasons for not training in caffe?

Answer 1 · 2018-06-12T18:20:36.000Z

@leochli Those implementation is slower than tensorflow or MXNet, especially when use large batch size, such as > 64. So that's not the first choice.