Convolution layer filter width

Question

sharpsy opened this issue 6 years ago · 1 comments

The code that does the convolution has two separate implementations: one for a filter width of 1 and one for all other filter widths.

Convolution with a filter width of 1 is special-cased because a faster implementation exists for it, and it is the only filter width the current code uses. The other implementation has the comment "#was used to train LM".

Does that mean a different convolution filter width was used for language modeling, or was it the same filter width with a less efficient implementation?

Answer 1 · 2018-08-30T18:44:16.000Z

Less efficient implementation of the same thing!
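
To illustrate why the two code paths can give the same result: a convolution with filter width 1 mixes channels at each position independently, so it can be computed as a matrix multiply per position. The thread does not show the repository code, so the following is only a minimal dependency-free sketch. The function names `conv1d_general` and `conv1d_width1` and the toy values are hypothetical, and it assumes no padding and a stride of 1.

```python
import math

# Hypothetical helper names for illustration; not taken from the repository.

def conv1d_general(x, w, b):
    """Generic 1D convolution (cross-correlation), no padding, stride 1.

    x: [seq_len][n_in], w: [width][n_in][n_out], b: [n_out]
    """
    width, n_in, n_out = len(w), len(w[0]), len(b)
    out = []
    for t in range(len(x) - width + 1):
        row = list(b)  # start each output position from the bias
        for k in range(width):
            for i in range(n_in):
                for o in range(n_out):
                    row[o] += x[t + k][i] * w[k][i][o]
        out.append(row)
    return out


def conv1d_width1(x, w, b):
    """Width-1 special case: multiply each position by the matrix w[0], add bias."""
    w0 = w[0]
    return [
        [sum(xi * w0[i][o] for i, xi in enumerate(pos)) + b[o]
         for o in range(len(b))]
        for pos in x
    ]


# Toy input: 3 positions, 2 input channels, 3 output channels, filter width 1.
x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
w = [[[1.0, 0.0, -1.0], [0.5, 2.0, 1.0]]]
b = [0.1, 0.2, 0.3]

general = conv1d_general(x, w, b)
fast = conv1d_width1(x, w, b)

# Compare with a tolerance, since the two paths sum in a different order.
same = all(
    math.isclose(g, f)
    for g_row, f_row in zip(general, fast)
    for g, f in zip(g_row, f_row)
)
print("outputs match:", same)
```

With a width of 1 the generic loop does strictly more bookkeeping for the same arithmetic, which is consistent with the answer above: same filter width, slower path.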