jason9693/MusicTransformer-tensorflow2.0

Implement QKV logic in terms of einsum

Opened this issue · 0 comments

You could reimplement the QKV / dense logic in terms of einsum for faster computation. An example layer here and the use here. This is how it is is now implemented in the tf2 version of bert / transformer.