keras-team/keras

Multihead Attention Seed Specification

egehancosgun opened this issue · 1 comments

Dropout layer inside the multihead attention layer does not take any seed as an argument. This causes non-deterministic outputs. Can you please add this in future releases?

Thanks for the suggestion. This is now added.