/efficient_softmax

BlackOut and Adaptive Softmax for language models by Chainer

Primary LanguagePython

Watchers