Optimize categorical crossentropy gradient update

Question

Optimize categorical crossentropy gradient update

NicolasHug opened this issue 6 years ago · 3 comments

The gradient and hessian update of the categorical crossentropy loss computes p_k k times, but it only needs to compute it once (see scikit-learn/scikit-learn@9e68984 which led to serious improvement in fit time).

Answer 1 · 2019-02-18T13:57:30.000Z

+1 as well.

Answer 2 · 2019-04-18T05:09:07.000Z

@NicolasHug Can I also work on this project?

Answer 3 · 2019-04-18T10:21:43.000Z

@aditya1702 please feel free to submit a PR.