PersiaML/PERSIA

Support low-memory Adagrad sparse optimizer

NOBLES5E opened this issue · 0 comments

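Add an Adagrad variant in which each embedding vector shares a single float for its gradient running average, instead of keeping separate optimizer state for every element of the vector.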
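A rough sketch of what is being asked for, in plain Python so it stays independent of PERSIA's actual code. The class name `RowWiseAdagrad`, its parameters, and the dict-based state are all hypothetical and are not part of PERSIA's API. It also assumes the shared float accumulates the mean of the squared gradient entries of each row (a running sum, as in standard Adagrad); the issue does not specify the exact statistic.

```python
import math


class RowWiseAdagrad:
    """Hypothetical sketch: one float of optimizer state per embedding vector."""

    def __init__(self, lr=0.01, eps=1e-10, initial_accumulator=0.0):
        self.lr = lr
        self.eps = eps
        self.initial_accumulator = initial_accumulator
        # embedding id -> single float shared by the whole vector
        self.state = {}

    def update(self, emb_id, weights, grads):
        """Return the updated weights for one embedding vector."""
        # Assumption: collapse the per-element squared gradients into their mean.
        mean_sq = sum(g * g for g in grads) / len(grads)
        acc = self.state.get(emb_id, self.initial_accumulator) + mean_sq
        self.state[emb_id] = acc
        # Every element of the vector uses the same step scale.
        step = self.lr / (math.sqrt(acc) + self.eps)
        return [w - step * g for w, g in zip(weights, grads)]


opt = RowWiseAdagrad(lr=0.1)
new_weights = opt.update(7, [1.0, 1.0], [3.0, 4.0])
```

With an embedding dimension of `d`, this stores 1 float of optimizer state per embedding instead of `d`, at the cost of no longer adapting the step size per element.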