/sgd-sparse-features

SGD with large step sizes learns sparse features [ICML 2023]

Primary Language: Jupyter Notebook