Code for the paper Simulated Annealing in Early Layers Leads to Better Generalization (CVPR 2023)
Primary LanguagePython