CS123n opened this issue 2 years ago · 0 comments
Hi, very interesting work! I ran your code and found that a larger batch size does not improve the OOD results. Is this expected?