/GrokkedTransformer

Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'

Primary LanguagePythonMIT LicenseMIT

Stargazers