OSU-NLP-Group/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
PythonMIT
Stargazers
- A-PolarBearSichuan University
- alando46Berkeley
- Cemberk
- DamarPanuluh
- dantodorBusymachines
- drogozhangThe Ohio State University
- EmasoftRome, Italy
- evdcush
- evolu8
- gburachas
- goddoeNAVER Cloud, Hyperscale AI
- HakeemDemiLondon UK
- hjlPalo Alto, California
- huyphan168FPT AI Residency
- jimz7UT Austin
- jonnyli1125University of Toronto
- kashperova
- kenny5s
- MehmetMHYeBay
- MrZilinXiaoRice University
- Nardien
- peiyong-addwater
- pharaouk
- pigtamerTokyo Institute of Technology
- s-mackeMunich
- showgood163
- simpleusername96
- soheeyangUCL/DeepMind
- somvyAIRI
- standardgalacticXanadu
- tokarev-i-v
- TownesZhouPurdue University
- X4Dubai, UAW
- ynyrllw
- ysu1989The Ohio State University / Microsoft Semantic Machines
- zwbxAdelaide, Australia