JeanKaddour/NoTrainNoGain

Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)

PythonMIT

Readme
2Issues
80Stargazers
6Watchers

Watchers

drkostas
University of Tennessee, Knoxville
JeanKaddour
London
michalwols
New York
oscarkey
University College London
proger
Supercomputer City
russelldc
@midjourney

Contact site admin: Geeks.