/gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Primary LanguagePythonMIT LicenseMIT

Watchers