/Griffin-Jax

Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers