/transformer

Implementing a transformer from scratch

Primary LanguagePythonApache License 2.0Apache-2.0

Attention Is All You Need

Implementing "Attention Is All You Need" Vaswani et al. from scratch in Jax and Torch.