/attention.mojo

Comparing multi-head attention performance in Mojo 🔥 vs. Numpy

Primary LanguagePython

Watchers