/Exploring-Attention

Fun Casual Implementation of Attention ✨ | Self Attention 👁️ | Multi-Headed Attention 🧠 | Grouped Query Attention 📊 | KV-Caching 💾 | Masking 🕵️

Primary Language: Jupyter Notebook | License: Apache License 2.0 (Apache-2.0)
