Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Primary LanguageJupyter NotebookApache License 2.0Apache-2.0
No one’s star this repository yet.