SquareAndCompass-2/LongNet
Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Python, Apache-2.0
No issues in this repository yet.