Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
Primary LanguagePythonApache License 2.0Apache-2.0
No issues in this repository yet.