/based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers