# mixture-of-attention

Some personal experiments around routing tokens to different autoregressive attention branches, akin to mixture-of-experts.
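To illustrate the idea, here is a toy, dependency-free sketch of routing tokens to per-expert attention: a gate scores each token, each token is assigned to its top-scoring attention "expert", and tokens attend autoregressively only among the tokens routed to the same expert. All function names, the top-1 routing choice, and the hand-written gate vectors are illustrative assumptions, not this repository's API.

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of floats
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query, keys, values):
    # plain dot-product attention of one query over a set of keys/values
    weights = softmax([sum(q * k for q, k in zip(query, key)) for key in keys])
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]

def route_tokens(tokens, gate_weights):
    # top-1 routing: each token goes to the expert whose gate vector scores it highest
    assignments = []
    for tok in tokens:
        scores = [sum(t * w for t, w in zip(tok, gate)) for gate in gate_weights]
        assignments.append(max(range(len(gate_weights)), key=lambda e: scores[e]))
    return assignments

def mixture_of_attention(tokens, gate_weights):
    assignments = route_tokens(tokens, gate_weights)
    out = [None] * len(tokens)
    for e in range(len(gate_weights)):
        # tokens routed to expert e attend only among themselves,
        # causally: each attends to itself and earlier same-expert tokens
        idx = [i for i, a in enumerate(assignments) if a == e]
        for pos, i in enumerate(idx):
            ctx = [tokens[j] for j in idx[: pos + 1]]
            out[i] = attention(tokens[i], ctx, ctx)
    return out, assignments

# tiny usage example with 4 two-dimensional tokens and 2 experts
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
gate_weights = [[1.0, 0.0], [0.0, 1.0]]  # one gate vector per expert
out, assignments = mixture_of_attention(tokens, gate_weights)
```

In a real model the gate would be learned and routing would be differentiable (e.g. via weighted top-k), but the sketch captures the structural point: attention is computed per routed group rather than over the full sequence.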

Primary language: Python. License: MIT.
