john-adeojo/MoA

The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

PythonMIT

Readme
0Issues
0Stargazers
0Watchers

Stargazers

No one’s star this repository yet.

Contact site admin: Geeks.