john-adeojo/MoA
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
Python · MIT
Stargazers
No one has starred this repository yet.