/ScalingLaws

Scaling Laws for Linear Complexity Language Models

Stargazers