aashiqmuhamed/GRASS

GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

MIT

Readme
2Issues
10Stargazers
4Watchers

Issues

About the LLaMA-1B trained on C4
#1 opened 6 months ago by shixiangsong
0

Contact site admin: Geeks.