huggingface/ratchet

Llama3 8B? 🦙

Closed this issue · 0 comments

We aren't far off from having everything needed to run it:

  • 1. I'm adding the infra now to load models of any size on the web despite 2GB WASM limits.
  • 2. SILU needed
  • 3. GQA needed