Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.
Primary language: Python
License: GNU General Public License v3.0 (GPL-3.0)