NSTiwari/Llama3-on-Mobile
This repository quantizes and converts the Llama3-8B-Instruct model weights and deploys the model on Android for on-device inference.
Makefile · MIT
Issues
- 4
run bug
#1 opened by zyxcambridge