modal-labs/modal-examples

How to run the Llama 3.2 1B model?

Closed this issue · 1 comment

Using this example:

06_gpu_and_ml/llm-serving/text_generation_inference.py

I'm unable to get generated text correctly.
[screenshot of the incorrect output attached]

Hey there!

We reserve the GitHub Issues page for problems with the examples as they are written, but this looks like an issue with the example after it has been edited.

We provide support for general use of the Modal platform in our Community Slack, https://modal.com/slack.

The short answer is that it looks like the chat template here is wrong.