replicate/replicate-python

`meta/meta-llama-3-70b` ignores `max_tokens`


I'm fairly sure I'm sending `max_tokens`, and:

  • I get far more tokens than requested
  • I also don't see `max_tokens` in the input when I look at the prediction in the browser

When I use exactly the same code with e.g. `meta/llama-2-70b`, this does not happen, i.e. I really do get the requested number of tokens.
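
For reference, here is a minimal sketch of the kind of call I'm making (the prompt and the value of `max_tokens` are just placeholders; it assumes the standard `replicate.run()` API with the parameter passed in the `input` dict):

```python
import replicate

# Sketch of the call: max_tokens is passed in the input dict,
# but the prediction comes back with far more tokens than this cap.
output = replicate.run(
    "meta/meta-llama-3-70b",
    input={
        "prompt": "Write a one-sentence summary of the French Revolution.",
        "max_tokens": 64,  # expected to cap the output length
    },
)

# For language models the client yields the output as string chunks
print("".join(output))
```

Swapping the model string to `meta/llama-2-70b` in the same snippet respects the cap.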