Standardize Chat Prompt Templates to Use Jinja Format
calycekr opened this issue · 8 comments
Describe your feature request
Currently, the `chatPromptTemplate` that can be set in env for each model uses Handlebars format. However, the `chat_template` in the actual model's tokenizer_config.json uses Jinja format. This inconsistency causes significant inconvenience. Since Jinja is widely used and preferred, it would be beneficial to standardize on Jinja for both `chatPromptTemplate` and the tokenizer's `chat_template`. This would improve consistency and ease of use for developers.
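To illustrate the target format, here is a minimal sketch of rendering a Jinja-format chat template (the format models ship in tokenizer_config.json) with the `jinja2` library. The template and role tokens below are hypothetical examples, not any specific model's template:

```python
# Render a Jinja-style chat template, as found in tokenizer_config.json.
# The template string here is illustrative, not a real model's template.
from jinja2 import Environment

template_str = (
    "{% for message in messages %}"
    "<|{{ message['role'] }}|>{{ message['content'] }}</s>"
    "{% endfor %}"
    "<|assistant|>"  # trailing generation prompt
)

template = Environment().from_string(template_str)

messages = [
    {"role": "user", "content": "Hello!"},
    {"role": "assistant", "content": "Hi, how can I help?"},
    {"role": "user", "content": "Tell me a joke."},
]

prompt = template.render(messages=messages)
print(prompt)
# -> <|user|>Hello!</s><|assistant|>Hi, how can I help?</s><|user|>Tell me a joke.</s><|assistant|>
```

The equivalent Handlebars template would use `{{#each messages}}…{{/each}}` blocks instead, which is exactly the mismatch this issue is about.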
Implementation idea
To implement this change, the following steps can be taken:
- **Update codebase:** Update the codebase to handle Jinja templates for `chatPromptTemplate`.
- **Documentation:** Update the documentation to reflect this change and provide examples of how to use Jinja templates.
- **Testing:** Thoroughly test the changes to ensure compatibility and that all existing templates work correctly with the new format.
This is a good idea! It would be a breaking change though so we could keep handlebars as a fallback if the jinja evaluator fails, wdyt?
@nsarrazin Sure, I agree with your suggestion. Keeping Handlebars as a fallback in case the Jinja evaluator fails is a good approach to ensure backward compatibility and a smooth transition.
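The fallback behavior discussed above could be sketched roughly as follows. This is a hypothetical Python sketch of the control flow only (chat-ui itself is TypeScript); `render_handlebars` is a stand-in for the existing Handlebars renderer:

```python
# Sketch of the proposed behavior: try the template as Jinja first and
# fall back to the current Handlebars path if Jinja fails to handle it.
from jinja2 import Environment, TemplateError


def render_handlebars(template_str: str, **ctx) -> str:
    # Stand-in for the existing Handlebars-based renderer.
    raise NotImplementedError("hypothetical Handlebars fallback")


def render_chat_prompt(template_str: str, **ctx) -> str:
    try:
        # A Handlebars template like "{{#each messages}}" is not valid
        # Jinja syntax, so parsing it raises a TemplateError here.
        return Environment().from_string(template_str).render(**ctx)
    except TemplateError:
        return render_handlebars(template_str, **ctx)
```

Because Handlebars block helpers such as `{{#each}}` are invalid Jinja syntax, legacy templates would naturally trip the fallback rather than render incorrectly.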
yep, seems like the field has standardized on Jinja in the past year 😁 cc @Rocketknight1 😮
Please don't fire me, I know you liked Handlebars! 😰
(but yes transitioning to Jinja is definitely the right decision)
cements that we're a Python-first company (and community) i guess 😁
hey guys, did you add this feature?
@varad0309 the recommended way for now would be to set `"tokenizer"` to point to the model's repo on the Hub, so it can pick up the chat template (using Jinja) from there.
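For example, a model entry in the `MODELS` env var might look like this (a sketch only; fields other than `tokenizer` are illustrative and may differ from your setup):

```
MODELS=`[
  {
    "name": "Qwen/Qwen2.5-Coder-14B-Instruct",
    "tokenizer": "Qwen/Qwen2.5-Coder-14B-Instruct",
    "endpoints": [{ "type": "tgi", "url": "http://127.0.0.1:8080" }]
  }
]`
```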
@nsarrazin, thanks for the fast reply. Is there a way to do the same if we have a local model?
Also, I just tried `"tokenizer": "Qwen/Qwen2.5-Coder-14B-Instruct"`. It is not working.