Why add special tokens manually in sql_dataset.py?

Question

Closed 10 months ago · 1 comment

In datasets/sql_dataset.py, you manually add several special tokens to the prompt instead of relying on the tokenizer to add them. Is there a good reason for that?
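To make the distinction concrete, here is a minimal sketch of the two approaches. It does not reproduce the code in sql_dataset.py: `ToyTokenizer`, `manual_prompt`, and the `<s>` / `</s>` marker strings are hypothetical stand-ins chosen for illustration. It shows one practical concern with hand-written special tokens, which is that they can end up duplicated if the tokenizer also adds its own.

```python
from dataclasses import dataclass

# Illustrative marker strings only; not taken from sql_dataset.py.
BOS, EOS = "<s>", "</s>"


@dataclass
class ToyTokenizer:
    """Hypothetical whitespace tokenizer that can add BOS/EOS itself."""

    add_special_tokens: bool = True

    def encode(self, text: str) -> list[str]:
        tokens = text.split()
        if self.add_special_tokens:
            # The tokenizer wraps the sequence with its own markers.
            tokens = [BOS, *tokens, EOS]
        return tokens


def manual_prompt(question: str) -> str:
    # Special tokens are written into the prompt string by hand.
    return f"{BOS} {question} {EOS}"


question = "SELECT name FROM users"

# Tokenizer-managed: plain text in, markers added once by the tokenizer.
managed = ToyTokenizer(add_special_tokens=True).encode(question)

# Manual markers with tokenizer markers disabled: same token sequence.
manual = ToyTokenizer(add_special_tokens=False).encode(manual_prompt(question))

# Manual markers AND tokenizer markers: BOS and EOS appear twice.
doubled = ToyTokenizer(add_special_tokens=True).encode(manual_prompt(question))

print(managed)
print(manual)
print(doubled)
```

In this toy setup the manual and tokenizer-managed paths agree only when the tokenizer's own special-token handling is turned off, which is why the manual approach seems worth questioning.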

Answer 1 · 2024-01-05T14:12:14.000Z

It was copied from the original recipes repo (which we have now deprecated), so unfortunately I'm not sure why it was done that way originally.