Make token counting faster and more robust

Question

Opened this issue 3 months ago · 0 comments

Make token counting faster and more robust once abetlen/llama-cpp-python#1763 is fixed.