Butter Smoothly running, with multi-modal now. Requirements python 3.12 uv About voice model tldr: The voice is actually voice of game character. We're using elevenlab's instant voice cloning model based on Kal'tsit(JP) and adjusted similarity and stability.