Real Time Speech to text conversation
surendransuri opened this issue · 2 comments
surendransuri commented
Description of the feature request:
Hi, I want to use Gemini 1.5 Pro's newest multi-model. I'm looking for a feature similar to what's in Vertex AI playground, where we can convert speech to text in real-time. Right now, In this Gemini API examples we see that inferencing does this in batches after uploading a file, but I need it to happen in real-time. Can you help me figure out how to do this? Thanks a lot!
What problem are you trying to solve with this feature?
No response
Any other information you'd like to share?
No response
random-forests commented
Hi surendransuri, thanks for your question. Google AI Studio and the Gemini API do not support audio streaming right now. You can ask more about the best ways to work with audio on https://discuss.ai.google.dev/.
rflpsz commented
you can just use speechSynthesis from Google Chrome native API.