Documenting how to access Bhashini Speech to text

Question

Closed this issue a year ago · 0 comments

Documentation for using Bhashini models is provided here

You need to do the following :

I have also created a collab here with an example of the same. You need to provide your own API key in the collab

Create an API Key using the “Generate” button under the “My Profile” section. Ensure that your app name uses lowercase words and underscores.
Use the API provided in my collab to convert wav file to bas64 and run
In the collab, I have combined the pipeline APIs that is used to get the authorization and 'model to hit' with the ASR model to quickly run ASR. I have also added batching to enable it to run for bigger wav files