I deployed a Custom Inference model for Wav2Vec2 Then I connected to it with this app! You can see the custom inference model here.
- Some STUN Servers might not work
- Figuring out a better solution than a timeout to get the audio frames would get much better performance
The biggest challenge by far was getting streamlit-webrtc working.