Issues
20s audio - processing time = 4min
#16 opened by NavodPeiris - 0
English transcription - Whisper Medium (small, tiny models are inaccurate), slow processing
#15 opened by NavodPeiris - 1
train scream detection, cry detection models
#14 opened by NavodPeiris - 1
audio has noise
#12 opened by NavodPeiris - 1
test with multiple devices
#10 opened by NavodPeiris - 1
numbers not spelled out by TTS
#13 opened by NavodPeiris - 1
Run 10 test cases in different ways.
#4 opened by NavodPeiris - 1
speaker recognition not working
#6 opened by NavodPeiris - 1
Use Pyannote speaker diarization 3.0.0
#8 opened by NavodPeiris - 1
fail on fd 51, errno: 104, "Connection reset by peer": crashed suddenly while the ESP32 was streaming.
#11 opened by NavodPeiris - 1
add document queries through LLM
#5 opened by NavodPeiris - 1
add printing emotion recognition
#2 opened by NavodPeiris - 1
add emotion recognition
#1 opened by NavodPeiris