MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Jupyter NotebookBSD-2-Clause
Issues
- 2
separating tracks killed
#196 opened by liabozarth - 0
Installation issues with Windows
#195 opened by sharik-siddiqi - 4
msdd_model.diarize() RuntimeError: shape '[138, 50, 16, 192]' is invalid for input of size 84787200
#130 opened by Ko8rah - 3
- 10
CTC forced alignment error
#190 opened by tophee - 1
No such file or directory: '/usr/local/lib/python3.10/dist-packages/ctc_forced_aligner/punctuations.lst'
#194 opened by dr-who123 - 4
- 3
IndexError: list index out of range
#192 opened by pakerfeldt - 2
multiple speaker compatability
#191 opened by pgegg02 - 3
- 10
Failed to install on Apple Silicon
#177 opened by fkostadinov - 5
- 3
Output format
#187 opened by famda - 2
- 1
WhisperX forced alignment
#173 opened by ngcheeyuan - 3
- 5
- 2
Numpy Conflict - current requirements.txt
#157 opened by filmo - 0
Using multi-gpu parameters
#153 opened by Arxad - 3
Danish language support
#134 opened by kasperhk - 3
Json output?
#132 opened by vladgrand2 - 7
- 3
- 1
python version it best works in ??????
#182 opened by gprithvi369 - 1
Issue with an audio/video file
#156 opened by dchapelet - 1
How to use Yaml File
#176 opened by arrrrr3186 - 1
word_timestamps - IndexError: list index out of range
#178 opened by Reinmor - 3
[NeMo W 2023-11-28 20:23:00 transformer_bpe_models:59] Could not import NeMo NLP collection which is required for speech translation model.
#135 opened by vladgrand2 - 2
Language param not working
#171 opened by alexauvray - 2
- 1
Only part of audio transcribed
#151 opened by NasonZ - 3
- 1
install issue
#172 opened by transcriptionstream - 7
Installing from requirements.txt leads to the installation of ?every version of the packages needed
#161 opened by grantbarrett - 2
- 4
- 1
Suggestion to add translation
#146 opened by Asma-droid - 3
- 6
Problems with requirements
#136 opened by federicotorrielli - 1
Please, fix the language bug
#148 opened by agershun - 5
Notebook isn't working
#150 opened by NasonZ - 5
- 0
Error in diarization
#160 opened by rashi-budati - 1
Diarization is not working fine for all audios
#147 opened by Asma-droid - 2
Other languages
#145 opened by peregilk - 0
Dependency conflict with whisperx 3.1.1
#139 opened by gexxxter - 5
about whisperx and diarization.py
#140 opened by vladgrand2 - 0
RuntimeError: Calculated padded input size per channel: (1). Kernel size: (2). Kernel size can't be greater than actual input size
#141 opened by pk41561 - 3
- 0
whisperx
#133 opened by vladgrand2