google/uis-rnn

Test model

Closed this issue · 1 comment

Sorry if this sounds like a dumb question. I am not an expert in either Python or speaker diarization. After I have trained the model, how can I use it to determine who is speaking in a wave file? I am trying to determine who is speaking in a single audio recording of a telephone conversation.

Could I, for example, use test_sequence = wavfile.read(mywav) as an input to
predicted_cluster_id = model.predict(test_sequence, args), and get a prediction of who spoke in this file?

My question is more about how to use the code. I hope you can help!

  1. The input must be speaker-discriminative embeddings computed using another library, not raw waveform signals.
  2. UIS-RNN must be trained before you can use it for prediction (see the sketch after this list).
  3. Please read the paper, or at least the README.md file first.
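For reference, a prediction call looks roughly like the sketch below, following the workflow in the README. The random placeholder embeddings and the saved-model file name are assumptions for illustration only; real inputs must be speaker-discriminative embeddings (e.g. d-vectors) produced by a separate speaker-embedding library.

```python
import numpy as np
import uisrnn

# Placeholder embeddings: in a real pipeline these must be speaker-discriminative
# embeddings (e.g. d-vectors) extracted from the wave file by another library.
# uis-rnn never sees the raw waveform.
observation_dim = 256  # must match the dimension used during training
test_sequence = np.random.rand(1000, observation_dim)  # (num_segments, observation_dim)

model_args, _, inference_args = uisrnn.parse_arguments()
model_args.observation_dim = observation_dim
model = uisrnn.UISRNN(model_args)

# Load weights that were saved with model.save(...) after model.fit(...);
# "saved_uisrnn_model.uisrnn" is an assumed file name, not something the repo ships.
model.load("saved_uisrnn_model.uisrnn")

# predict() returns one integer speaker label per embedding in test_sequence.
predicted_cluster_ids = model.predict(test_sequence, inference_args)
print(predicted_cluster_ids)
```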