Hello, can the model process audio in real time?

Question

Hello, can the model process audio in real time?

Opened this issue 4 years ago · 8 comments

Answer 1 · 2021-06-25T08:16:37.000Z

Yes, you will have some delay, but it is possible.
Additionally, DCCRN achieved the best MOS in the subjective listening test of the Interspeech 2020 Deep Noise Suppression Challenge.

Answer 2 · 2021-08-24T02:46:14.000Z

Thanks for your reply. Did you train dccrn-e, the model with a 37.5 ms delay? Can I do real-time audio stream processing with sounddevice? The frame length is 37.5ms and frame shift...

Answer 3 · 2021-08-27T06:01:24.000Z

I trained dccrn-e. And yes, you can. I tried a real-time implementation with sounddevice in the past, and as I remember the performance was not bad.

+) Sorry, I made a mistake in my answer. To be exact, I used dccrn-cl. The difference from dccrn-e is the kernel size.
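For anyone wanting to try the sounddevice route, here is a minimal sketch of how a callback stream could be wired up. It is not the code used in this thread. The 16 kHz sample rate is an assumption, the block size is simply 37.5 ms converted to samples, and `enhance_block` is a hypothetical placeholder where the trained model would run.

```python
def block_size(sample_rate, frame_ms):
    """Convert a frame duration in milliseconds to a number of samples."""
    return int(round(sample_rate * frame_ms / 1000.0))


def enhance_block(block):
    """Hypothetical placeholder: a real version would run the trained model here."""
    return block


def run_stream(seconds=10, sample_rate=16000, frame_ms=37.5):
    """Pass microphone input through enhance_block and play the result."""
    # Imported lazily: sounddevice needs PortAudio and an audio device.
    import sounddevice as sd

    def callback(indata, outdata, frames, time, status):
        if status:
            print(status)  # reports input overflows / output underflows
        # indata and outdata have shape (frames, channels).
        outdata[:] = enhance_block(indata)

    with sd.Stream(samplerate=sample_rate,
                   blocksize=block_size(sample_rate, frame_ms),
                   channels=1, dtype="float32", callback=callback):
        sd.sleep(int(seconds * 1000))
```

Calling `run_stream()` would pass audio through for ten seconds. The model call has to finish within one block duration, otherwise the stream will report underflows through `status`.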

Answer 4 · 2021-08-28T10:04:16.000Z

Hi. Each frame is sent to the model separately for processing, so is that why there is noise between frames?

Hi, thank you for replying to me every time. After processing, I spliced the audio frames back together and heard some noise. What do I need to do between consecutive frames of audio data?

Answer 5 · 2021-08-31T10:49:43.000Z

You may get some unwanted noise.
Umm... Could you please elaborate a bit more on what you mean by that question?

Answer 6 · 2021-08-31T11:05:53.000Z

Hi, I have updated my comment above to clarify the question.

Answer 7 · 2021-08-31T14:25:09.000Z

It's hard to give a definitive answer because I don't know your exact process, but I think it's important to determine whether the unwanted noise is distortion caused by the coding process or an error introduced when merging frames.

+)
If the noise is distortion caused by the coding process, it helps to train the DCCRN in advance to estimate the target speech from speech that already contains the coding noise.
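On the other possibility, an error when merging frames: if the processed frames are simply concatenated, each boundary can produce a click. One common remedy, offered here as a guess at the cause and not a confirmed diagnosis, is to process chunks that overlap by 50% and crossfade the outputs with a Hann window. The sketch below uses plain Python lists to stay dependency-free, and `enhance_frame` is a hypothetical stand-in for the model call.

```python
import math


def hann(n):
    """Periodic Hann window; copies shifted by n/2 sum to exactly 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def overlap_add(signal, frame_len, enhance_frame):
    """Process 50%-overlapping frames and crossfade the outputs.

    frame_len must be even. The first and last half frame are covered
    by only one window, so they fade in and out.
    """
    hop = frame_len // 2
    win = hann(frame_len)
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = enhance_frame(signal[start:start + frame_len])
        for i in range(frame_len):
            # Weight each output sample and add it into the shared buffer.
            out[start + i] += frame[i] * win[i]
    return out


# With an identity "model", the overlapped region is reconstructed exactly.
x = [math.sin(0.3 * i) for i in range(64)]
y = overlap_add(x, 16, lambda frame: list(frame))
```

In a live stream the same idea applies block by block: keep the second half of the previous weighted output and add it to the first half of the current one before playing it. Overlapping adds half a frame of extra latency.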

Answer 8 · 2021-09-28T06:14:31.000Z

Hi, have you achieved real-time processing now? @zqlsnr