Streaming decoder is very slow

Question

Streaming decoder is very slow

Closed this issue 8 months ago · 9 comments

I have been experimenting with using the encoder and decoder on some data and I have found that the stream decoder is several orders of magnitude slower than the encoder.

I have made some very minor alterations to the passthrough.py example file, starting a timer with the perf_counter_ns() command from the time module, and printing the result for both encoding and decoding. For example, encoding a file takes 0.59 ms, while decoding it seems to take over 3 seconds.

This appears to be limited to the stream decoder, where the file decoder offers performance that is on parity with the encoder. This is including the overhead that the file decoder has with performing several reads and writes to files, while the stream encoder should be much faster.

Answer 1 · 2023-12-31T15:07:32.000Z

I experience this same super slow behavior when using this decoder in my project. The decoder.finish() takes about 3 seconds. I suspect this has something to do with coordination between the thread requesting the decoding and the thread that is doing the actual decoding, and not the actual decoding time required. I wish there were a way to do it all in the same thread!

If I replace the finish() with a sleep(.1) and then just look at the decoded data area, it is there in all my tests. It's like the sleep allows the decoding thread to do its work, which it can do quickly, and then we can proceed with the decoded data. But if you try to do a finish() then you are stuck waiting for multiple seconds.

If I don't have the sleep() and don't do a finish() and look immediately, then there is no decoded data in the output area.

Answer 2 · 2023-12-31T15:12:16.000Z

Looking at finish() in decoder.py, I bet it is this code that is causing the 3 second delay:

    # --------------------------------------------------------------
    # Instruct the decoder to finish up and wait until it is done
    # --------------------------------------------------------------
    self._done = True
    super().finish()
    self._thread.join(timeout=3)
    if self._error:
        raise DecoderProcessException(self._error)

There is a 3 second timeout coded in there for the thread join, and it takes 3 seconds every time. Very annoying.

Answer 3 · 2023-12-31T15:24:48.000Z

More information: the join() does appear to be timing out and the thread remains running! You can verify this with a check using decoder._thread.is_alive() .

So why is the join() timing out?

Answer 4 · 2023-12-31T15:58:13.000Z

The problem probably only appears when doing in-memory decoding (as opposed to outputting data to a file) because the blocking on I/O helps the threads share work (compute-bound threads with GIL can be problematic), but there is still a bug somewhere because the finish() should yield to the decoder thread, but the join() always takes the full 3 second timeout and the docoder thread still never exits!

Answer 5 · 2023-12-31T21:20:46.000Z

There was a bug in pyflac streaming logic (several actually) and there is a fix, which a colleague may be trying to get pulled into the repo at some point. The basic description of the problem above is correct: the thread join was never happening, the decoding thread never knew it was done.

Answer 6 · 2023-12-31T23:40:08.000Z

I believe this is a duplcate of #22

I have submitted a fix #25

Answer 7 · 2024-03-29T14:51:28.000Z

@kevinmcaughey @rpdrewes @brandyn Thanks for raising this issue, and apologies for the delay. I have submitted a fix for review in #26

Answer 8 · 2024-04-16T15:09:32.000Z

Thank you all for reporting these issues and improving pyFLAC. The fixes should now be released in https://pypi.org/project/pyFLAC/3.0.0/

Answer 9 · 2024-04-16T16:29:51.000Z

@joetoddsonos The main underlying issue, which I meant to report but hadn't gotten around to, was that the super().finish() was being called before the thread join() in finish() in decoder.py. I'm glad to see that has been fixed. Thanks!