progressCallback in decodeAudioData

Question

progressCallback in decodeAudioData

ouhouhsami opened this issue 10 years ago · 35 comments

Hello,

What do you think about having a progressCallback with decodeAudioData. In case of large audio files, it could takes time to "decode" the ArrayBuffer, and so the user could be informed about the progress of the decode function.

We then could have:

decodeAudioData(ArrayBuffer audioData,
                DecodeSuccessCallback successCallback,
                optional DecodeErrorCallback errorCallback,
                optional ProgressCallback progressCallback);

Does it make sens?

Answer 1 · 2014-08-13T13:04:31.000Z

Well, we could also use the Progress Events spec (https://dvcs.w3.org/hg/progress/raw-file/tip/Overview.html) to mesure the decode audio data progression.

Answer 2 · 2014-10-29T17:04:23.000Z

See #337 and #371.

Answer 3 · 2014-11-13T17:29:35.000Z

Check with TAG on pattern for this.

Answer 4 · 2015-04-30T14:19:13.000Z

@cwilson I recall that you did check with the TAG but don't see a resolution -- is this ready for editing?

Answer 5 · 2015-05-14T19:22:05.000Z

Shared feeling that user experience of audio apps is much improved by understanding progress of long-running decode operations. @cwilso proposes subclassing ProgressEvent; we will discuss at F2F. One issue is, what object should act as the dispatcher and target of the event?

Answer 6 · 2015-05-14T19:32:06.000Z

I'd suggest subclassing ProgressEvent just to add the ArrayBuffer to it, and expecting developers to use that as the key for which decode this is a ProgressEvent for, and targeting the events to the AudioContext (which is already an EventTarget).

Answer 7 · 2015-05-14T23:18:31.000Z

@cwilso Ick. What benefit does the ProgressEvent have over a callback? With a callback, there's no need to figure out context; authors can use JavaScript scopes to track what decode operation is associated with what callback. With a ProgressEvent, there can only ever be one handler, on the AudioContext. With a callback, each decode operation can have a separate callback handler.

Answer 8 · 2015-05-15T06:46:26.000Z

@jernoble Doesn't addEventListener allow for multiple event listeners to be added to the AudioContext?

Lets contrast dummy code written against the ProgressEvent and the callback approaches.

With ProgressEvent (AudioContext as target):

var audioContext = ...;
var arrayBuffer = ...;
var progressObserver = function(event) {
    if (arrayBuffer === event.arrayBuffer) {
        // Update UI with progress...
    }
};
audioContext.addEventListener('decodeprogress', progressObserver);
audioContext.decodeAudioData(arrayBuffer).then(function(audioBuffer) {
    audioContext.removeEventListener('decodeprogress', progressObserver);
    // Do something with the audioBuffer...
}, function() {
    audioContext.removeEventListener('decodeprogress', progressObserver);
    // Cry...
});

With callback:

var audioContext = ...;
var arrayBuffer = ...;
var progressObserver = function(progress /* or event? */) {
    // Update UI with progress...
};
audioContext.decodeAudioData(arrayBuffer, null, null, progressObserver).then(function(audioBuffer) {
    // Do something with the audioBuffer...
}, function() {
    // Cry...
});

For the callback approach, I assumed that the signature for decodeAudioData has a new optional callback following the optional legacy (pre-promise) success and failure callbacks.

Answer 9 · 2015-05-15T14:33:12.000Z

@mark-buer

@jernoble Doesn't addEventListener allow for multiple event listeners to be added to the AudioContext?

Yes, that's true. I was thinking of a single onprogress attribute. Still, they'll each get fired for every simultaneous decode progress event, so you end up with--effectively--a single callback containing a giant if-then-else-if statement (as you illustrate with your sample code).

var progressObserver = function(progress /* or event? */) {

I would suggest duration here. There are a few audio formats whose durations cannot be known until the entire file is decoded. Passing a percentage here (length decoded / total duration) would be incorrect for these formats.

Answer 10 · 2015-05-15T14:33:33.000Z

@mark-buer yes, you can have multiple event listeners. I'd expect in most apps that you'd want a consistent UI for decoding progress, so you'd actually WANT to collect the progress all in one place. Also, I doubt (for that reason) that you'd want to remove the listener all the time, and the pattern would be:

var audioContext = ...;
var arrayBuffer = ...;

audioContext.addEventListener('decodeprogress', function(event) {
    // Update UI with progress for given ArrayBuffer...
});

audioContext.decodeAudioData(arrayBuffer).then(function(audioBuffer) {
    // Do something with the audioBuffer...
}, function() {
    // Cry...
});

@jernoble It's not "Ick" - it's the benefit of having a consistent pattern of code for consistent patterns of scenario. This is precisely why we use Promises now rather than success/error callbacks in modern APIs - because it's a consistent pattern, and developers don't have to keep looking up what the right pattern is.

Answer 11 · 2015-05-15T15:22:00.000Z

@cwilso It is "ick", precisely because this breaks the consistent pattern of objects which emit progress events. You don't have a global object which represents all HTTP requests, and force all users to do a giant switch statement to figure out which progress goes with which request. And that's what you're suggesting we add here.

Answer 12 · 2015-05-15T15:23:58.000Z

Or, it'd be like having a single VideoContext object, and forcing clients of HTMLMediaElement to all listen for progress events emitted out that global object, rather than the media element itself.

Answer 13 · 2015-05-15T15:32:32.000Z

@cwilso

...so you'd actually WANT to collect the progress all in one place...

If you had a per-decode callback (or had a per-decode object as the target for a progress event), you could still get that behavior by passing the same callback/listener for each decode, without handicapping other use cases.

Answer 14 · 2015-05-15T15:40:54.000Z

As a developer who has sometimes had to play the game of mapping globally dispatched events back to specific handlers, I have to agree with @jernoble on the undesirability of firing all this out of the AudioContext. I don't think we should assume that progress events are all going to be collected and summarized in one place.

A per-decode object would be nicest. A per-decode callback is not as nice, but we started this method with callbacks.

Perhaps another cleaner possibility is to support a new method, .progress(), on the returned Promise (which would then be a subclass of Promise I guess).

Answer 15 · 2015-05-15T15:47:45.000Z

@joeberkovitz I'd rather add a new per-decode object than try to subclass Promise. I've got a spec patch for adding streaming audio decoding, which might help illustrate what this might look like: http://jernoble.github.io/web-audio-api/#the-audiostreamdecoder-interface | https://github.com/jernoble/web-audio-api/tree/337-audiostreamdecoder

Answer 16 · 2015-05-15T17:24:16.000Z

@jernoble That seems like a very elegant approach.

Answer 17 · 2015-05-18T01:37:22.000Z

@jernoble At the risk of heading off-topic, a per-decode object might also provide a simple API for (potentially large/numerous) decodes to be cancelled. The current API offers no mechanism for this.
In our app, we opportunistically queue up many large decodes ahead of time so that the decoded data is ready if/when we need it. User interactions dictate the set of required decoded data. These interactions can occur in rapid succession. Ideally, our app would cancel unneeded incomplete decodes before initiating new decodes, potentially saving time and energy (especially on mobile).

Answer 18 · 2015-06-02T13:43:13.000Z

Resolution: we will add a third, optional progressCallback argument to decodeAudioData() with this type:

callback ProgressCallback = void (unsigned long long decoded, unsigned long long total);

The decoded and total counts refer to the number of bytes in the input buffer -- i.e. decoded means "number of input bytes decoded so far", not "number of decoded output bytes".

Answer 19 · 2016-06-24T16:29:00.000Z

Hmm. In #30, it has been suggested to add a fourth optional parameter for a dictionary to allow disabling the resampling for decodeAudioData. This now conflicts with #335 (comment) from @joeberkovitz.

How do we reconcile these two? Make the progressCallback method another entry in the dictionary?

Answer 20 · 2016-06-28T13:18:41.000Z

The WG should discuss this. Because the conflict involves a comment from me (which just recorded an earlier resolution by the group) I want to clarify that I'm not opposed to changing the approach to a unified dictionary that includes the callback.

Answer 21 · 2016-06-30T16:42:23.000Z

Per teleconf: Need to look into how merge these two ideas.

Any suggestions, @domenic ?

Answer 22 · 2016-06-30T18:29:02.000Z

I think since callbacks and dictionaries are distinguishable, you could add an overload. Something like:

// legacy version
audioContext.decodeAudioData(arrayBuffer, successCb, errorCb);

// new hotness
audioContext.decodeAudioData(arrayBuffer, {
  progress(decoded, total) { /* ... */  },
  disableResampling: true
})
.then(successCb, errorCb);

Answer 23 · 2016-06-30T20:41:21.000Z

Thanks for the suggestion. @padenot was wondering if this was good Web style to do this kind of overloading. I assume this is ok, based on your suggestion.

Answer 24 · 2016-06-30T20:44:04.000Z

I think in general it's bad style to do that kind of overloading (in particular I personally dislike overloading more than most). However, in my opinion it's fairly justified in this case.

The mitigating factors are: (1) other alternatives all seem worse; (2) this is less about providing two overloads you expect people to use both of, than it is about providing a new method and leaving the old method around without breaking back-compat.

Answer 25 · 2016-06-30T20:53:44.000Z

A minor fix for the property bag:

audioContext.decodeAudioData(arrayBuffer, {
  progressCallback: function (decoded, total) { /* ... */ },
  disableResampling: true
})
.then(successCallback, rejectCallback);

Answer 26 · 2016-06-30T20:54:34.000Z

I'm not sure you need a type suffix in the name (processCallback vs. progress). It's pretty clear that progress(decoded, total) is a function. You don't say disableResamplingBoolean.

Answer 27 · 2016-06-30T20:56:19.000Z

Oh, the first value in the property bag was just a function. It needs to be a key-value pair. doesn't it?

Answer 28 · 2016-06-30T20:57:57.000Z

In modern JavaScript you can omit : function for methods.

Answer 29 · 2016-06-30T20:58:30.000Z

Wow. Good to know!

Answer 30 · 2016-07-28T16:29:46.000Z

Resolved that it's better to defer this capability until we develop a cleaner and more fully thought out decoding API?

Answer 31 · 2016-07-29T18:45:16.000Z

c'mon i'm waitin' on this 4 2 years now...

Answer 32 · 2017-01-29T06:44:58.000Z

any updates?

Answer 33 · 2017-06-14T13:27:34.000Z

@fooSolver
@dukuo

For what it's worth, we've found that, in the meantime, a good approximation of determining decode progress is to use a fake progress that lasts for a duration 2.5x as long as a FileReader.readAsArrayBuffer call. We managed to stay within a sub-50% error margin (Chrome), which, I must say, isn't that bad at all.

If you really need to display progress, I'd say this is still much better than showing no progress at all.

This is of course entirely implementation dependent, and it's likely that browser updates will eventually drift the value.

Answer 34 · 2019-09-17T02:39:29.000Z

This will now be handled by https://discourse.wicg.io/t/webcodecs-proposal/3662

Answer 35 · 2021-09-24T08:56:39.000Z

This will now be handled by https://discourse.wicg.io/t/webcodecs-proposal/3662

Tried to use WebCodecs today to do this with WebCodecs (unfortunately failing so far), and posted w3c/webcodecs#366 about my findings so far. Any help would be appreciated!