Performance in dequant.

Question

Performance in dequant.

Closed this issue 11 years ago · 5 comments

Would it be faster to perform the dequant as each coefficient is decoded? Right now the whole TU is dequantized afterwards, and for typical video that means lots of operations on 0 coefficients.

Doing dequant on the fly would result in way less operations, although it means no opportunity to use SSE2 routines....

Answer 1 · 2013-09-09T18:33:18.000Z

There is a rounding that you have to apply even on zero coefficients.

Mickael

Sent by my iPhone

Le 9 sept. 2013 à 20:23, pieter3d notifications@github.com a écrit :

Would it be faster to perform the dequant as each coefficient is decoded? Right now the whole TU is dequantized afterwards, and for typical video that means lots of operations on 0 coefficients.

Doing dequant on the fly would result in way less operations, although it means no opportunity to use SSE2 routines....

—
Reply to this email directly or view it on GitHub.

Answer 2 · 2013-09-09T18:48:57.000Z

Are you sure? I'm pretty sure that if a decoded coefficient is 0, then the dequant coefficient is always 0 too. This is how we have implemented a HW decoder and it does not have issues with that.

Answer 3 · 2013-09-09T19:06:03.000Z

pretty sure if x == 0 then you have to add "add" in the following code.

#define SCALE(dst, x) (dst) = av_clip_int16(((x) + add) >> shift)

__
Mickaël

Le 9 sept. 2013 à 20:48, pieter3d notifications@github.com a écrit :

Are you sure? I'm pretty sure that if a decoded coefficient is 0, then the dequant coefficient is always 0 too. This is how we have implemented a HW decoder and it does not have issues with that.

—
Reply to this email directly or view it on GitHub.

Answer 4 · 2013-09-09T19:07:51.000Z

Yes, but in that case the downshift will always obliterate whatever add was. "add" does not have any bit position set higher than shift

Answer 5 · 2013-09-09T19:11:26.000Z

you are right.
__
Mickaël

Le 9 sept. 2013 à 21:07, pieter3d notifications@github.com a écrit :

Yes, but in that case the downshift will always obliterate whatever add was. "add" does not have any bit position set higher than shift

—
Reply to this email directly or view it on GitHub.