Integral statistics could be misleading.

Question

Integral statistics could be misleading.

Closed this issue 7 years ago · 12 comments

There is a section of code in tabling routines where we attempt to give an integral median and IQR for the starting steady state iteration.

Suppose we get a symetric IQR (out of luck) 1.5 +/- 0.1. It would look like this:

......|_____________|......
             ^
      1.4   1.5    1.6

How would we round this? Well we need to be safe, so we should round the lower bound of the IQR down and the upper bound up, thus over-approximating error. This would give us:

......|_____________|......
             ^
      1     1.5      2

But what would we do with the median?

If we round down we get:

......|_____________|......
      ^
      1              2

If we round up we get:

......|_____________|......
                    ^
      1              2

In neither case is the median in the middle of the error bound, which I find confusing. More confusing that showing a floating point value for the steady state iter.

Hrm...

vext01 commented 7 years ago

ah!

Answer 1 · 2017-04-11T14:12:40.000Z

@ltratt discussed with @snim2, what do you think?

Answer 2 · 2017-04-11T14:37:14.000Z

I am very glad for the diagrams, otherwise I would have struggled to understand the relation of 1, 1.5, and 2 ;)

It's most common to use the mean of the two values either side of the median point. But I don't have a problem with rounding up/down either -- in the grand scheme of things it's defensible either way I think.

Answer 3 · 2017-04-11T14:45:32.000Z

The mean wont be integral, and rounding is clearly wrong, no?

Answer 4 · 2017-04-11T14:47:21.000Z

"Clearly wrong" is kind of overstating it (there are occasions where the median is presented as a single "whole" value), although I personally tend to present it as the average of the two values either side of the median.

Answer 5 · 2017-04-11T15:06:36.000Z

Hrm. I'm not sure, but as long as we have thought about it.

Answer 6 · 2017-04-11T15:08:54.000Z

Just to be clear: I'm agreeing with you that the non-rounding approach is probably better.

Answer 7 · 2017-04-11T15:10:57.000Z

@ltratt @vext01 just to be pedantically clear, to resolve this bug report we now need to represent all three numbers in the median steady state iter (#) and IQR as floats to 1dp?

Answer 8 · 2017-04-11T15:12:25.000Z

No, just the median (not the IQRs).

Answer 9 · 2017-04-11T15:13:38.000Z

That's OK as long as you are OK with the median (potentially) not being in the middle of the error bounds...

Answer 10 · 2017-04-11T15:14:42.000Z

Good point. Maybe 1dp is the way to go. I mean, it will still sometimes not be quite in the middle, but it'll be close enough not to worry about it.

Answer 11 · 2017-04-11T15:17:19.000Z

Right. So you probably would want the median and the error all as floats.