Inconsistencies with various division operators and NaN/Infinity

Question

jvoigtlaender opened this issue 9 years ago · 15 comments

jvoigtlaender commented 9 years ago

Here is a collection of facts about current arithmetic behavior in Elm:

1. 1 / 0 results in Infinity (type Float)
2. 1 // 0 results in 0 (type Int)
3. 1 % 0 throws a runtime error (type Int)
4. 1 `rem` 0 results in NaN (type Int)
5. functions isNaN and isInfinity both have type Float -> Bool

Here are several oddities/inconsistencies among those:

Saying that 1 divided by 0 is 0 is very strange. (2.)
Throwing a runtime error for one form of "mod division by 0 on integers", namely %, but returning a result for another form of "mod division by 0 on integers", namely rem, is inconsistent. (3. vs. 4.)
Having an expression that results in NaN of type Int, but having no way to check for NaN on type Int, seems like broken design. (4. and 5.)
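
Since Elm compiles to JavaScript, the values in facts 1, 2 and 4 can be reproduced with plain JavaScript number semantics. The TypeScript sketch below is only an illustration, not Elm's actual source: the helper names `floatDiv`, `intDiv` and `remainder` are hypothetical, and the `| 0` truncation is an assumption about how an integer division could end up yielding 0.

```typescript
// Hypothetical helpers illustrating facts 1, 2 and 4 with plain
// JavaScript number semantics. This is NOT Elm's actual implementation.

// Fact 1: floating-point division by zero yields Infinity.
function floatDiv(a: number, b: number): number {
  return a / b;
}

// Fact 2 (assumed mechanism): truncating to a 32-bit integer with `| 0`
// maps Infinity and NaN to 0, so "1 divided by 0" comes out as 0.
function intDiv(a: number, b: number): number {
  return (a / b) | 0;
}

// Fact 4: the JavaScript remainder operator yields NaN for a zero divisor.
function remainder(a: number, b: number): number {
  return a % b;
}

console.log(floatDiv(1, 0));         // Infinity
console.log(intDiv(1, 0));           // 0
console.log(isNaN(remainder(1, 0))); // true
```

Under these assumptions, nothing on the "integer" side is an integer in any machine sense; NaN simply leaks through `remainder` unchanged.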

I propose to change the definitions of //, %, isNaN and isInfinity such that:

1. 1 / 0 results in Infinity (type Float, as before)
2. 1 // 0 results in Infinity (type Int, adapted behavior)
3. 1 % 0 results in NaN (type Int, adapted behavior)
4. 1 `rem` 0 results in NaN (type Int, as before)
5. functions isNaN and isInfinity both have type number -> Bool (generalized types, to be applicable to both Float and Int)

In addition to better consistency, and correcting a mathematical wrongness (1 // 0 = 0), this would have benefits in terms of efficiency, since // and % are currently performing extra checks on each invocation that would then go away.

Answer 1 · 2016-05-10T16:06:53.000Z

I agree that it is consistent, but it allows ±Infinity and NaN, IEEE floating point abstractions, into the Int type, which may one day be represented as machine integers. Additionally, what should List.repeat (1//0) "foo" evaluate to? Empty list, sure, but when writing functions that take integers one loses guarantees that they are actually integers. Finally, it doesn't seem like one can perform operations on ±Infinity and NaN to obtain a finite non-integer value, but I'm still concerned about the possibility.

Answer 2 · 2016-05-10T16:14:04.000Z

@mgold, do you have an alternative proposal to handle cases 1.-5. in a consistent way?

Not all of them necessarily need to be consistent. For example, 1. and 2. need not, since they are on different types. But 3. and 4. being inconsistent seems clearly undesirable, since both are some form of "mod division on integers".

Answer 3 · 2016-05-10T16:18:51.000Z

2, 3, and 4 should either all return 0 or all throw runtime errors. 1 and 5 stay unchanged.

Answer 4 · 2016-05-10T16:23:31.000Z

2 returning 0 seems an abomination to me. Is there any precedent (say, a programming language) in which 1 divided by 0 is given as 0?

Making all of 2, 3 and 4 throw runtime errors was exactly the content of my closed #565 and #576. @rtfeldman disagreed.

Answer 5 · 2016-05-10T16:42:08.000Z

Is there any precedent for a (Turing-complete) language to try so hard to avoid runtime errors? Ask a nonsense question, get a nonsense answer. Or, crash and hopefully catch the bug in development or testing. I'm actually undecided between the two.

I'm not sure exactly what @rtfeldman was talking about when he said he wanted to remove the check, so let's see what he has to say now that we've framed the issue a little better.

Answer 6 · 2016-05-16T23:55:49.000Z

To clarify, I'm not advocating one way or the other; I just wanted to note some facts (and an explanation I'd heard at some point for why some of them work the way they do) about the current state of things. 😄

I have yet to encounter any of these edge cases in practice, and don't have particularly strong feelings about this.

Answer 7 · 2016-07-20T04:52:48.000Z

I just ran into case 2.

Anyone else seen a language in which 5 // 0 was 0? I found that really weird.

Answer 8 · 2016-07-21T02:49:30.000Z

"Is there any precedent for a (Turing-complete) language to try so hard to avoid runtime errors? Ask a nonsense question, get a nonsense answer."

To be clear, the second sentence refers to division by zero as a nonsense question, not the question that you are asking.

Answer 9 · 2016-09-22T18:37:35.000Z

Consolidated all the math-related stuff into the #721 meta issue. Follow along there!

Answer 10 · 2017-05-04T13:34:53.000Z

I just noticed you can actually coerce Infinity to be an Int:

> round (1 / 0)
Infinity : Int

This does not seem intentional, either.
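
A possible explanation, sketched in TypeScript: if `round` is backed by JavaScript's `Math.round` (an assumption, not something confirmed in this thread), Infinity passes through unchanged. The stricter `roundToInt` variant is hypothetical and only shows what rejecting non-finite input could look like.

```typescript
// Assumption: `round` is modelled here by JavaScript's Math.round, which
// returns Infinity unchanged instead of producing an integer.
const rounded: number = Math.round(1 / 0);
console.log(rounded); // Infinity

// Hypothetical stricter variant that refuses non-finite input
// (Infinity, -Infinity and NaN) instead of passing it through.
function roundToInt(x: number): number {
  if (!isFinite(x)) {
    throw new RangeError("cannot round non-finite value: " + x);
  }
  return Math.round(x);
}

console.log(roundToInt(2.4)); // 2
```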

Answer 11 · 2017-07-17T12:32:40.000Z

I strongly recommend NOT making cases like (x / 0) return Infinity or sqrt(-1) return NaN, as proposed above by jvoigtlaender. I come from a numerical computing background, and my experience is that it's toxic to allow silent propagation of numeric exceptions. Infinity and NaN are NOT just values like any other float. In the worst case, you end up with strange behaviour of your program at some distant point because a NaN result from some calculation spread through your code (NaN + x = NaN). Instead, you want your program to crash as soon as possible because it obviously contains a bug.
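
To make the propagation concrete, here is a small TypeScript sketch using JavaScript number semantics. The sample data and the `checkedSqrt` helper are hypothetical; they only contrast silent propagation with failing at the point of the error.

```typescript
// Silent propagation: a single NaN poisons every later arithmetic result.
const samples: number[] = [1, 2, Math.sqrt(-1), 4]; // Math.sqrt(-1) is NaN
const silentSum: number = samples.reduce((acc, x) => acc + x, 0);
console.log(silentSum); // NaN, with no hint of where it came from

// Hypothetical fail-fast alternative: raise an error where the invalid
// operation happens, so the bug surfaces immediately.
function checkedSqrt(x: number): number {
  if (x < 0) {
    throw new RangeError("square root of negative number: " + x);
  }
  return Math.sqrt(x);
}

console.log(checkedSqrt(9)); // 3
```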

Answer 12 · 2017-07-17T12:41:25.000Z

For the record:

1 / 0 returning Infinity is what Elm currently does, not something that goes back to a proposal of mine
sqrt(-1) returning NaN isn't part of anything I proposed either

Answer 13 · 2017-07-17T13:23:32.000Z

Sorry for my imprecise citation.

I saw that you proposed "1 `rem` 0 results in NaN (type Int, as before)" and generalized to "sqrt(-1)". My argument above can certainly be extended to "1 `rem` 0" (i.e., don't return NaN).
Sorry that my description suggested the status quo of "1 / 0" returning Infinity was part of your proposal. I consider this status quo to be a problem.

Answer 14 · 2017-07-17T13:58:13.000Z

Also 1 `rem` 0 resulting in NaN is part of the status quo, not something I thought up. 😄

But I do get that your general thrust is to raise a runtime error much more often than the status quo does. That is certainly one way to make things more consistent (than they are in the status quo, where some things raise runtime errors and others don't).

Answer 15 · 2017-12-29T08:04:40.000Z

I suggest defining m % 0 and rem m 0 to return m. Here is my reasoning:

I would expect m and m % n to be congruent mod n, whenever m % n is defined. There's a reason it's called the modulo operator!
Elm has a policy of "No Runtime Exceptions", so m % n should always be defined.
So for all m and n, m % n exists, and is congruent to m mod n.
By the definition of congruence mod n, m - (m % n) is a multiple of n.
Setting n equal to zero, m - (m % 0) is a multiple of zero.
Therefore, m - (m % 0) = 0, and so m % 0 = m.

By the same logic, rem m 0 should equal m.

Also note that with this definition, the formula m = (m // n) * n + rem m n holds universally.
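
The reasoning above can be checked with a small TypeScript sketch. All function names are hypothetical, `intDiv` assumes the status quo where m // 0 is 0 (modelled with `| 0` truncation), and `modProposed` assumes % takes the sign of the divisor; none of this is Elm's actual code.

```typescript
// Hypothetical model of the proposal: a zero divisor returns the dividend.

// Truncated remainder (result takes the sign of the dividend).
function remProposed(m: number, n: number): number {
  return n === 0 ? m : m % n;
}

// Floored modulo (result takes the sign of the divisor), assumed here
// to be the behavior of % for non-zero divisors.
function modProposed(m: number, n: number): number {
  if (n === 0) return m;
  const r = m % n;
  return r !== 0 && (r < 0) !== (n < 0) ? r + n : r;
}

// Assumed status-quo integer division: truncate toward zero; a zero
// divisor yields 0 because Infinity and NaN become 0 under `| 0`.
function intDiv(m: number, n: number): number {
  return (m / n) | 0;
}

// The identity m = (m // n) * n + rem m n from the comment above.
function identityHolds(m: number, n: number): boolean {
  return m === intDiv(m, n) * n + remProposed(m, n);
}

console.log(modProposed(5, 0));    // 5
console.log(identityHolds(7, 0));  // true
console.log(identityHolds(-7, 2)); // true
```

With these definitions the identity holds for n = 0 because (m // 0) * 0 is 0 and the remainder contributes all of m.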