Integer overflow errors when compiling to JS via jsoo

Question

darrenldl opened this issue 4 years ago · 23 comments

We need to compile containers to JS for our doodlinator project via js_of_ocaml.

During compilation, we get the following integer overflow warnings (which I believe come from jsoo compiling containers):

$ make
dune build bookmarklet/doodlinator.bc.js web/webpage.bc.js
 js_of_ocaml .js/containers/containers.cma.js
Warning: integer overflow: integer 0x5555555555555555 (-3074457345618258603) truncated to 0x55555555 (1431655765); the generated code might be incorrect.
Warning: integer overflow: integer 0x3333333333333333 (3689348814741910323) truncated to 0x33333333 (858993459); the generated code might be incorrect.
Warning: integer overflow: integer 0x3333333333333333 (3689348814741910323) truncated to 0x33333333 (858993459); the generated code might be incorrect.
Warning: integer overflow: integer 0xf0f0f0f0f0f0f0f (1085102592571150095) truncated to 0xf0f0f0f (252645135); the generated code might be incorrect.

The last integer matches Sys.max_array_length, which is used in core/CCVector (and data/CCPersistentArray, data/CCPersistentHashtbl, but we're not using containers-data).

Version info

ocaml: 4.08.1
containers: 3.1
jsoo: 3.8.0

System info

Linux 64 bit

Answer 1 · 2020-12-28T03:20:38.000Z

This is likely caused by the module CCShimsInt_.ml, produced by src/core/mkshims.ml. It is generated with 64-bit constants because the compiler doing the build is in 64-bit mode.

I think we need to use dune configurator to obtain ocamlc information for the target architecture, not the current architecture.

Answer 2 · 2020-12-28T03:25:55.000Z

Can you please try the branch fix-346 ?

Answer 3 · 2020-12-28T03:34:25.000Z

Still gives the same error after opam pin add containers https://github.com/c-cube/ocaml-containers.git#fix-346 - did I get the command wrong?

Answer 4 · 2020-12-28T03:42:16.000Z

Seems like the right command, did it reinstall everything?

Answer 5 · 2020-12-28T03:43:14.000Z

Yep, and I reinstalled everything again as well.

Answer 6 · 2020-12-28T03:45:15.000Z

In _build/log, is there any mention of int_size or "target word size"?

Answer 7 · 2020-12-28T03:49:41.000Z

Snippet of _build/log

#  ; ocaml_config =
#      { version = "4.08.1"
...
#      ; architecture = "amd64"
#      ; model = "default"
#      ; int_size = 63
#      ; word_size = 64
...
#      ; host = "x86_64-pc-linux-gnu"
#      ; target = "x86_64-pc-linux-gnu"
...
#      }

Answer 8 · 2020-12-28T03:51:30.000Z

Full build log: https://gist.github.com/darrenldl/9d2fa5ddab27f79bbda6c82b70dbe17e

Answer 9 · 2020-12-28T03:53:17.000Z

That's very unfortunate if that's what dune gives us for a jsoo target. I'll ask how to get the actual int size.

Answer 10 · 2020-12-28T16:27:23.000Z

Can OCaml even handle an int size of 53 bits, which is what JS uses?
Why does JS truncate to 32 bits? Is it because the JS binary functions are limited to 32 bits?

Answer 11 · 2020-12-28T22:03:57.000Z

Can OCaml even handle an int size of 53 bits, which is what JS uses?

Why does JS truncate to 32 bits? Is it because the JS binary functions are limited to 32 bits?

There is no such thing as a 53-bit int. There are 64-bit floats, which can represent integers exactly up to 53 bits.

JavaScript knows about 32-bit integers (via the bitwise operations: xor, or, and, ...), and that is what jsoo uses to represent OCaml integers (preserving the overflow semantics).
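To make the truncation in the warnings concrete, here is a minimal sketch that reproduces the arithmetic with the standard Int64 and Int32 modules. The helper name truncate_to_32 is made up for illustration, and this only mirrors what the warning text reports (the low 32 bits are kept); it is not jsoo's actual implementation.

```ocaml
(* Hypothetical helper: keep only the low 32 bits of a 64-bit constant,
   which is the truncation the jsoo warning describes. *)
let truncate_to_32 (x : int64) : int32 = Int64.to_int32 x

let () =
  (* The three distinct constants from the warnings above. *)
  List.iter
    (fun x ->
       let t = truncate_to_32 x in
       Printf.printf "0x%Lx truncated to 0x%lx (%ld)\n" x t t)
    [ 0x5555555555555555L; 0x3333333333333333L; 0x0f0f0f0f0f0f0f0fL ]
```

The decimal values printed should match the ones in the warning messages (1431655765, 858993459, 252645135).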

Answer 12 · 2020-12-29T15:37:57.000Z

@darrenldl does the latest commit fix your problem? it checks int_size at runtime.

Answer 13 · 2020-12-29T16:03:07.000Z

I looked at the latest commit. I think it fixes the semantics but will still trigger the warning message.
A way to fix the warning would be to compute the constants instead of using literals (e.g. int_of_string "0x55555").
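A minimal sketch of that suggestion, assuming the wide constant is only needed when ints have at least 63 bits. The name mask_fives is hypothetical and this is not the code from the containers commit. Because the 64-bit value only appears inside a string, there is no oversized integer literal left for jsoo to truncate.

```ocaml
(* Hypothetical name. The wide mask is parsed from a string at runtime, and
   only when ints are wide enough to hold it; otherwise a literal that fits
   in 32 bits is used. *)
let mask_fives : int =
  if Sys.int_size >= 63 then int_of_string "0x5555555555555555"
  else 0x55555555

let () = Printf.printf "int_size = %d, mask = 0x%x\n" Sys.int_size mask_fives
```

The guard matters: on a target with 32-bit ints, int_of_string would raise Failure for a value this large, so the string must only be parsed on the wide branch.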

Answer 14 · 2020-12-29T16:12:04.000Z

It'd fix the warning, but it would produce worse code on x86_64, would it not? The compiler would have no idea what the constants look like, and the whole point of this is to have a reasonably efficient popcount.

Is there an annotation one can put to disable this warning?

Answer 15 · 2020-12-29T18:08:20.000Z

This really needs conditional compilation to be fixed 'properly'. @c-cube, how does this compile on 32-bits?

Answer 16 · 2020-12-29T18:15:15.000Z

@bluddy there already is conditional compilation. Alas, it seems that one can compile on a 64-bit architecture, produce bytecode, and then run this bytecode on a 32-bit architecture (or pass it to jsoo, which is also 32 bits). This is what thwarted the existing conditional generation of shims based on int_size.
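For illustration, a runtime check on Sys.int_size could be shaped roughly like the sketch below. All function names are hypothetical and this is not the containers implementation: the wide branch falls back to a simple loop to keep the example short, whereas the library uses a bit-parallel count there too.

```ocaml
(* Reference implementation: clear the lowest set bit until none remain. *)
let popcount_naive (b : int) : int =
  let rec go n acc = if n = 0 then acc else go (n land (n - 1)) (acc + 1) in
  go b 0

(* Bit-parallel count for ints of at most 32 bits; every literal fits in
   32 bits, so nothing here needs a 64-bit constant. *)
let popcount_32 (b : int) : int =
  let b = b - ((b lsr 1) land 0x55555555) in
  let b = (b land 0x33333333) + ((b lsr 2) land 0x33333333) in
  let b = (b + (b lsr 4)) land 0x0f0f0f0f in
  let b = b + (b lsr 8) in
  let b = b + (b lsr 16) in
  b land 0x3f

(* The int size is tested when the code runs, not when the shims are
   generated, so bytecode built on a 64-bit host still takes the 32-bit
   path when it is executed with 32-bit ints. *)
let popcount (b : int) : int =
  if Sys.int_size <= 32 then popcount_32 b else popcount_naive b
```

A check like this selects the right behaviour on each target, but it does not by itself silence a warning about literals that appear in the branch that is never taken.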

Answer 17 · 2020-12-29T18:26:56.000Z

Then maybe the conditional compilation check should instead also check if we're producing bytecode? Performance isn't such a primary concern then.

Answer 18 · 2020-12-29T18:38:52.000Z

Sadly this is also not provided in the config. The shims are generated once, and used for both native and bytecode targets.

Answer 19 · 2020-12-30T00:26:49.000Z

@darrenldl does the latest commit fix your problem? it checks int_size at runtime.

I still receive the exact same warning

Answer 20 · 2020-12-30T15:33:15.000Z

Wrong close, but I was planning to "wontfix" anyway, I don't see how we can compile for 64 bits and expect things to work in 32 bits.

Answer 21 · 2021-02-17T04:28:26.000Z

Note that the compiler's standard library contains similar code (int_of_string "0x1_0000_0000"): https://github.com/ocaml/ocaml/blob/5661bbfe1f5e5384b09125759e1125d519ae69f3/stdlib/int32.ml#L66 It compiles to a move from a global memory location, so it is only slightly less efficient. Alternatively, large constants may be written using shifts, producing the same code as the current variant: https://gcc.godbolt.org/z/x5bEGe

I think it would be nice to fix this, since the warning shows up in user code, and the fix would also allow removing the shims for it (since the implementation is already selected at runtime using Sys.int_size).

Answer 22 · 2021-02-17T04:29:43.000Z

For reference:

let popcount_64_ (b:int) : int =
  let b = b - ((b lsr 1) land 0x5555555555555555) in
  let b = (b land 0x3333333333333333) + ((b lsr 2) land 0x3333333333333333) in
  let b = (b + (b lsr 4)) land 0x0f0f0f0f0f0f0f0f in
  let b = b + (b lsr 8) in
  let b = b + (b lsr 16) in
  let b = b + (b lsr 32) in
  b land 0x7f

let popcount_64_shifts (b:int) : int =
  let fives = 0x55555555 lor (0x55555555 lsl 32) in
  let threes = 0x33333333 lor (0x33333333 lsl 32) in
  let fs = 0x0f0f0f0f lor (0x0f0f0f0f lsl 32) in
  let b = b - ((b lsr 1) land fives) in
  let b = (b land threes) + ((b lsr 2) land threes) in
  let b = (b + (b lsr 4)) land fs in
  let b = b + (b lsr 8) in
  let b = b + (b lsr 16) in
  let b = b + (b lsr 32) in
  b land 0x7f

let () =
  for _ = 0 to 100_000_000 do
    let b = Int64.to_int (Random.int64 Int64.max_int) in
    assert (popcount_64_ b = popcount_64_shifts b)
  done

Answer 23 · 2021-02-21T20:08:08.000Z

@copy real nice. Would you mind doing a PR with that code?