Error in Floating Point Representation tool about subnormal numbers?
pacalet opened this issue · 0 comments
According to IEEE 754-2019:
- $emin$ shall be $1 - emax$ for all formats (section 3.3, page 17),
- when the biased exponent $E = 0$ and the trailing significand $T \neq 0$, the number is subnormal and the corresponding value is $v = (-1)^S \times 2^{emin} \times (0 + 2^{1-p} \times T)$ (section 3.4, page 19),
- for 32-bit precision, $p = 24, emax = 127$ (table 3.5, page 23).
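
The three points above can be checked numerically. The following is a minimal Python sketch, not the tool's code: the helper names `subnormal_value` and `decode_bits` are hypothetical and introduced only for illustration. It evaluates the subnormal formula with exact rationals and compares the result with what the platform decodes from the raw 32-bit patterns.

```python
import struct
from fractions import Fraction

# binary32 parameters quoted above (IEEE 754-2019, table 3.5)
P = 24
EMAX = 127
EMIN = 1 - EMAX  # -126, per section 3.3


def subnormal_value(sign, trailing):
    """Exact value of a binary32 subnormal (biased exponent E = 0).

    Hypothetical helper implementing
    v = (-1)^S * 2^emin * (0 + 2^(1-p) * T) for 0 < T < 2^(p-1).
    """
    if not 0 < trailing < 2 ** (P - 1):
        raise ValueError("T must satisfy 0 < T < 2^(p-1) for a subnormal")
    return (-1) ** sign * Fraction(2) ** EMIN * (Fraction(2) ** (1 - P) * trailing)


def decode_bits(bits):
    """Reinterpret a 32-bit unsigned integer as a binary32 value (hypothetical helper)."""
    return struct.unpack(">f", struct.pack(">I", bits))[0]


smallest = subnormal_value(0, 1)             # S = 0, T = 1
print(smallest == Fraction(1, 2 ** 149))     # True: 2^-126 * 2^-23 = 2^-149
print(f"{float(smallest):.1e}")              # 1.4e-45
print(decode_bits(0x00000001) == float(smallest))  # True
```

With $S = 0$ and $T = 1$ the formula gives $2^{-149}$, which matches the value decoded from the bit pattern `0x00000001`.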
As a consequence, for 32-bit precision the smallest positive subnormal number ($T = 1$) shall be $2^{-126} \times 2^{-23} = 2^{-149} \approx$ 1.4E-45
instead of the wrong
I suggest to replace