Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Details of specifying Decimal numbers and algorithms #122

Closed
waldemarhorwat opened this issue Apr 12, 2024 · 10 comments · Fixed by #138
Closed

Details of specifying Decimal numbers and algorithms #122

waldemarhorwat opened this issue Apr 12, 2024 · 10 comments · Fixed by #138

Comments

@waldemarhorwat
Copy link

Can we set up a video meeting to discuss how to formalize Decimal numbers and the algorithms involving those in the spec? I'd like to ensure that the spec is correct, clear, unambiguous, and easy to read and, if desired, am willing to provide extensive help to achieve that. I'm available next week Tue-Thu.

@waldemarhorwat
Copy link
Author

waldemarhorwat commented Apr 15, 2024

I'd suggest representing Decimal128 values and algorithms analogously to how we represent Number and BigInt values.

Format

A Decimal128 value would be one of the following:

  • NaN𝔻
  • +∞𝔻
  • -∞𝔻
  • «v, q»𝔻, where v and q satisfy:
    • v is +0𝔻, -0𝔻, or a real number. Here +0𝔻 and -0𝔻 are symbols representing special values, not the real number 0.
    • q is an integer that satisfies -6176 ≤ q ≤ 6111
    • If v is a real number, v × 10-q is an integer n that satisfies 0 < |n| < 1034 

Some definitions:

  • A finite Decimal128 is a Decimal128 value of the form «v, q»𝔻.
  • A zero Decimal128 is a Decimal128 value of the form «v, q»𝔻 where v is +0𝔻 or -0𝔻.
  • A finite nonzero Decimal128 is a Decimal128 value of the form «v, q»𝔻 where v is a real number.

Cohort

The cohort of a Decimal128 value is the value without the q. Specifically:

  • cohort(NaN𝔻) is NaN𝔻
  • cohort(+∞𝔻) is +∞𝔻
  • cohort(-∞𝔻) is -∞𝔻
  • cohortv, q»𝔻) is v. Here v is +0𝔻, -0𝔻, or a real number.

The use of cohorts is pervasive in simplifying specifications of spec algorithm steps, the bulk of which do not depend on q.

Exponent and Significand

Every finite nonzero Decimal128 value «v, q»𝔻 has an exponent and a significand. The exponent is the unique integer e and the significand is the unique real number s that satisfy v = s × 10e and 1 ≤ |s| < 10.

Normalized and Denormalized Values

The exponent e of a finite nonzero Decimal128 value will always be in the range 6144 ≥ e ≥ -6176.

  • A finite nonzero Decimal128 value is normalized if its exponent e is in the range 6144 ≥ e ≥ -6143.
  • A finite nonzero Decimal128 value is denormalized if its exponent e is in the range -6144 ≥ e ≥ -6176.

This corresponds to the IEEE notion of normalized and denormalized values. Denormalized values have progressively limited resolution as they approach zero.

Truncated Exponent

Rounding behavior of denormalized Decimal128 values diverges from that of normalized Decimal128 values. For the purposes of describing rounding algorithms it's useful to define the notion of a truncated exponent te of a finite Decimal128 value.

  • The truncated exponent te of a normalized Decimal128 value is the same as its exponent e.
  • The truncated exponent te of a zero or denormalized Decimal128 value is -6143.

Given this, we can define the scaled significand ss of a finite Decimal128 value as:

  • The scaled significand ss of a zero Decimal128 value is 0.
  • The scaled significand ss of a finite nonzero Decimal128 value «v, q»𝔻 is ss = v × 1033-te, where te is the truncated exponent of the Decimal128 value.

ss will always be an integer with an absolute value less than 1034.

The main importance of a scaled significand ss is in rounding behavior, for which IEEE 754 arithmetic algorithms make different choices depending on whether ss is odd or even. The notion of scaled significands should not be exposed to users.

Edits

  • Added 𝔻 subscript to +0𝔻 and -0𝔻 to reduce confusion with the real number 0.
  • Added cohort function.

@waldemarhorwat
Copy link
Author

waldemarhorwat commented Apr 16, 2024

Some tentative items from the discussion on Tuesday, 16 April 2024. Please add comments if I missed anything.

Overall

  • Decimal should be easy to use and understand by folks without extensive numerics training. It should behave as one would expect a numeric class to work given everyday knowledge of math.
  • Decimal should be usable by numerics experts without surprises to them.
  • Decimal should provide enough common operations that folks don't need to convert a Decimal to Number and back just because the Decimal operation they want is missing.

Specification Techniques

  • We should precisely specify the results of Decimal operations, just like we do so for Number and BigInt operations, by performing operations on mathematical values and explicitly listing the rounding steps.
    • It's desirable for the ECMAScript spec to be self-contained. The IEEE specs are hard to obtain, hard to read, contain numerous extraneous functions, modes, encoding formats, and details not used by ECMAScript, and differ in details and nomenclature depending on which version of the IEEE spec one is looking at.
  • Arithmetic operations should have the well-known IEEE numerical behavior, so we need ±0 and NaN's.
  • Reviewed and discussed the proposal above.

Terminology

  • IEEE 754 has several different notions of what an exponent and a significand is, and they're easy to confuse.
  • For any user-visible APIs, exponent should have the everyday scientific notation meaning of floor(log10(|v|)), or perhaps what's called truncated exponent above.
  • IEEE 754 already has definitions of normalized and denormalized values, which are as in the proposed formalism above. We should not overload these terms to also denote the presence or absence of trailing zeroes in a particular encoding.

Quanta

  • IEEE 754 tracks the number of trailing zeroes for some (but not all) Decimal values. The number of trailing zeros has no effect on the mathematical results of the common Decimal operations, but can be used when formatting Decimal values back to strings.
    • The number of trailing zeroes is encoded in the quantum q in the formalism above.
    • The default toString should provide strings independent of q.
    • We should provide an alternate quantum-preserving conversion to strings that distinguishes based on q. Note that this will render a Decimal value equal to 2500 as one of the thirty-three possible cohort members 25e2, 250e1, 2500, 2500.0, 2500.00, …, 2500.000000000000000000000000000000 depending on q.
    • q will have no effect on the results of arithmetic and comparison operations.
      • User-visible comparison operations (=, ≠, <, ≤, >, ≥) should follow the usual IEEE semantics of comparing mathematical values, NaN ≠ NaN, +0 = -0, etc.
      • There's no strong use case for totalOrder. We don't even support it on Numbers yet.
        • For using Decimal values as keys in a map, use either the standard toString or the quantum-preserving conversions to strings depending on whether trailing zeroes are significant or not in your use case.

Naming

  • Temporal has a notion of of typed time values and uses "equals" to mean the same time in the same time zone, whereas "compare" means the same time.
  • Numbers in math have a strong precedent that "equals" means numerical equality. Having it also compare quanta (which most folks would not even be aware of) would be confusing.

Formats

  • We briefly discussed the possibility of typed arrays containing Decimal128 values, perhaps as a future extension. To do this we'd need to define the binary encoding of Decimal128 values, with the IEEE standard offering four mutually incompatible ones. Without typed arrays the choice of representation would be private to an implementation of ECMAScript and not visible via user APIs.

Conversions

  • Conversions to and from Decimal128 should be explicit.
  • We should provide the necessary explicit conversions between the various number formats. Use cases where different inputs have different numeric types will often come up. If we don't support those, users will hack around them and create chaos.
  • The natural way of converting from Decimal128 to Number is to treat the Decimal128 value as a real number and round it to the nearest Number in the usual way.
    • This is the way Number literals work in the language today.
    • Throwing an exception if the conversion is inexact would have significant usability downsides because there are common use cases where such a conversion is needed.
  • There are two possible ways to convert from a Number x to Decimal128:
    • Return the Decimal128 value closest to the mathematical value of x; this corresponds to the decimal string that x.toPrecision(34) would return. Most users who are not numerical experts would consider this behavior, which converts the Number 3.1 to the Decimal128 3.100000000000000088817841970012523 to be weird, so this should not be the default mode of the conversion. We may wish to provide it as a separate function for math experts.
    • Return the Decimal128 value that would be obtained had we converted x to a String s and then s to a Decimal128. This would convert the Number 3.1 to the Decimal128 3.1.
      • The conversion to s may have a bit of implementation latitude in some cases, but there is a way to specify a unique best conversion here.

@waldemarhorwat
Copy link
Author

waldemarhorwat commented Apr 17, 2024

Here's my summary of the discussion on Wednesday, 17 April 2024. Please add comments if I missed anything.

Overall

  • Should we order the goals listed in yesterday's overall section? That would be helpful if some of them conflict.
    • Conflicts are hypothetical at this stage. In practice the goals listed yesterday for Decimal128 shouldn't conflict. Maybe wait until we run into an actual conflict?

Rounding Modes and Implementability

  • The spec currently proposes nine rounding modes and names them "ceil", "floor", "expand", "trunc", "halfEven", "halfExpand", "halfCeil", "halfFloor", and "halfTrunc".
  • IEEE 754 defines only five rounding modes and names them "roundTiesToEven", "roundTiesToAway", "roundTowardPositive", "roundTowardNegative", "roundTowardZero".
  • These rounding modes can be applied to coercions as well as arithmetic operations, where the arithmetic is done exactly and then the exact result is rounded to a representatible Decimal128 value according to the rounding mode.

This disparity leads to questions of both nomenclature and implementability:

  • Which user nomenclature should we adopt to name the rounding modes? To answer this we should take a look at how these are named in existing practice.

  • The spec defines Decimal rounding modes not present in IEEE 754. This has the potential to create trouble for implementations using decimal hardware or existing high-performance libraries, which would generally not be written in ECMAScript. Hardware or libraries support the modes present in IEEE 754 for arithmetic. What should an implementation do if someone tries to take the sine or square root of a Decimal or divide two Decimals using a non-IEEE-754 rounding mode? Hardware or libraries won't provide the unrounded answer, only the already-rounded answer according to a chosen IEEE 754 rounding mode, so one can't just stick a different rounding step on the end of the algorithm. This becomes hard to implement.

    • Preferred solution for initial Decimal128 support: Keep rounding mode parameters for conversions, but omit rounding mode parameters from arithmetic operations; instead, just use roundTiesToEven for those like we do for Numbers.
    • We don't have folks clamoring for arithmetic operations on Numbers using one of the other IEEE 754 rounding modes, so it's likely we won't have folks clamoring for those on Decimal arithmetic operations either.
    • Side note: some arithmetic operations never round and always produce exact results, so they shouldn't take a rounding mode. Examples include comparisons, negation, absolute value, and remainder (this is true for both IEEE remainder and traditional remainder). It's not hard to mathematically prove that these operations are always exact.
    • If we do encounter good use cases, it's easy add a rounding mode parameter to arithmetic operations later in an upwards compatible manner.
  • Depending on what conversions and rounding modes we support, their definitions could use some care. An example is toExponential-style rounding of the Decimal value 9.5𝔻 to one significant digit using the roundTiesToEven mode. The nearest Decimals with one significant digit are 9𝔻 and 10𝔻, they're equally near, and neither has an even least significant digit (9 and 1 respectively).

    • Note that this weirdness arises due to a conversion requesting rounding to one significant digit. It doesn't arise for rounding in, for example, IEEE 754 arithmetic because that's always done to 34 significant digits.

User Operations on Mantissa and Exponent

Some users will want to compose a Decimal out of a mantissa and exponent and decompose a Decimal back into a mantissa and exponent. That's desirable in various applications and we should make sure that how to do this correctly is readily apparent. Some options:

  • Rely only on Decimal↔String conversions.
    • Composition: If someone wants to create a Decimal with a given mantissa and exponent, they can convert the mantissa and exponent into strings, string-concatenate mantissa+"e"+exponent, and convert the resulting string into a Decimal. This works for simple use cases but has some rough edges (what if mantissa is +∞?).
    • Decomposition: Splitting a Decimal into a mantissa and exponent is harder and provides more opportunities for bugs to creep in. Users would have to parse the string and possibly deal with it being or not being in exponential notation.
  • To solve the decomposition problem, provide ways to extract the mantissa and exponent from a Decimal.
    • The easiest approach to understand is to provide mantissa(d) and exponent(d) methods (or accessors) with the following behavior, which matches the everyday concept of scientific notation:
      • exponent(d) returns the exponent (or truncated exponent) defined here. The result is an integer and its type should probably be Number, not Decimal.
      • mantissa(d) returns a Decimal whose absolute value is between 1 inclusive and 10 exclusive, except in the case of zero (or denormals if we choose truncated exponent for the exponent method).
      • If someone wants the mantissa as an integer instead, they can just compute mantissa(d) * 1033. This will always be an exact integer. They can convert it to a BigInt if they like.
  • To help with the composition problem, we could provide a scale10 operation as specified in IEEE 754: Given a Decimal d and an integer Number n, scale10(dn) is the Decimal d × 10n rounded in the usual way. This provides multiplication and division by powers of 10.
    • If we don't provide such a function or equivalent, users will generate powers of 10 by creating a string starting with "1e" followed by the exponent and convert the string to a Decimal. This almost works but can cause intermediate overflow problems: scale10(0.02𝔻, 6145) will correctly return 2e6143𝔻, whereas the string hack will produce an infinity instead.

Explicit Conversions

Decimal ↔ String

Decimal → String

  • The default conversion of a Decimal to a String should be as discussed yesterday.
    • Just like the Number → String conversion, the default Decimal → String conversion should switch to scientific notation for values with sufficiently high or low magnitude. Given Decimal's greater number of significant digits, the thresholds for this switch should be greater than they are for Number.
  • We can also provide a conversion of a Decimal to a String that distinguishes cohort members (reproduces trailing zeroes) as either a separate function or a separate mode. This one is tricky to use due to it unexpectedly generating scientific notation for Decimals with negative numbers of trailing zeroes.
  • To support users explicitly wanting non-scientific or scientific notation strings, we should provide Decimal equivalents of toPrecision, toExponential, etc. Rounding details are important here, so these should take optional rounding modes.

String → Decimal

  • This should be straightforward, treating the string as a mathematical real number and then rounding it to a Decimal using roundTiesToEven.
  • Should we provide a version that can take a different rounding mode?

Decimal ↔ Number

We discussed those yesterday and settled on the choices:

  • Decimal → Number treats the Decimal as a mathematical real number and then rounds it to a Number in the usual way using roundTiesToEven.
  • Number → Decimal should be done as if the conversion were Number → String → Decimal for the reasons outlined yesterday. We want the Number 3.1 to produce the Decimal 3.1, not the Decimal 3.100000000000000088817841970012523.
    • We do not want to throw on inexact conversions here. There will be lots of cases where users need to do heterogeneous arithmetic and need to convert a Number to a Decimal.

Decimal ↔ BigInt

  • BigInt → Decimal treats the BigInt as a mathematical integer and then rounds it to a Decimal using roundTiesToEven.
  • Decimal → BigInt should follow the precedent of Number → BigInt and produce the BigInt if the Decimal is an integer or throw if not.

Conclusion

  • Waldemar is offering to help with writing spec text and algorithms for the July meeting

jessealama added a commit that referenced this issue May 2, 2024
jessealama added a commit that referenced this issue May 2, 2024
* Define a Makefile for basic editorial work

* Add a number of suggested definitions

Relates to #122.
jessealama added a commit that referenced this issue May 15, 2024
* Fix how we refer to exceptions.

+ For `Decimal128.prototype.round`, use `mark` to mark an
incomplete algorithm.

+ Use `*this* value` rather than `**this** value`.

Part of #122
@jessealama jessealama reopened this May 17, 2024
@jessealama
Copy link
Collaborator

(This issue got unintentionally closed; reopening it.)

jessealama added a commit that referenced this issue May 30, 2024
…ion, and remainder (#143)

* Add `toFixed`, `toPrecision`, and `toExponential`

* Define AO to massage a potential Decimal128 value to an actual one

* Say that Decimal128 values aren't ECMAScript language values

* Inline AO used only in the constructor

* Fix references to undefined variables and mark an in-progres spot

* Flesh out `round` method

* Fix specification for exponent in `divide`

* Add checks for rounding mode

* Flesh out constructor

* Add brand checks

* Add links

* Add reference for rounding

* Add more IEEE 754 references

* Use "Otherwise"

* Add note about how we convert from Number

relates to #122
@waldemarhorwat waldemarhorwat mentioned this issue Jun 11, 2024
27 tasks
@waldemarhorwat
Copy link
Author

waldemarhorwat commented Jun 11, 2024

EnsureDecimal128Value is the main nexus of issues in the June 2024 spec. Here's a sketch of how I'd recommend fixing things. We should probably come up with better names for some of these sets and functions, but the general idea is:

  • Let FiniteNonzeroDecimalCohorts = {n × 10q | nq ∈ ℤ, 0 < |n| < 1034, -6176 ≤ q ≤ 6111}. This is the mathematical set of all real numbers that are mathematical values of finite, nonzero Decimal128 values. (Note that mathematical set notation combines duplicates: {1, 3, 2, 3, 2, 5} = {1, 2, 3, 5}.)

  • Let FiniteDecimalCohorts = FiniteNonzeroDecimalCohorts ∪ {+0𝔻, -0𝔻}. This is exactly the set of values that v can take in a Decimal128 value of the form «vq».

  • Let DecimalCohorts = FiniteDecimalCohorts ∪ {NaN𝔻, +∞𝔻, -∞𝔻}. This is exactly the set of values that the cohort function can return when given an arbitrary Decimal128 value.

  • Define a spec function RoundDecimal128(v, roundingMode) that takes v ∈ ℝ and one of the rounding modes. It rounds v to one of the Decimal128 cohorts per the rounding mode and produces a result that is a member of the set FiniteDecimalCohorts ∪ {+∞𝔻, -∞𝔻} — note that there's no way for rounding to produce a NaN𝔻. Use this function whenever rounding is needed. Don't pass ±0𝔻 as the first parameter to this function; instead, handle those cases explicitly in the calling code.

  • Define a spec function PickQuantum(d, v, qPreferred) that takes d ∈ DecimalCohorts, v ∈ ℝ, and qPreferred ∈ ℤ and produces a Decimal128 value as follows:

    • If d is one of NaN𝔻, +∞𝔻, -∞𝔻, PickQuantum just returns d.
    • If d ∈ FiniteDecimalCohorts:
      • Let mv be the mathematical value of d, which is either d itself if it's already a mathematical value or 0 if d is ±0𝔻.
      • PickQuantum returns the Decimal128 value «dq», where q is chosen as follows:
        • If v = mv, we have an exact result. Pick q is as close as possible to qPreferred such that «dq» is a valid Decimal128 value.
        • If v ≠ mv, we have an inexact result. Pick q as low as possible such that «dq» is a valid Decimal128 value.

Code that calls these should work with cohorts and quantums separately. Arithmetic and logic should call the cohort method and do arithmetic cases with that value.

A recurring problem in the current spec is the incorrect querying of q in Decimal128 values «vq». The only places where q should be used are:

  • As inputs to the second argument of a PickQuantum call, possibly with simple integer arithmetic such as +, -, or min.
  • As inputs to a Decimal-to-string conversion algorithm for toString or toExponential, but only if the user requests the quantum-sensitive version.

Any other use of q is a bug. This includes trying to use q in the default toString or toExponential.

Edit: Added third parameter to cover inexact cases to PickQuantum.

@waldemarhorwat
Copy link
Author

waldemarhorwat commented Jun 14, 2024

I recommend providing the following three user methods for working with exponents and significands. I based this design on recommendations in the IEEE754 standard as well as interoperability considerations. In particular, I designed these so that one can feed the output of one method into another without having to check for special cases such as NaNs, infinities, or zeroes.

The nomenclature is just a strawman. We might find better names for these.

Exponent

The exponent user method returns a Number that represents the base-10 exponent of a Decimal128 value d. It has the following cases:

  • If d is NaN𝔻, return NaN𝔽.
  • If d is +∞𝔻 or -∞𝔻, return +∞𝔽.
  • If d is «+0𝔻, q»𝔻 or «-0𝔻, q»𝔻, return -∞𝔽.
  • Otherwise d is a nonzero finite Decimal128 value «v, q»𝔻. Let e be the unique integer for which 10e ≤ |v| < 10e+1. Return 𝔽(e).

This logic is based on the IEEE754 logB method.

QuantumExponent

The quantumExponent user method takes a Decimal128 value d and returns its quantum q as a Number if available. It has the following cases:

  • If d is NaN𝔻, return NaN𝔽.
  • If d is +∞𝔻 or -∞𝔻, return +∞𝔽.
  • Otherwise d is a finite Decimal128 value «v, q»𝔻. Return 𝔽(q).

Why not call this quantum? I would, but the IEEE spec uses the word quantum to mean 10d.quantumExponent. One can easily generate that using the scale method, as illustrated later.

Scale

The scale (maybe we should call it scale10?) user method scales a Decimal128 value's exponent by a given Number n, effectively multiplying it by 10n but without the danger of intermediate overflows. It takes a Decimal128 value d and a Number value n, which must be an integeral Number or one of {NaN𝔽, +∞𝔽, -∞𝔽}. It has the following cases:

  • If n is neither an integral Number nor one of NaN𝔽, +∞𝔽, or -∞𝔽, then throw.
  • If d is NaN𝔻, +∞𝔻, or -∞𝔻, return d (as a new object).
  • Otherwise d is a finite Decimal128 value «v, q»𝔻.
  • If v is +0𝔻 or -0𝔻:
    • If n is NaN𝔽, return d (as a new object).
    • If n is +∞𝔽, return «v, 6111»𝔻.
    • If n is -∞𝔽, return «v, -6176»𝔻.
    • Return PickQuantum(v, v, q + ℝ(n)).
  • Otherwise v is a nonzero real number.
    • If n is NaN𝔽, return NaN𝔻.
    • If n is +∞𝔽, return +∞𝔻 if v > 0 or -∞𝔻 if v < 0.
    • If n is -∞𝔽, return «+0𝔻, -6176»𝔻 if v > 0 or «-0𝔻, -6176»𝔻 if v < 0.
    • Let w = v × 10ℝ(n).
    • Return PickQuantum(RoundDecimal128(w, "halfEven"), w, q + ℝ(n)).

Applications

With these three methods we can readily do all of the common operations on exponents and significands. The scale method lets us construct a Decimal128 value out of a significand and an exponent. For breaking Decimal128 values into parts we have a lot of useful choices:

let d = some_Decimal128_value;

// Get d's exponent
let e = d.exponent;

// Get d's significand as a Decimal.
let s = d.scale(-e);
//   If d is zero, s will be zero of the same sign.
//   If d is NaN or an infinity, s will be the same as d.
//   If d is nonzero finite, |s| will be in the range [1, 10).  The stored precision will carry over,
//   so, for example:
//     if d is 123, then s will be 1.23 and e will be 2
//     if d is 123.00, then s will be 1.2300 and e will be 2
//     if d is -1040e5, then s will -1.040 and e will be 8
//     At this point we can convert s and e to strings and get the parts of d in exponential notation

// Get d's exponent and significand in 34-digit BigInt format instead of Decimal128.
let eInt = e-33;
let sInt = BigInt(d.scale(-eInt));
// This only makes sense if d is neither NaN nor infinite.
// The BigInt constructor will throw if d is NaN or infinite.


// Similar to above, but make Decimal128 denorms gradually reduce the significand to zero,
// as denorms do in IEEE arithmetic.

// Get d's truncated exponent
let eDenorm = Math.Max(e, -6143);
let eIntDenorm = eDenorm-33;
let sIntDenorm = BigInt(d.scale(-eIntDenorm));
// This also only makes sense if d is neither NaN nor infinite.
// The BigInt constructor will throw if d is NaN or infinite.


// Get d's quantum as an exponent
let q = d.quantumExponent;

// Get d's unit of quantum.  d is an integral multiple of unitQ.
// See the examples below.
let unitQ = Decimal128(1).scale(q);

// Get d's significand as a multiple of the quantum.
// This will always be an integer with magnitude below 10^34 for finite d.
let sQ = d.scale(-q);

// For example:
//   if d is 123, then q will be 0, unitQ will be 1, and sQ will be 123
//   if d is 123.00, then q will be -2, unitQ will be 0.01, and sQ will be 12300
//   if d is -1040e5, then q will 5, unitQ will be 1e5, and sQ will be -1040
//   if d is 0.000, then q will be -3, unitQ will be 0.001, and sQ will be 0

jessealama added a commit that referenced this issue Jul 2, 2024
jessealama added a commit that referenced this issue Jul 5, 2024
…m rounding, as well as API additions (#159)

* Get started using the suggested `PickQuantum` AO

As suggested in #122

* Fix and simplify .add, .subtract and .multiply

There were a bunch of errors with handling of zeroes
and infinities. This commit fixes those, and reduces
the number of branches to make the algorithm shorter
and simpler to follow.

* Only define `cohort(x)` for finite numbers

While we also defined the cohort of +inf/-inf/NaN, those values were
never passed to `cohort()` and thus all the algorithms (correctly)
assume that the result of `cohort` is either a zero or a finite
mathematical value. While this is true, it is difficult to read them
because the type definition of  `cohort` says that it can also return
`+inf/-inf/NaN`. This updates the type definition to match usage,
rather than unnecessarily accepting more inputs.

There is one case in .add and friends where we are actually passing
+inf/-inf/NaN to cohort(), but that is just a bug since we then do not
handle the case in which it returns +inf/-inf/NaN.

* Define rounding and use it, along with pick-quantum, throughout

* Ensure .abs returns a non-negative value

* Fix `isFinite`

* Remove references to "rational number"

(Of course, an alternative would be to define the term.)

* Handle second argument NaN in .compare

* round: Do `RequireInternalSlot` before validating arguments

* Unconditionally throw in `valueOf`

Don't even ensure that we have the right internal slot.

* Set variable rather than return early

We need to ensure that `new Number(...)` returns a new
object, rather than a primitive.

* Ensure variable is defined

* Use the term "real number"

* Use `abs(...)` rather than `|...|`

* Use assertion rather than note

* Add <sub>𝔻</sub> after pairs

* Fix return type of ToIntlMathematicalValue

We don't return a decimal.

* Handle returning 0 from remainder

* Qualify argument of rounding (shouldn't be zero)

* Use proper IDs

* When rounding, enumerate 34 significant digits and consult the 35th

* Show where we overflow to infinity in rounding

* Handle denormalized values in rounding

* Add IDs for normalized/denormalized

* Handle underflow with increasingly coarse-grained approximations

* Handle the possibility that rounding returns 0

* Fix busted <sub>D</sub> and calls to `PickQuantum`

* In scale10, allow NaN and infinity as argument

Throw only if we are given a non-integer finite argument.

* Return a +/-0 from RoundToDecimal128Domain

* Fix quantum of division

* Define a separate AO for rounding positive numbers

* Embed utility rounding AO inside the other one

Also, fix a positive infinity sneaking out.

---------

Co-authored-by: Nicolò Ribaudo <[email protected]>
Co-authored-by: Nicolò Ribaudo <[email protected]>
@nicolo-ribaudo
Copy link
Member

nicolo-ribaudo commented Jul 5, 2024

Regarding the proposed notation for decimals, I suggest using something like ~positive-zero~ and ~negative-zero~ instead of +0𝔻 and -0𝔻. It is longer to type/read, but while reading the proposal spec I initially found it very confusing that 𝔻 usually denotes a decimal128 value, but sometimes it doesn't.

@waldemarhorwat
Copy link
Author

Earlier I mentioned the spec function RoundDecimal128. Here's how I recommend implementing it.

RoundDecimal128(v, roundingMode) that takes v ∈ ℝ and one of the rounding modes. It rounds v to one of the Decimal128 cohorts per the rounding mode and produces a result that is a member of the set FiniteDecimalCohorts ∪ {+∞𝔻, -∞𝔻} — note that there's no way for rounding to produce a NaN𝔻. Don't pass ±0𝔻 as the first parameter to this function; instead, handle those cases explicitly in the calling code because IEEE treats negative zeros specially. On the other hand, the mathematical value 0 is perfectly fine as the first parameter of this function and outputs the cohort +0𝔻.

  1. If v = 0, return +0𝔻.
  2. If v < 0:
    1. Let reverseRoundingMode be roundingMode but with floor replaced by ceil and vice versa.
    2. Let d = RoundDecimal128(–v, reverseRoundingMode).
    3. If d is +∞𝔻, return –∞𝔻.
    4. If d is +0𝔻, return –0𝔻.
    5. Return –d.
  3. Let e be the unique integer such that 10e ≤ v < 10e+1.
  4. Let te = e – 33.
  5. If te < –6176, let te = –6176.
  6. Let m = v × 10te.
  7. Let rounded = ApplyRoundingModeToPositive(mroundingMode).
  8. If rounded = 1034:
    1. Let te = te + 1.
    2. Let rounded = 1033.
  9. If te > 6111, return +∞𝔻.
  10. If rounded = 0, return +0𝔻.
  11. Return rounded × 10te.

The spec function ApplyRoundingModeToPositive(mroundingMode) takes a positive number m ∈ ℝ and one of the rounding modes and returns an integer:

  1. Let mLow = ⌊m⌋.
  2. Let fraction = m – mLow.
  3. If fraction = 0, return mLow.
  4. Let mHigh = mLow + 1.
  5. If roundingMode is floor or trunc, return mLow.
  6. If roundingMode is ceil, return mHigh.
  7. If fraction < 0.5, return mLow.
  8. If fraction > 0.5, return mHigh.
  9. If roundingMode is halfExpand, return mHigh.
  10. If mLow is an even integer, return mLow.
  11. Return mHigh.

@waldemarhorwat waldemarhorwat mentioned this issue Jul 6, 2024
15 tasks
@waldemarhorwat
Copy link
Author

This is how I'd recommend implementing the spec function PickQuantum mentioned earlier.

The spec function PickQuantum(d, v, qPreferred) takes d ∈ DecimalCohorts, v ∈ ℝ, and qPreferred ∈ ℤ and produces a Decimal128 value in cohort d. If d is an exact Decimal128 cohort representation of v, PickQuantum picks the quantum as close as possible to qPreferred; if d is not an exact Decimal128 cohort representation of v, PickQuantum picks the lowest possible quantum.

  1. If d is one of NaN𝔻, +∞𝔻, –∞𝔻, return d.
  2. Let inexact = false.
  3. If d is +0𝔻 or –0𝔻:
    1. Let qMax = 6111.
    2. Let qMin = –6176.
    3. If v ≠ 0, let inexact = true.
  4. Else d must be a real number in the set FiniteNonzeroDecimalCohorts:
    1. Let e be the unique integer such that 10e ≤ |v| < 10e+1.
    2. Let qMin = e – 33.
    3. If qMin < –6176, let qMin = –6176.
    4. Let m = v × 10qMin.
    5. Assert m is an integer and 0 < |m| < 1034 and qMin ≤ 6111. This is due to the precondition that d ∈ FiniteNonzeroDecimalCohorts.
    6. Let n be the largest integer for which m × 10n is an integer.
    7. Let qMax = qMin + n.
    8. If qMax > 6111, let qMax = 6111.
    9. If v ≠ d, let inexact = true.
  5. If inexact:
    1. Let q = qMin
  6. Else:
    1. Let q = qPreferred.
    2. If q < qMin, let q = qMin.
    3. If q > qMax, let q = qMax.
  7. Return «dq»𝔻.

PickQuantum is often (but not always) used together with RoundDecimal128, so it's also worthwhile to define a helper spec function that combines the two:

The spec function RoundAndPickQuantum(v, roundingMode, qPreferred) takes v ∈ ℝ, one of the rounding modes, and qPreferred ∈ ℤ and produces a Decimal128 value as follows:

  1. Return PickQuantum(RoundDecimal128(v, roundingMode), v, qPreferred).

@jessealama
Copy link
Collaborator

This has been implemented as suggested. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants