Floating point precision not preserved when converting floats #332

daniel-shuy · 2021-02-16T16:57:06Z

daniel-shuy
Feb 16, 2021

Indriya retains floating point precision when converting doubles, but not floats.
e.g.

System.out.println(Quantities.getQuantity(3.524, Units.GRAM).to(Units.KILOGRAM).getValue());

prints 0.003524, but

System.out.println(Quantities.getQuantity(3.524f, Units.GRAM).to(Units.KILOGRAM).getValue());

prints 0.0035239999294281006.

andi-huber · 2021-02-16T17:17:13Z

andi-huber
Feb 16, 2021
Collaborator

Hi @daniel-shuy - why is that an issue?

0 replies

daniel-shuy · 2021-02-16T17:23:43Z

daniel-shuy
Feb 16, 2021
Author

That's because the precision is not preserved. It should return 0.003524, like how it behaves with doubles.

DefaultNumberSystem uses BigDecimal internally to preserve floating point precision when converting doubles. It attempts to do the same for float, by converting to double. Unfortunately Float#doubleValue() does not preserve floating point precision.
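
A minimal JDK-only sketch of the effect described above. It does not call Indriya; the class and method names are made up for illustration, and the gram-to-kilogram step is approximated by shifting the decimal point three places.

```java
import java.math.BigDecimal;

public class GramToKilogramSketch {

    // Mimics the reported path: widen the float to double, then build the BigDecimal.
    static BigDecimal toKilogramViaDouble(float grams) {
        return BigDecimal.valueOf((double) grams).movePointLeft(3);
    }

    // Alternative path: start from the float's own decimal string.
    static BigDecimal toKilogramViaString(float grams) {
        return new BigDecimal(Float.toString(grams)).movePointLeft(3);
    }

    public static void main(String[] args) {
        System.out.println(toKilogramViaDouble(3.524f)); // 0.0035239999294281006
        System.out.println(toKilogramViaString(3.524f)); // 0.003524
    }
}
```

The extra digits appear at the widening step, before any unit arithmetic happens; the division by 1000 only carries them along.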

I've created a PR (#331) to fix this.

0 replies

keilw · 2021-02-16T17:31:04Z

keilw
Feb 16, 2021
Maintainer

@daniel-shuy according to https://jcp.org/en/participation/members/S you don't seem to be a JCP member. Would you be willing to join? Otherwise we cannot merge this PR.

0 replies

andi-huber · 2021-02-16T17:48:07Z

andi-huber
Feb 16, 2021
Collaborator

@daniel-shuy - reading your PR, I can see you are converting a float (32-bit IEEE 754) into its rounded decimal representation (that's the toString conversion) and then parsing it into a BigDecimal, which represents floating point numbers in decimal notation. With this step you are already losing precision. I believe you are chasing a red herring here. I cannot see any flaws in our current NumberSystem implementation.

If you think there is a bug, please demonstrate the issue with a test case.

0 replies

andi-huber · 2021-02-16T17:54:31Z

andi-huber
Feb 16, 2021
Collaborator

I know that's a bit of a puzzler, but e.g. 0.1f or even 0.1d in Java does not represent the exact number 0.1 (meaning one tenth in decimal representation)!
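
To make this concrete, here is a small JDK-only sketch (the class and method names are hypothetical). It relies on the `new BigDecimal(double)` constructor, which keeps the exact binary value instead of rounding to a short decimal string.

```java
import java.math.BigDecimal;

public class ExactBinaryValueDemo {

    // new BigDecimal(double) preserves the exact value stored in the double.
    static BigDecimal exactValue(double value) {
        return new BigDecimal(value);
    }

    public static void main(String[] args) {
        // A float argument is widened to double here; the widening itself is exact.
        System.out.println(exactValue(0.1f)); // 0.100000001490116119384765625
        System.out.println(exactValue(0.1d)); // 0.1000000000000000055511151231257827021181583404541015625
        System.out.println(exactValue(0.5f)); // 0.5, because 1/2 is exactly representable in binary
    }
}
```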

0 replies

daniel-shuy · 2021-02-16T17:58:23Z

daniel-shuy
Feb 16, 2021
Author

@andi-huber well to be more exact, we actually don't want to gain precision (eg. we want 3.524 / 1000 to return 0.003524, not 0.0035239999294281006), let me correct my PR description.

You can test the behavior with this:

var number = Float.valueOf(3.524f);
System.out.println(BigDecimal.valueOf(number.doubleValue()));  // 3.5239999294281006
System.out.println(new BigDecimal(number.toString()));  // 3.524

But good point, I'll add some test cases.

0 replies

keilw · 2021-02-16T18:00:04Z

keilw
Feb 16, 2021
Maintainer

+1 it would be nice to back a changed behavior (but also the existing one) with more JUnit tests.
While the test coverage of the API project is around 80% (could still be better but even at corporate clients that is usually decent) Indriya only got 60% so far, thus it could be better wherever people are willing to help.

0 replies

andi-huber · 2021-02-16T18:09:06Z

andi-huber
Feb 16, 2021
Collaborator

My humble advice: please have a short excursion into floating point number representation according to IEEE 754, before writing any supposedly fixing code, which we are not going to merge.

0 replies

keilw · 2021-02-16T18:12:46Z

keilw
Feb 16, 2021
Maintainer

@andi-huber Also added you as reviewer for the PR. If there are functional problems please discuss them there. I think it might be better to call it a Draft PR for now, WDYT?
Changed to draft so it's not accidentally merged. Thanks also @desruisseaux for the additional input. If either of you feels the PR is unnecessary, feel free to close it (at least @desruisseaux should be able to close it; not sure about Associate members, but for the most part I guess every contributor has the same right to close issues or PRs).

0 replies

desruisseaux · 2021-02-16T18:13:42Z

desruisseaux
Feb 16, 2021
Collaborator

Converting an IEEE 754 number with toString(), then parsing as a BigDecimal is the strategy applied by the JDK in BigDecimal.valueOf(double) method. That method is currently implemented as below:

    public static BigDecimal valueOf(double val) {
        return new BigDecimal(Double.toString(val));
    }

So replacing BigDecimal.valueOf(number.doubleValue()) by new BigDecimal(number.toString()) would give the same results and take into account the various types of Number (long, float, double, etc.). Note in particular that the current Indriya implementation loses precision for the long type too (conversion from long to double is not always lossless), while the toString() representation avoids that.

Currently it is not even necessary to do a (number instanceof Double) check. Unless BigDecimal.valueOf(double) implementation is changed in future, just using unconditionally the toString() approach will work as well.
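
The point about long can be checked with plain JDK code. The sketch below uses hypothetical helper names and is not Indriya's actual code; it simply contrasts the two conversion styles for a long just above 2^53, where double can no longer represent every integer.

```java
import java.math.BigDecimal;

public class LongPrecisionDemo {

    // Style discussed as the current approach: go through doubleValue().
    static BigDecimal viaDouble(Number number) {
        return BigDecimal.valueOf(number.doubleValue());
    }

    // Style discussed as the alternative: parse the number's own string form.
    // Assumes toString() yields a plain decimal (true for Long, Integer, Float
    // and finite Double; NaN and infinities would throw NumberFormatException).
    static BigDecimal viaString(Number number) {
        return new BigDecimal(number.toString());
    }

    public static void main(String[] args) {
        Long big = (1L << 53) + 1; // 9007199254740993
        System.out.println(viaString(big)); // 9007199254740993
        // The double detour rounds to a neighbouring value, so the two differ.
        System.out.println(viaDouble(big).compareTo(viaString(big)) != 0); // true
    }
}
```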

0 replies

daniel-shuy · 2021-02-16T18:17:19Z

daniel-shuy
Feb 16, 2021
Author

So replacing BigDecimal.valueOf(number.doubleValue()) by new BigDecimal(number.toString()) would give the same results

@desruisseaux unfortunately, when you pass a Float to BigDecimal.valueOf(double), the Float is unboxed and cast to a double, which has the same problem as Float#doubleValue():

public double doubleValue() {
    return (double)value;
}

Try running this code and you'll see what I mean:

var number = Float.valueOf(3.524f);
System.out.println(BigDecimal.valueOf(number));  // 3.5239999294281006
System.out.println(new BigDecimal(number.toString()));  // 3.524

0 replies

desruisseaux · 2021-02-16T18:21:49Z

desruisseaux
Feb 16, 2021
Collaborator

@daniel-shuy yes I know, that sentence was specifically about the Double case (it was not clear in my comment). I was trying to say that if the code uses the toString() representation of the Double type, there is no difference with the current BigDecimal.valueOf(double) implementation. Consequently there is no need for the if (number instanceof Double) special case.

0 replies

daniel-shuy · 2021-02-16T18:22:28Z

daniel-shuy
Feb 16, 2021
Author

@desruisseaux ah sorry I misunderstood, that's a good point, that will greatly simplify the code, thanks!

0 replies

desruisseaux · 2021-02-16T18:29:18Z

desruisseaux
Feb 16, 2021
Collaborator

No problem. As a side note (not something needed for this particular case), below is code for converting a float to a double as if using the toString() base 10 representation, but more efficiently:

https://github.com/apache/sis/blob/1.0/core/sis-utility/src/main/java/org/apache/sis/math/DecimalFunctions.java#L142

0 replies

keilw · 2021-02-16T18:51:08Z

keilw
Feb 16, 2021
Maintainer

I made both @desruisseaux and @andi-huber reviewers for the PR. Please check it out and comment or suggest changes (or close it if you really think it isn't appropriate). From a process standpoint we should have proof that @daniel-shuy has requested to become an Associate JCP Member, or see his name already listed at https://jcp.org/en/participation/members/S.

0 replies

daniel-shuy · 2021-02-16T18:53:29Z

daniel-shuy
Feb 16, 2021
Author

Thanks @keilw. I'll try to apply to become a JCP Member as soon as possible.

0 replies

daniel-shuy · 2021-02-16T19:35:00Z

daniel-shuy
Feb 16, 2021
Author

A better example:

System.out.println(Quantities.getQuantity(4.1f, Units.METRE).to(MetricPrefix.CENTI(Units.METRE)).getValue());

prints 409.999990463256800 instead of 410

0 replies

andi-huber · 2021-02-16T19:36:12Z

andi-huber
Feb 16, 2021
Collaborator

The key question I guess then is, as we eventually have to convert a float (binary representation) to decimal representation which method is best. As it seems, we are discussing 2 methods:

(1) 'widen' the float to double, then for the double find the nearest decimal representation with an upper bound A of available decimal places (current implementation)
(2) for the float find the nearest decimal representation with an upper bound B of available decimal places (suggested change)

If I understand correctly A is 19 decimal digits, whereas B is only 9 digits. However, for the 'widening' of float to double, all the binary places we gain are set to zero, which makes the difference between method (1) and (2) less dramatic. But still, it's not clear to me why we would want to prefer method (2).

0 replies

desruisseaux · 2021-02-16T20:05:08Z

desruisseaux
Feb 16, 2021
Collaborator

When widening from float to double, the extra bits in the IEEE 754 representation are set to 0. But this is an arbitrary choice; those bits could be set to anything, and the reality is that we do not know their values. Consequently if Float.toString() formats about 9 digits and Double.toString() formats about 19 digits, then all the last 10 digits are just noise (for a number widened from float to double). I don't think it matters if those 10 digits bring us closer to the IEEE 754 representation with the last bits arbitrarily set to zero; it still does not contain any real information.

Float.toString() and Double.toString() are designed to format the minimal number of digits necessary. So the advantage of parsing the string representation of those numbers is that BigDecimal gets only significant digits, without noise. The drawback of BigDecimal.valueOf(double) where the double is a widened value is that it creates a false sense of precision.

I agree that formatting a number as a string and parsing it back looks like an ugly hack. But actually a more mathematically elegant solution would be very complicated.
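
As a rough illustration of the "significant digits versus noise" argument, the following JDK-only sketch (hypothetical names, not Indriya code) counts the digits each string form hands to BigDecimal for the same float.

```java
import java.math.BigDecimal;

public class SignificantDigitsDemo {

    // Digits taken from the float's own shortest decimal string.
    static BigDecimal fromFloatString(float value) {
        return new BigDecimal(Float.toString(value));
    }

    // Digits taken after widening the float to double.
    static BigDecimal fromWidenedString(float value) {
        return new BigDecimal(Double.toString(value));
    }

    public static void main(String[] args) {
        float metres = 4.1f;
        BigDecimal shortForm = fromFloatString(metres);
        BigDecimal longForm = fromWidenedString(metres);
        System.out.println(shortForm + " has " + shortForm.precision() + " digits"); // 4.1 has 2 digits
        System.out.println(longForm + " has " + longForm.precision() + " digits");
        // Both decimal strings still identify the very same float value.
        System.out.println(Float.parseFloat(shortForm.toString()) == Float.parseFloat(longForm.toString())); // true
    }
}
```

The longer form carries more digits, but parsing either string back yields the same float, which is the sense in which the additional digits add no information about the original value.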

0 replies

daniel-shuy · 2021-02-16T20:12:15Z

daniel-shuy
Feb 16, 2021
Author

@andi-huber I may have misunderstood the original purpose of using BigDecimal for double arithmetic in DefaultNumberSystem. I assumed it was to preserve precision between conversions, but it seems like it was only intended to prevent precision loss, and the benefit of preventing precision gain between conversions was an unintended side-effect.

If that is the case, then it is indeed not a bug, but an enhancement. While the change is not strictly necessary - I can simply format the value output with DecimalFormat, I would suggest applying the change because it is currently a little unintuitive.

Because of the current behavior, I initially assumed that the precision is kept between unit conversions. E.g. when converting 1.56 m to cm, it would return 156 cm, and I would not have to format the value output. Imagine my surprise when my colleague told me that the unit conversions were returning "imprecise" values for him! Only after much trial and error did we realize that it was occurring for float, but not double, which really confused us.

If you all do decide not to merge the PR, I hope that at least the documentation can be updated to reflect that (maybe Quantity#to(Unit)?), to help others that may stumble on this issue in the future.
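
For readers who need the formatting workaround mentioned above, here is a hedged sketch using only java.text. The pattern, the six-digit limit, and the class name are assumptions chosen for illustration; the BigDecimal and double literals stand in for whatever getValue() returns.

```java
import java.math.BigDecimal;
import java.text.DecimalFormat;
import java.text.DecimalFormatSymbols;
import java.util.Locale;

public class FormatWorkaroundDemo {

    // Rounds to at most six fraction digits and drops trailing zeros.
    // Locale.ROOT keeps '.' as the decimal separator regardless of the default locale.
    static String format(Number value) {
        DecimalFormat fmt = new DecimalFormat("0.######", DecimalFormatSymbols.getInstance(Locale.ROOT));
        return fmt.format(value);
    }

    public static void main(String[] args) {
        System.out.println(format(new BigDecimal("0.0035239999294281006"))); // 0.003524
        System.out.println(format(409.9999904632568));                      // 410
    }
}
```

This only changes how the value is displayed; the underlying quantity still holds the longer decimal expansion.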

0 replies

andi-huber · 2021-02-16T20:13:11Z

andi-huber
Feb 16, 2021
Collaborator

Hi @desruisseaux - I agree, that setting the additional bits to 0 when doing the widening part is an arbitrary choice, unless the float number actually is an exact representation of the corresponding number! In which case the widening is not just noise, but the correct extension. Granted, we can discuss how relevant that is in practice, but still there is a case in my opinion to keep it that way.

0 replies

keilw · 2021-02-16T20:13:47Z

keilw
Feb 16, 2021
Maintainer

Both double and float are equally imprecise, according to folks including Brian Goetz: http://www.javapractices.com/topic/TopicAction.do?Id=213
What about performance, is there a huge penalty with the string operations and parsing?
And would it only be restricted to explicitly passing a Float or 4.1f primitive value?

0 replies

desruisseaux · 2021-02-16T20:18:51Z

desruisseaux
Feb 16, 2021
Collaborator

@andi-huber : yes you are right. But this is information that only the user knows.

@keilw : saying that both float and double are imprecise needs more context. They are precise in base 2; the imprecision comes from our desire to get numbers formatted in base 10, which is a purely human cultural choice.

About performance, yes there is a penalty. But BigDecimal.valueOf(double) is already paying that penalty, so the change proposed by Daniel would have no impact on this aspect.

0 replies

keilw · 2021-02-16T20:26:21Z

keilw
Feb 16, 2021
Maintainer

If it's mainly for values passed as float, then it might be OK. Performance purists may prefer not to use Quantity and use UnitConverter instead (which as of now also just supports double or Number).
The difference of double and float when it comes to memory consumption is not as relevant now as it may have been on Java ME Embedded.

0 replies

andi-huber · 2021-02-16T20:50:54Z

andi-huber
Feb 16, 2021
Collaborator

I believe this is not a discussion about memory footprint or performance. When a calculation needs widening of types to BigDecimal, we do this regardless of the initial value (float or double) that was passed into the Quantity factory method. Purists of any kind will have to resort to other means anyway.

I'd rather say the discussion is about which of the 2 methods (from above) we want to implement. I do see the benefits of the suggested changes (method (2), eg. less confusion for the user; smaller decimal number representation, as intended by the user anyway), but I'm also concerned about breaking a few corner cases, that's all.

1 reply

keilw Feb 16, 2021
Maintainer

Speaking of discussion, I turned this into a discussion rather than an issue. This gives it a ranking button, so maybe someone could prepare the options in a separate box, and everyone who is interested should press the up/down arrows to rank the preferred one higher or lower.

andi-huber · 2021-02-17T04:14:06Z

andi-huber
Feb 17, 2021
Collaborator

Here's one corner case, which is in favor of method (1) in our discussion:

Let's multiply 2 floating point numbers a and b where the result is exactly 1. floatCalc demonstrates in short what we try to calculate. bigDecCalc1 and bigDecCalc2 compare the 2 different approaches.

import java.math.BigDecimal;

public class FloatIssue {

    private static float a = 1024f*1024f;
    private static float b = 1f/a; 

    static void floatCalc() {
        
        float x = a * b;
        
        System.out.println("floatCalc: " + x);
    }
    
    static void bigDecCalc1() {
        
        BigDecimal x1 = new BigDecimal(Double.toString(a));
        BigDecimal x2 = new BigDecimal(Double.toString(b));
        
        BigDecimal x = x1.multiply(x2);
        
        System.out.printf("float to ~18 digit %s (error: %s)%n", x, error(x));
    }
    
    static void bigDecCalc2() {
        
        BigDecimal x1 = new BigDecimal(Float.toString(a));
        BigDecimal x2 = new BigDecimal(Float.toString(b));
        
        BigDecimal x = x1.multiply(x2);
        
        System.out.printf("float to ~9 digit %s (error:  %s)%n", x, error(x));
    }
    
    private static BigDecimal error(BigDecimal x) {
        return BigDecimal.ONE.subtract(x).abs();
    }
    
    public static void main(String[] args) {
        floatCalc();   // 1.0
        bigDecCalc1(); // float to ~18 digit 1.000000000000000000000 (error: 0E-21)
        bigDecCalc2(); // float to ~9 digit 0.99999998279680 (error:  1.720320E-8)
    }
    
}

3 replies

daniel-shuy Feb 17, 2021
Author

hmm, good point, maybe it only makes sense to keep the precision for operations involving a double/float and a RationalNumber

daniel-shuy Feb 17, 2021
Author

Turns out, you've already handled this scenario in DefaultNumberSystem#multiplyWideAndNarrow(Number, NumberType, Number, NumberType). For multiplication of double/float you're not converting to BigDecimal, and instead multiplying them directly, unlike addition/compare.

andi-huber Feb 17, 2021
Collaborator

Yes, correct. That's because multiplication can be done without widening, while keeping the precision goal, so to speak. Addition and compare in the general case can not keep the precision goal. (compare uses addition internally)

andi-huber · 2021-02-17T05:04:30Z

andi-huber
Feb 17, 2021
Collaborator

Option A (for vote)

Use Double.toString(a) for a float a when converting to BigDecimal.

pros: does the conversion to decimal representation using ~18 decimal places, which covers corner cases well, when the float in binary representation is an exact (or close) representation of the real number it is meant to represent/approximate

cons: Java float literals like 0.1f are in decimal notation and often intended to translate to new BigDecimal("0.1"), but it does not, which might puzzle users of the library

0 replies

andi-huber · 2021-02-17T05:04:41Z

andi-huber
Feb 17, 2021
Collaborator

Option B (for vote)

Use Float.toString(a) for a float a when converting to BigDecimal.

pros: Java float literals like 0.1f are in decimal notation and often intended to translate to new BigDecimal("0.1")

cons: not every float originates from a float literal; this method does the conversion to decimal representation using only ~9 decimal places, which covers corner cases badly, when the float in binary representation is an exact (or close) representation of the real number it is meant to represent/approximate
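
For voters who want to try both options side by side, here is a minimal sketch. The method names optionA and optionB are labels invented for this example and do not exist in Indriya.

```java
import java.math.BigDecimal;

public class VoteOptionsDemo {

    // Option A: widen the float to double, then use the double's decimal string.
    static BigDecimal optionA(float a) {
        return new BigDecimal(Double.toString(a));
    }

    // Option B: use the float's own decimal string.
    static BigDecimal optionB(float a) {
        return new BigDecimal(Float.toString(a));
    }

    public static void main(String[] args) {
        // A float literal that is not exactly representable in binary.
        System.out.println(optionA(0.1f)); // 0.10000000149011612
        System.out.println(optionB(0.1f)); // 0.1

        // A float that is exactly representable: both options agree.
        System.out.println(optionA(0.5f)); // 0.5
        System.out.println(optionB(0.5f)); // 0.5
    }
}
```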

0 replies
