Improve cJSON number handling. #132

haydenroche5 · 2024-01-18T21:57:16Z

Get rid of CJSON_NO_CLIB and its associated code paths, per Ray's request.
Change the type of the J object's valueint member. It should be int32_t if NOTE_LOWMEM and int64_t otherwise. Capture this in a new #define for the type, JINTEGER.
When parsing a JSON number from a JSON string, if the number is within the bounds of the JINTEGER type, convert it to an integer using JAtoI rather than simply casting valuenumber to JINTEGER, which can lose precision (e.g. for a Unix timestamp). If the number is outside the bounds of a JINTEGER, saturate, which is the same behavior as before this commit.
When printing a JSON number, print it as an integer if possible. Otherwise, print it as floating point.
Fix an issue where JItoA would fail to convert the minimum long int value (i.e. LONG_MIN) to a string.
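
To illustrate the parsing change, here is a minimal sketch that uses the C standard library's strtod and strtoll as stand-ins for the library's own conversion routines. The typedef jinteger_t and both function names are hypothetical, it assumes the 64-bit (non-NOTE_LOWMEM) case, and it assumes the input is within range so that saturation does not come into play.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-in for JINTEGER in the 64-bit case. */
typedef int64_t jinteger_t;

/* Old approach: parse as floating point, then cast to the integer type.
 * A double has a 53-bit significand, so large integers get rounded.
 * Assumes the parsed value is within the range of jinteger_t. */
static jinteger_t int_via_double(const char *text)
{
    assert(text != NULL);
    double d = strtod(text, NULL);
    return (jinteger_t)d;
}

/* New approach: parse the digits directly as an integer, so every
 * in-range value is preserved exactly. */
static jinteger_t int_via_integer_parse(const char *text)
{
    assert(text != NULL);
    return (jinteger_t)strtoll(text, NULL, 10);
}
```

With a nanosecond-resolution timestamp such as "1705615036123456789" (an illustrative value, not one from this PR), the integer parse returns the exact value, while the cast from double does not, because doubles near that magnitude are spaced 256 apart.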

- Get rid of CJSON_NO_CLIB and its associated code paths, per Ray's request.
- Change the type of the J object's valueint member. It should be int32_t if NOTE_LOWMEM and int64_t otherwise. Capture this in a new #define for the type, JINTEGER.
- When parsing a JSON number from a JSON string, if the number is within the bounds of the JINTEGER type, convert it to an integer using JAtoI rather than simply casting valuenumber to JINTEGER, which can lose precision (e.g. for a Unix timestamp). If the number is outside the bounds of a JINTEGER, saturate, which is the same behavior as before this commit.
- When printing a JSON number, print it as an integer if possible. Otherwise, print it as floating point.
- Fix an issue where JItoA would fail to convert the minimum long int value (i.e. LONG_MIN) to a string.
- Add a slew of new unit tests to make sure number handling behaves as expected.
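
The "print it as an integer if possible" rule can be sketched roughly as follows. This is not the code from the commit: format_number is a hypothetical name, the sketch assumes a double-precision value and a 64-bit integer type, and it uses snprintf with %g for the floating-point fallback purely for illustration.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper: writes value into out as an integer when it is
 * integral and fits in int64_t, otherwise as floating point.
 * Returns what snprintf returns (the length that was or would be written). */
static int format_number(double value, char *out, size_t out_len)
{
    assert(out != NULL && out_len > 0);

    /* Check the range before casting: converting an out-of-range double
     * to an integer type is undefined behavior. Both bounds (-2^63 and
     * 2^63) are exactly representable as doubles. NaN fails both tests. */
    if (value >= -9223372036854775808.0 && value < 9223372036854775808.0) {
        int64_t as_int = (int64_t)value;
        if ((double)as_int == value) {
            return snprintf(out, out_len, "%lld", (long long)as_int);
        }
    }

    /* Not integral, or too large for the integer type. */
    return snprintf(out, out_len, "%g", value);
}
```

The point of the integer branch is that a value like 1705615036.0 keeps all of its digits instead of being shortened to exponent notation by a default-precision floating-point format.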

haydenroche5 · 2024-01-21T23:00:15Z

Added unit tests.

zfields · 2024-01-23T00:35:45Z

n_atof.c

@@ -257,7 +257,7 @@ char **endPtr;              /* If non-NULL, store terminating character's
        case 5:
            p10 = 1.0e32;
            break;
-#ifndef NOTE_FLOAT
+#ifndef NOTE_LOWMEM


Can we change NOTE_LOWMEM to NOTE_C_LOWMEM?

I would like to, yeah. In theory, there could be users depending on this macro. Should we change the define to NOTE_C_LOWMEM, but also define NOTE_LOWMEM if NOTE_C_LOWMEM is defined, for backwards compatibility?

The same goes for NOTE_FLOAT. I opted to eliminate it entirely, but there could be someone out there using it in their code.

You're right. I think we should change it to NOTE_C_LOWMEM, and also declare NOTE_LOWMEM. There should probably be a deprecation message before it is #define'd, in case it was declared by the build system.
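
A possible shape for that compatibility shim is sketched below. The exact policy is an assumption: here a legacy NOTE_LOWMEM coming from the build system triggers a deprecation message and turns on NOTE_C_LOWMEM, and NOTE_C_LOWMEM in turn defines NOTE_LOWMEM for code that still tests the old name. `#pragma message` is used because GCC, Clang, and MSVC all accept it; lowmem_macros_consistent is a hypothetical helper that exists only so the sketch can be checked.

```c
#include <assert.h> /* only needed for the usage check shown after the block */

/* Legacy name supplied by the build system: warn, then map to the new name. */
#if defined(NOTE_LOWMEM) && !defined(NOTE_C_LOWMEM)
#pragma message("NOTE_LOWMEM is deprecated; define NOTE_C_LOWMEM instead.")
#define NOTE_C_LOWMEM
#endif

/* New name in use: keep the old name defined for backwards compatibility. */
#if defined(NOTE_C_LOWMEM) && !defined(NOTE_LOWMEM)
#define NOTE_LOWMEM
#endif

/* Hypothetical helper: returns 1 when the two macros agree (both defined
 * or both undefined), which the shim above is meant to guarantee. */
static int lowmem_macros_consistent(void)
{
#if defined(NOTE_C_LOWMEM) && defined(NOTE_LOWMEM)
    return 1;
#elif !defined(NOTE_C_LOWMEM) && !defined(NOTE_LOWMEM)
    return 1;
#else
    return 0;
#endif
}
```

After including the shim, `assert(lowmem_macros_consistent());` should hold regardless of which of the two macros, if any, the build system defined.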

zfields · 2024-01-23T03:21:40Z

n_cjson_helpers.c

+    // Conversion to unsigned is required to handle the case where n is
+    // LONG_MIN.


This comment is not obvious. Why is this the case?

If n is JINTEGER_MIN, when we negate it with the unary minus operator, it overflows and the result is undefined behavior (I'm pretty sure). If I keep the original code that was here and pass in LONG_MIN, this

n = -n;

just keeps n as LONG_MIN, at least on my system, with my compiler, etc. Then you end up doing n % 10 on a negative number and the whole algorithm breaks down. If we cast n to unsigned, then the rules for negation are different. See: https://stackoverflow.com/questions/8026694/c-unary-minus-operator-behavior-with-unsigned-operands

"The negative of an unsigned quantity is computed by subtracting its value from 2^n, where n is the number of bits in the promoted operand."

So unsignedN = -unsignedN when JINTEGER is 32 bits and n is JINTEGER_MIN yields 2^32-2147483648, which is 2147483648. Then the algorithm works as intended.

I'll explain all this better in an improved code comment.
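
For reference, the approach described above can be sketched as a complete conversion routine. This is an illustration, not the JItoA code from the PR: itoa_sketch and matches_printf are hypothetical names, and the sketch works on plain long rather than JINTEGER.

```c
#include <assert.h>
#include <limits.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for JItoA. The caller's buffer must have room
 * for a sign, all digits, and the terminating NUL. */
static void itoa_sketch(long n, char *out)
{
    assert(out != NULL);

    /* Work on the magnitude as an unsigned value. For n < 0 the
     * conversion wraps modulo 2^N, and unsigned negation wraps back to
     * |n|. This is well defined even for LONG_MIN, whereas n = -n on the
     * signed value would overflow. */
    unsigned long mag = (unsigned long)n;
    if (n < 0) {
        mag = -mag;
    }

    /* Collect digits least-significant first. */
    char digits[32];
    size_t count = 0;
    do {
        digits[count++] = (char)('0' + (int)(mag % 10UL));
        mag /= 10UL;
    } while (mag != 0UL);

    /* Emit the sign, then the digits in reverse order. */
    size_t pos = 0;
    if (n < 0) {
        out[pos++] = '-';
    }
    while (count > 0) {
        out[pos++] = digits[--count];
    }
    out[pos] = '\0';
}

/* Compares the sketch against snprintf's %ld as a reference. */
static int matches_printf(long n)
{
    char got[32];
    char want[32];
    itoa_sketch(n, got);
    snprintf(want, sizeof want, "%ld", n);
    return strcmp(got, want) == 0;
}
```

Checking matches_printf(LONG_MIN) alongside ordinary values such as 0, -7, and LONG_MAX exercises the case this fix is about.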

That explanation makes more sense when the data type is signed. I don't see how this works with an unsigned data type.

JINTEGER_MIN should be 0x80000000, right?

If you make that unsigned, then you have a valid positive number.
unsigned long int unsignedN = n;

Then you negate it with -. Now you have a temporary 64-bit negative number, 0xFFFFFFFF80000000, which you then assign to a 32-bit unsigned number, so you are right back to a valid 32-bit unsigned number, 0x80000000.

I understand that modulo will work better on unsigned values, but I don't quite get the point of setting unsignedN to -unsignedN.

zfields · 2024-01-23T03:23:14Z

n_cjson_helpers.c

+    unsigned long int unsignedN = n;
+    long int i, j;
+    if (n < 0) {
+        unsignedN = -unsignedN;


The data type of unsignedN is unsigned long int, so what does making it negative do?
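
One way to see what the negation does is to try it on an ordinary negative input rather than only on the minimum value. The sketch below is an illustration with hypothetical function names; it relies on C11 for _Generic. Converting a negative long to unsigned long wraps it to a very large value, and unsigned negation wraps that back to the magnitude. Per the usual integer promotion rules, the expression -u keeps the type unsigned long, so no wider signed temporary is involved.

```c
#include <assert.h>
#include <limits.h>

/* Returns |n| as an unsigned long, including for LONG_MIN. */
static unsigned long magnitude(long n)
{
    /* For n < 0 this conversion yields ULONG_MAX + 1 + n. */
    unsigned long u = (unsigned long)n;
    if (n < 0) {
        /* Unsigned negation computes (ULONG_MAX + 1) - u, which is |n|. */
        u = -u;
    }
    return u;
}

/* Returns 1 if the expression -u has type unsigned long. */
static int negation_stays_unsigned_long(void)
{
    unsigned long u = 1UL;
    return _Generic(-u, unsigned long: 1, default: 0);
}

/* Usage example covering a small negative value and the minimum value. */
static void check_magnitude(void)
{
    assert(magnitude(-5L) == 5UL);
    assert(magnitude(7L) == 7UL);
    assert(magnitude(LONG_MIN) == (unsigned long)LONG_MAX + 1UL);
}
```

For LONG_MIN the conversion already produces the right magnitude and the negation happens to leave it unchanged; for every other negative input the negation is what turns the wrapped value into the magnitude.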

haydenroche5 requested review from rayozzie and sdt99 January 18, 2024 21:57

haydenroche5 self-assigned this Jan 18, 2024

haydenroche5 force-pushed the json_number_rework branch 2 times, most recently from cb180ae to fa54b5d, January 21, 2024 22:59

haydenroche5 force-pushed the json_number_rework branch from fa54b5d to eecc84e, January 21, 2024 22:59

haydenroche5 requested a review from zfields January 21, 2024 23:00

sdt99 reviewed Jan 22, 2024

rayozzie merged commit 86aab4a into blues:master Jan 22, 2024
11 checks passed

zfields reviewed Jan 23, 2024
