Accepts strings with control characters #348

jwhear · 2014-04-24T17:29:41Z

jq (1.3) parses, without complaint, strings which contain control character U+0083, whereas the JSON spec excludes control characters from strings.

pkoppstein · 2014-05-21T04:33:21Z

Nothing I've read about jq suggests that it should reject invalid JSON. In fact, it would be great if it had the ability (perhaps governed by a switch) to transform imperfect JSON into JSON.

Please also note that the most recent "Proposed Standard" for JSON (http://tools.ietf.org/html/rfc7159) explicitly says:

A JSON parser MAY accept non-JSON forms or extensions.

nicowilliams · 2014-06-09T04:08:20Z

Indeed. I suppose a strict mode would be nice.

nicowilliams · 2014-06-11T20:33:52Z

Actually, this is dangerous, therefore I'm re-opening this.

We're considering defining a "JSON text sequence" MIME type that corresponds roughly to what jq does. Allowing unescaped newlines in strings is destructive to the ability to recover from stream corruption (discard corrupted entries), which can result when they are written in O_APPEND style (think power failures). We're also considering the use of ASCII RS as a text separator for similar purposes. Allowing these text separators (newline or RS) to appear unescaped in strings breaks the recovery algorithm.

DO NOT rely on jq's willingness to accept unescaped control characters in strings.

nicowilliams · 2014-12-24T05:55:56Z

@jwhear RFC7159 says:

   The representation of strings is similar to conventions used in the C
   family of programming languages.  A string begins and ends with
   quotation marks.  All Unicode characters may be placed within the
   quotation marks, except for the characters that must be escaped:
   quotation mark, reverse solidus, and the control characters (U+0000
   through U+001F).

U+0083 is not included in the must-be-escaped list.

nicowilliams · 2014-12-24T06:38:13Z

Oh, this breaks a test in a most non-obvious way. I'll look again after sleeping.

nicowilliams closed this as completed Jun 9, 2014

pkoppstein mentioned this issue Jun 11, 2014

Comments in Json #402

Closed

nicowilliams reopened this Jun 11, 2014

nicowilliams added the interop label Jun 11, 2014

nicowilliams added this to the 1.5 release milestone Jun 11, 2014

nicowilliams closed this as completed in 8ca07a0 Dec 24, 2014

wtlangford mentioned this issue Jun 8, 2016

Trailing nulls stripped from raw input #1128

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accepts strings with control characters #348

Accepts strings with control characters #348

jwhear commented Apr 24, 2014

pkoppstein commented May 21, 2014

nicowilliams commented Jun 9, 2014

nicowilliams commented Jun 11, 2014

nicowilliams commented Dec 24, 2014

nicowilliams commented Dec 24, 2014