No JSON in header value #17

msramek · 2017-03-22T13:19:57Z

Please have a look! This simplifies the header syntax from a JSON to a list of strings. One thing I wasn't sure how to do was how to refer to the parsing algorithm. The way I understand it, the ABNF definition itself already represents a parsing algorithm, so I just added s there and linked them below.

mikewest

Some quick thoughts, thanks for putting this together!

mikewest · 2017-03-22T14:09:19Z

index.src.html

-                ; See Section 2 of [[HTTP-JFV]], and Section 2 of [[RFC7159]]
+    Clear-Site-Data = "Clear-Site-Data" ":" <a>clear-site-data-value</a>
+    <dfn>clear-site-data-value</dfn> = OWS 1#(<a>type</a> OWS)
+    <dfn>type</dfn> = atom


I'd suggest something like:

Clear-Site-Data = 1#( <a>data-type-value</a> / <a>extension-type-value</a> ) <dfn>data-type-value</dfn> = "<dfn>cache</dfn>" / "<dfn>cookies</dfn>" / "<dfn>storage</dfn>" / "<dfn>executionContext</dfn>" <dfn>extension-type-value</dfn> = 1*( <a>ALPHA</a> / "-" )

Updated, but with some changes:

-> From what I have read in other specs so far, the definition of a header always includes the key and separator (i.e. "Clear-Site-Data" ":"), not just the value
-> That's why I have the token "clear-site-data-value", which stands for the header value only
-> The "data-type" and "extension-type" tokens do not need the suffix "-value" (they represent the datatypes, not the header value as a whole, so the name is expressive enough)
-> I still think we need to specify OWS; I don't think spaces around tokens should be a syntax error
-> You suggested that alpha is wrapped in anchors; do you want it to link to RFC5234?

-> From what I have read in other specs so far, the definition of a header always includes the key and separator (i.e. "Clear-Site-Data" ":"), not just the value

I'm following along with things like https://tools.ietf.org/html/rfc7231#section-3.1.2.2 here. I'm sure I've done it differently elsewhere, but this is what I think IETF folks have solidified upon for header ABNF.

-> The "data-type" and "extension-type" tokens do not need the suffix "-value" (they represent the datatypes, not the header value as a whole, so the name is expressive enough)

Sure.

-> I still think we need to specify OWS; I don't think spaces around tokens should be a syntax error

This is part of the # rule in https://tools.ietf.org/html/rfc7230#section-7 which we should be linking to. @annevk suggested elsewhere that we link to https://fetch.spec.whatwg.org/#abnf to make that clear.

-> You suggested that alpha is wrapped in anchors; do you want it to link to RFC5234?

Yes, please. Just copy/paste from https://github.com/w3c/webappsec-csp/blob/master/index.src.html#L101. :)

Ah, alright. I must have read something older then. Updated.

I'm now linking to 7230 for #rule, since it's more detailed (e.g. contains the discussion about OWS). That is helpful at least for a reader like myself.

mikewest · 2017-03-22T14:11:36Z

index.src.html

-  MUST be an array, and that array MUST contain only strings; any other types
-  will result in a parse error.
+  Each atom in the array represents a data type that the user agent MUST
+  clear, and will be parsed as defined in [[#parsing]].


I'd replace these two sentences with something like [[#fetch-integration]] and [[#parsing]] describe how the Clear-Site-Data header is processed.

mikewest · 2017-03-22T14:14:56Z

index.src.html

-  The following are the initial set of known types which may be specified in
-  the member's array value. Future versions of this document may define
+  The following are the initial set of known types which may be specified as
+  the array's elements. Future versions of this document may define
  additional types, and user agents MUST ignore unknown types when parsing the
  header:


Maybe something like The <a grammar>data-type-value</a> grammar defines an initial set of known data types which can be cleared using this API. Future versions ..., and then following it up with the list.

mikewest · 2017-03-22T14:22:23Z

index.src.html


-  5.  Return |types|.
+  5.  Return |recognizedTypes|.


You should account for failure too (e.g., x x would be a value that leads to failure). Might also be clearer to call the return variable values rather than header.

@annevk: The suggestion accounts for failure by adding |type| to |data-types| only if it matches the list of known keywords in data-type-value. Do you think we should do something more than that?

Then you wouldn't have to account for null either. But more importantly, you can't for each failure.

I think we do have to account for null, because we can't for...each over null. But reading Fetch again, I see your point: step 2 and 4.2 of https://fetch.spec.whatwg.org/#extract-header-list-values can return failure. I guess we'd want to handle that in step 3 as well. Thanks!

Done. Agree with returning failure in level 3, this is what Chromium's implementation actually does (we print an error to the console, we don't just ignore it).

Clarifying question: Is it OK to omit the old step #1 which checked if the header exists? Because if we run the current version of the algorithm on any random response, it will return failure. My interpretation of that would be that the browser outputs a console error every time it doesn't see a Clear-Site-Data header. Or is it assumed from the context that the algorithm is only executed for responses that do have the header?

Done. Agree with returning failure in level 3, this is what Chromium's implementation actually does (we print an error to the console, we don't just ignore it).

I'd actually suggest changing this to something like "If |header| is `null` or failure, return an empty list.", and adding a note that user agents should inform developers of failure to parse the header. If you "return failure", then you need to handle "failure" in https://w3c.github.io/webappsec-clear-site-data/#clear-response instead. I think it makes more sense to handle it here by returning an empty list, which means the clearing operation will just do the right thing (nothing).

Clarifying question: Is it OK to omit the old step #1 which checked if the header exists?

Step 1 of https://fetch.spec.whatwg.org/#extract-header-list-values does this check, and returns null, in which case we return an empty list.

Makes sense. Thanks for the explanation! Done.

mikewest

Left some small comments, thanks!

mikewest · 2017-03-23T13:59:01Z

LGTM. I'll rebuild the file for you in a subsequent patch.

No JSON in header.

6b2f9b1

mikewest reviewed Mar 22, 2017

View reviewed changes

Updated the ABNF definition and parsing algorithm.

c751de3

mikewest reviewed Mar 23, 2017

View reviewed changes

Improved the ABNF definition, fixed the failure mode in parsing.

99507b6

mikewest merged commit a430ebe into w3c:master Mar 23, 2017

mikewest mentioned this pull request Mar 23, 2017

Sketch out prose for algorithm definitions. whatwg/infra#92

Closed

mikewest mentioned this pull request Jun 7, 2017

Ensure that more complicated filters are possible to add in the future. #27

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

No JSON in header value #17

No JSON in header value #17

msramek commented Mar 22, 2017

mikewest left a comment

mikewest Mar 22, 2017

msramek Mar 23, 2017

mikewest Mar 23, 2017

msramek Mar 23, 2017

mikewest Mar 22, 2017

mikewest Mar 22, 2017

mikewest Mar 22, 2017

annevk Mar 22, 2017

mikewest Mar 23, 2017

annevk Mar 23, 2017

mikewest Mar 23, 2017

msramek Mar 23, 2017

mikewest Mar 23, 2017

mikewest Mar 23, 2017

msramek Mar 23, 2017

mikewest left a comment

mikewest commented Mar 23, 2017

No JSON in header value #17

No JSON in header value #17

Conversation

msramek commented Mar 22, 2017

mikewest left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mikewest left a comment

Choose a reason for hiding this comment

mikewest commented Mar 23, 2017