Align collection job with new aggregation job semantics #596

inahga · 2024-09-27T17:22:25Z

Closes #588.

This does the following:

- Communicate job status through a field in the body, while always returning 200 OK for polls.
- Communicate the polling endpoint through the Location header.
- Shuffle paragraphs around for readability.
- Align structure names and media types with the aggregation job ones.

This is a fair bit more invasive than I would have hoped; LMK if anything can be stripped out.

inahga · 2024-09-27T17:25:44Z

It raises the question of whether we should support both sync/async like with aggregation. (n.b. I preserved async-only in this PR).

The only technical rationale I have for it is that since most implementations are doing VDAFs without aggregation parameters, and thus can eagerly aggregate, collection jobs can be fulfilled remarkably quickly. We find that a lot of delay in processing collection jobs happens to just be with polling. If the collection job could be fulfilled instantly, that would be nice.

I don't feel strongly about this though, nor do I think collection jobs are a particularly expensive part of the system, so I'm content to leave it be.

branlwyd · 2024-09-27T18:09:15Z

draft-ietf-ppm-dap.md

-The Leader then begins working with the Helper to aggregate the reports
-satisfying the query (or continues this process, depending on the VDAF) as
-described in {{aggregate-flow}}.
-
 Changing a collection job's parameters is illegal, so further requests to
 `PUT /tasks/{tasks}/collection_jobs/{collection-job-id}` for the same


Suggested change

-`PUT /tasks/{tasks}/collection_jobs/{collection-job-id}` for the same
+`PUT /tasks/{task-id}/collection_jobs/{collection-job-id}` for the same

(not caused by this PR -- aligns with other uses of parameterized URL path fragments)

branlwyd · 2024-09-27T18:12:11Z

draft-ietf-ppm-dap.md

+with HTTP status 201 Created with a body containing a `CollectionJobResp`. The
+`status` field is set to `processing`, and the response contains a Location
+header containing the relative reference
+`/tasks/{task-id}/collection_jobs/{collection-job-id}`. The Leader SHOULD


Q. What's the justification for including a Location field, here or in the aggregation interaction? In both cases, the URI included in the Location header field is entirely determined by the specification, and the HTTP client has enough information to construct the URI themselves.

cjpatton · 2024-10-03T21:32:50Z

Let's make sure we have parity with aggregation jobs, but in both cases I agree we probably don't need a Location header.

inahga · 2024-10-08

The rationale is (unfortunately) subtle.

https://www.rfc-editor.org/rfc/rfc9110#name-201-created

> The 201 (Created) status code indicates that the request has been fulfilled and has resulted in one or more new resources being created. The primary resource created by the request is identified by either a Location header field in the response or, if no Location header field is received, by the target URI.
>
> The 201 response content typically describes and links to the resource(s) created. Any validator fields (Section 8.8) sent in the response convey the current validators for a new representation created by the request. Note that the PUT method (Section 9.3.4) has additional requirements that might preclude sending such validators.

Consider aggregation: we PUT at `{helper}/tasks/{task-id}/aggregation_jobs/{aggregation-job-id}`, BUT we expect the Leader to poll the job at `{helper}/tasks/{task-id}/aggregation_jobs/{aggregation-job-id}?step=0`. That is, the resource we're polling is at a different URI than the one that was provided in the PUT.

However, this is not true for collection. So I think we can (and should) eschew the Location header for collection. I will edit.
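To illustrate the argument above, here is a minimal Python sketch that compares the URI used for the PUT with the URI that is later polled. It is only one reading of the reasoning, not text from the draft: the function names (`aggregation_job_uris`, `collection_job_uris`, `needs_location_header`) and the base-URL parameters are hypothetical, and the rule "a Location header is only useful when the polled URI differs from the PUT target" is an assumption drawn from the RFC 9110 passage quoted earlier.

```python
def aggregation_job_uris(helper: str, task_id: str, job_id: str) -> tuple[str, str]:
    """Return (put_uri, poll_uri) for an aggregation job.

    The poll URI carries a `step` query parameter, so it differs from
    the URI that was the target of the PUT.
    """
    put_uri = f"{helper}/tasks/{task_id}/aggregation_jobs/{job_id}"
    poll_uri = f"{put_uri}?step=0"
    return put_uri, poll_uri


def collection_job_uris(leader: str, task_id: str, job_id: str) -> tuple[str, str]:
    """Return (put_uri, poll_uri) for a collection job.

    The Collector polls the same URI it sent the PUT to.
    """
    uri = f"{leader}/tasks/{task_id}/collection_jobs/{job_id}"
    return uri, uri


def needs_location_header(put_uri: str, poll_uri: str) -> bool:
    """Assumed rule: without a Location header, a 201 response identifies
    the created resource by the target URI, so the header only adds
    information when the polled resource lives elsewhere."""
    return put_uri != poll_uri
```

Under these assumptions, `needs_location_header` is true for the aggregation pair and false for the collection pair, which matches the conclusion that the header can be dropped for collection.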

branlwyd · 2024-09-27T18:18:43Z

draft-ietf-ppm-dap.md

+After receiving the response to its `CollectionJobReq`, the Collector makes an
+HTTP GET request to the aforementioned Location to check on the status of the


Suggested change

-After receiving the response to its `CollectionJobReq`, the Collector makes an
-HTTP GET request to the aforementioned Location to check on the status of the
+After receiving the response to its `CollectionJobReq`, the Collector periodically makes
+HTTP GET requests to the aforementioned Location to check on the status of the

I think we should highlight that this is a repeated polling operation -- as currently written, the start of this sentence reads as if only a single HTTP GET is sent.

cjpatton · 2024-10-03T22:00:23Z

Also, the response might have had status "finished", in which case there's no need to poll?

inahga · 2024-10-08

> Also, the response might have had status "finished", in which case there's no need to poll?

I added a blurb to the paragraph beginning "Once both aggregate shares are successfully obtained". Let me know if this reads clearer.
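The polling behavior discussed in this thread could be sketched as follows. This is a hedged illustration, not the draft's wording: `poll_collection_job`, `get_job`, `interval_s`, and `max_polls` are hypothetical names, the wire encoding of `CollectionJobResp` is abstracted into a plain status string, and treating any non-200 poll response as an error is an assumption. Only the `processing` and `finished` status values and the "always 200 OK for polls" rule come from the PR.

```python
import time
from typing import Callable, Tuple


def poll_collection_job(
    initial_status: str,
    get_job: Callable[[], Tuple[int, str]],
    interval_s: float = 1.0,
    max_polls: int = 10,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[str, int]:
    """Poll until the job leaves the "processing" state.

    initial_status: status taken from the response to the initial PUT.
    get_job: hypothetical callable that performs the HTTP GET and
             returns (http_status, job_status).
    Returns (final_status, number_of_polls_made).
    """
    status = initial_status
    polls = 0
    # If the initial response was not "processing", no GET is sent at all.
    while status == "processing":
        if polls >= max_polls:
            raise TimeoutError("collection job still processing")
        sleep(interval_s)
        http_status, status = get_job()
        polls += 1
        # Assumption: polls always return 200 OK; job state is in the body.
        if http_status != 200:
            raise RuntimeError(f"unexpected HTTP status {http_status}")
    return status, polls
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without real delays; a real Collector would more likely use backoff than a fixed interval.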

branlwyd · 2024-09-27T20:10:21Z

> It raises the question of whether we should support both sync/async like with aggregation. (n.b. I preserved async-only in this PR).
>
> The only technical rationale I have for it is that since most implementations are doing VDAFs without aggregation parameters, and thus can eagerly aggregate, collection jobs can be fulfilled remarkably quickly. We find that a lot of delay in processing collection jobs happens to just be with polling. If the collection job could be fulfilled instantly, that would be nice.
>
> I don't feel strongly about this though, nor do I think collection jobs are a particularly expensive part of the system, so I'm content to leave it be.

IMO, we shouldn't consider specifying sync collection unless someone wants to implement it. It would be easy enough to specify, but would lead to further implementation complexity, similarly to the complexity induced by supporting both sync & async aggregation.

cjpatton

> IMO, we shouldn't consider specifying sync collection unless someone wants to implement it. It would be easy enough to specify, but would lead to further implementation complexity, similarly to the complexity induced by supporting both sync & async aggregation.

When we discussed this yesterday (2024/10/2), I was in agreement here, but after reading this PR, I actually think the symmetry with the aggregation job section does a lot to improve readability. I don't have a strong opinion either way, but if we think this won't add too much complexity to the specification, then I think we should keep it.


cjpatton · 2024-10-03T21:32:50Z

draft-ietf-ppm-dap.md

+with HTTP status 201 Created with a body containing a `CollectionJobResp`. The
+`status` field is set to `processing`, and the response contains a Location
+header containing the relative reference
+`/tasks/{task-id}/collection_jobs/{collection-job-id}`. The Leader SHOULD


Let's make sure we have parity with aggregation jobs, but in both cases I agree we probably don't need a Location header.


cjpatton · 2024-10-03T22:00:23Z

draft-ietf-ppm-dap.md

+After receiving the response to its `CollectionJobReq`, the Collector makes an
+HTTP GET request to the aforementioned Location to check on the status of the


Also, the response might have had status "finished", in which case there's no need to poll?


branlwyd · 2024-10-04T21:48:01Z

> When we discussed this yesterday (2024/10/2), I was in agreement here, but after reading this PR, I actually think the symmetry with the aggregation job section does a lot to improve readability. I don't have a strong opinion either way, but if we think this won't add too much complexity to the specification, then I think we should keep it.

I've been thinking about this some more.

I still think async-only is the right choice for collection, as (a) no one has expressed a desire to implement sync collection & (b) processing a collection job may require aggregating an arbitrary number of reports, with the number of reports to aggregate at least partially out of control of the aggregators -- so sync collection is unlikely to work well in many cases.

If we want to resolve the difference between the aggregation interaction and the collection interaction, I think we should try to attain enough implementation experience with async aggregation to show that it is at least as good as sync aggregation for all use cases, then remove sync aggregation. This would simplify the protocol as well as resolve the disparity between aggregation and collection.

That said, I don't feel strongly enough about this to insist on it. If this argument is not convincing, we can specify both -- there is nothing stopping us from dropping sync aggregation/collection later, if implementation experience shows this is possible.


inahga requested review from tgeoghegan, branlwyd, ekr and chris-wood as code owners September 27, 2024 17:22

branlwyd reviewed Sep 27, 2024


cjpatton added wire breaking draft-12 labels Sep 27, 2024

cjpatton reviewed Oct 3, 2024


inahga force-pushed the inahga/align-collection branch from dc5b067 to d2590ed Compare October 4, 2024 15:09

inahga requested review from cjpatton and branlwyd October 8, 2024 19:21

inahga mentioned this pull request Oct 8, 2024

Editorial: link to encrypting input shares #605

Merged

cjpatton approved these changes Oct 8, 2024


inahga force-pushed the inahga/align-collection branch 2 times, most recently from 514e6be to 363df15 Compare October 9, 2024 15:14

branlwyd approved these changes Oct 9, 2024


inahga force-pushed the inahga/align-collection branch from d2619c5 to 69e63b9 Compare October 9, 2024 16:24

branlwyd merged commit d41cbde into ietf-wg-ppm:main Oct 9, 2024
1 check passed

branlwyd mentioned this pull request Oct 11, 2024

Implement DAP-13 divviup/janus#3436

Open

20 tasks


