Aggregation job creator: copy report data directly from `client_reports` to `report_aggregations`, rather than reading full reports #2689

branlwyd · 2024-02-16T01:19:03Z

Currently, the aggregation job creator will read a large number of reports (5000, at time of writing) in order to create aggregation jobs. For VDAFs with large reports, this can require a significant amount of memory.

Instead, the aggregation job creator could read report IDs & other (small) metadata required to generate reports, and use a SQL query which causes Postgres to directly copy the data from the relevant client_reports row into the relevant report_aggregations row. This would decouple the memory usage of the aggregation job creator from the report size of the relevant VDAFs.

The text was updated successfully, but these errors were encountered:

divergentdave · 2024-02-16T17:03:52Z

The aggregation job creator and its SQL proxy are currently outliers in CPU consumption. Doing this copying within the database will improve performance, and give us more headroom before we have to start sharding the aggregation job creator.

divergentdave mentioned this issue Feb 16, 2024

Configurable LIMIT in unaggregated reports query #2690

Merged

divergentdave self-assigned this Feb 16, 2024

divergentdave mentioned this issue Feb 21, 2024

Copy report shares within database when creating aggregation jobs #2727

Merged

divergentdave closed this as completed in #2727 Feb 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aggregation job creator: copy report data directly from `client_reports` to `report_aggregations`, rather than reading full reports #2689

Aggregation job creator: copy report data directly from `client_reports` to `report_aggregations`, rather than reading full reports #2689

branlwyd commented Feb 16, 2024

divergentdave commented Feb 16, 2024

Aggregation job creator: copy report data directly from client_reports to report_aggregations, rather than reading full reports #2689

Aggregation job creator: copy report data directly from client_reports to report_aggregations, rather than reading full reports #2689

Comments

branlwyd commented Feb 16, 2024

divergentdave commented Feb 16, 2024

Aggregation job creator: copy report data directly from `client_reports` to `report_aggregations`, rather than reading full reports #2689

Aggregation job creator: copy report data directly from `client_reports` to `report_aggregations`, rather than reading full reports #2689