Db partition analytics cmeyers2 #10023
Conversation
Build failed.
Force-pushed 4c3bf9c to a6922be (compare)
Build failed.
Force-pushed a6922be to 33a4665 (compare)
Build failed.
Build failed.
Force-pushed e89cb41 to e814bee (compare)
Build failed.
Force-pushed e814bee to a249487 (compare)
Build failed.
Build failed.
Force-pushed 271e6c7 to 641db8a (compare)
Build succeeded.
Merge Failed. This change or one of its cross-repo dependencies was unable to be automatically merged with the current state of its repository. Please rebase the change and upload a new patchset.
Force-pushed 3a417ad to 19a7fbb (compare)
Build succeeded.
Build succeeded.
Build failed.
Build failed.
Build failed.
Force-pushed 5ff5811 to 0010040 (compare)
Build failed.
Build succeeded.
Force-pushed ba79159 to 920dcc8 (compare)
* Old, _unpartitioned_main_jobevent table does not have the job_created column.
* New main_jobevent table does.
* Always include the job_created column: NULL if old, job_created if new (see the sketch below).
* Bump events_table schema version from 1.2 to 1.3 because of the job_created field.
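A minimal sketch of the idea (the function and column list are hypothetical; only job_created and the two table names come from this change):

```python
def events_select(partitioned: bool) -> str:
    """Build the gather SELECT so old and new tables emit identical columns."""
    if partitioned:
        # New table: job_created is a real column (the partition key).
        return "SELECT id, created, job_created FROM main_jobevent"
    # Old table: emit NULL so the schema 1.3 CSV layout still matches.
    return (
        "SELECT id, created, NULL AS job_created "
        "FROM _unpartitioned_main_jobevent"
    )
```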
* The ORDER BY results in an in-memory sort that COULD blow out the worker mem buffer and force the sort to take place on disk.
* This WILL happen with the default postgres 4MB mem buffer; we saw as much as 20MB used. Note that AWX defaults the postgres worker mem buffer to 3% of DB memory on external installs and 1% on same-node installs, so for a 16GB remote DB this would not be a problem.
* We are going to avoid this problem altogether by NOT sorting when gathering. Instead, we will sort remotely, in analytics (see the sketch below).
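A sketch of the resulting query shape (SQL embedded in Python; the WHERE clause is illustrative). The point is the missing ORDER BY: with a default 4MB work_mem, sorting ~20MB of events would spill to disk.

```python
# No ORDER BY: ordering is deferred to the analytics side, so postgres
# never has to sort (in memory or on disk) while gathering.
COPY_EVENTS_SQL = """
COPY (
    SELECT * FROM main_jobevent
    WHERE modified >= '{start}' AND modified < '{end}'
) TO STDOUT WITH CSV HEADER
"""
```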
* Before, we would get the min and max pk of the set we are to gather. This changeset removes that.
* Before, we would, in effect, know the size of the set we are to gather and would query 100,000 of those job event records at a time. That logic is now gone.
* Now, for unpartitioned job events, we gather 4 hours at a time by created time.
* Now, for partitioned job events, we gather 4 hours at a time by modified time (see the windowing sketch below).
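A minimal sketch of the 4-hour windowing (the helper name is an assumption); the same generator applies whether the sliced column is created (unpartitioned) or modified (partitioned):

```python
from datetime import datetime, timedelta

def four_hour_windows(start: datetime, end: datetime):
    """Yield half-open (lo, hi) windows covering [start, end) in 4h steps."""
    step = timedelta(hours=4)
    lo = start
    while lo < end:
        hi = min(lo + step, end)
        yield lo, hi
        lo = hi
```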
* Trigger via jobs/<id>/job_events/?limit=10.
* Can and should be used in conjunction with an indexed set of fields to generate efficient pagination queries, e.g. jobs/<id>/job_events/?limit=10&start_line__gte=10 (usage sketch below).
* If limit is not specified in the query params, the default pagination is used.
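A hypothetical client-side use of the new limit parameter (host, token, and job id are placeholders), paired with an indexed filter as recommended above:

```python
import requests

resp = requests.get(
    "https://awx.example.com/api/v2/jobs/42/job_events/",
    params={"limit": 10, "start_line__gte": 10},  # indexed filter + limit
    headers={"Authorization": "Bearer TOKEN"},
)
events = resp.json()["results"]
```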
* Do not cascade delete unified job events; we will clean those up in cleanup_job runs (model sketch below).
* Add limit pagination to all unified job events endpoints.
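A sketch of what "no cascade delete" can look like at the Django model layer (model and field names are illustrative, not AWX's actual models):

```python
from django.db import models

class Job(models.Model):
    pass

class JobEvent(models.Model):
    # DO_NOTHING instead of CASCADE: deleting a job leaves its (possibly
    # huge) event rows in place; periodic cleanup_job runs remove them.
    job = models.ForeignKey(
        Job,
        related_name="job_events",
        on_delete=models.DO_NOTHING,
        db_constraint=False,  # no FK constraint, so the db won't cascade either
    )
```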
* Use an initial request for the max event `counter` to get the total row count; after that, rely on websocket message counters to update the remote row count (see the sketch below).
* For running jobs, request event ranges by counter to handle events getting saved to the db out of display order.
* For jobs that are no longer running, continue to use the page/pageSize scheme for paging through the job events.
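A small sketch of the counter-based row count (in Python for consistency, though the real logic lives in the UI); the helper and message shape are assumptions:

```python
def fetch_max_counter(job_id: int) -> int:
    """Placeholder for the one initial REST request for the max event counter."""
    raise NotImplementedError

remote_row_count = 0  # seeded once from fetch_max_counter(job_id)

def on_websocket_message(msg: dict) -> None:
    """Counters are monotonic per job, so the count only ever grows,
    even when events are saved to the db out of display order."""
    global remote_row_count
    remote_row_count = max(remote_row_count, msg["counter"])
```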
* job_created is a fake field as far as Django is concerned. Under the hood, in postgres, it is real: it is the partition key. sqlite doesn't support partitioning, so we need to fake some things; specifically, we need to stop job_created from being auto-added in get_event_queryset().
* Add pagination tests for the <unified_job_name>/<id>/job_events/?limit=x endpoint to make sure the paginator is wired up (test sketch below).
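A hypothetical pytest sketch of the kind of pagination test being added (the get fixture and expect kwarg mimic common AWX test helpers; treat them as assumptions):

```python
import pytest

@pytest.mark.django_db
def test_job_events_limit(get, job, admin_user):
    # ?limit=3 should cap the page size regardless of how many events exist.
    url = f"/api/v2/jobs/{job.pk}/job_events/?limit=3"
    response = get(url, user=admin_user, expect=200)
    assert len(response.data["results"]) <= 3
```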
Force-pushed ada48ba to ffbbcd2 (compare)
thx @kdelee!
Build succeeded.
Build succeeded (gate pipeline).
Add OPTIONS documentation for new job limit feature
Looking at the docs and stuff from #10023. I'm sure this is documented somewhere else too, but this is the place that users should naturally expect it to be.
Reviewed-by: Chris Meyers <None>
Benchmark notes (run against the event tables):
* Dataset: 80 million partitioned + 1.5 million unpartitioned events
* ::json casts, 100,000 event batches*

*Micro benchmarking consists of simply copying a query, running it manually, and observing the runtime.
**Estimated total = micro benchmark time x (80 million / batch size)
**Note that this testing does NOT include the extra modified-range query that is needed for correctness. We expect this to be quite fast; it is only needed to catch edge case events.