QGOV changes, #4 #11

ThrawnCA · 2022-07-25T04:57:51Z

Overview

Changes to become more compatible with qld-gov-au fork, see #4. This includes several feature improvements.

Capture Flake8 config in a .flake8 file so it applies to manual runs.
Use ckan_cli script so we can run Click or Paster with the same syntax.
Fix typos in README.
Add more documentation of CLI commands.
Add check to ensure that package and resource IDs in URLs match up.
Increase use of six for Py2/Py3 compatibility.
Add a title to validation report links (just containing the validation timestamp).
Use a helper to more robustly detect CKAN 2.9+ in templates, so we're ready for CKAN 3.
Encapsulate database access in ValidationStatusHelper so job logic is simpler.
Turn custom actions into chained actions so other plugins can customise them further.
Update job enqueueing so we only record a resource ID, not the full dictionary (saving memory in Redis).
Skip job enqueueing if we already have a job in progress.
Don't trigger a new validation job for package_patch calls that don't change any resources.
Move the main plugin module up a level, to plugin.py, to simplify paths.
Don't validate "sibling" resources when a single resource is updated.
Use parameterised logging instead of up-front formatting.
Mark broken tests to be skipped, instead of commenting them out.
Use io.BytesIO instead of six.BytesIO for fake files, since it's more consistent.
Pin ckanext-scheming version for consistent testing.

- standardise whitespace - sort imports alphabetically - refactor code to extract helper functions - add more doc comments - put binary operators at start of line instead of end - use parameterised logging

- this is more consistent, when we're already making our strings binary

- We won't necessarily be instantly compatible with all future versions, we need version control

- It's better to put the shared code into plugin.py than into an __init__.py file, and putting it at the higher level lets us skip a bunch of '../' paths

- extract both the package ID and resource ID from the URL and ensure it's the right package for the resource

- This will make it easier to adjust and test our exact enqueueing behaviour since we can examine what is actually sent to the queue

- Passing IDs consumes far less memory on the job queue than entire resource dictionaries

- fix typos - fix repository URLs

- When using a resource API to update a resource, don't treat it as a package update

- Record only the resource ID, instead of the full resource dict. This greatly reduces the memory usage on the job queue server.

- Just use the validation timestamp, it's useful information

… out

…ficient privileges

- Updating a dataset via the web UI should not trigger a validation job

- check for asynchronicity at runtime so we can always hook our action in - use chained actions so we don't block other customisation

- Unless package_patch provides an update to the resources, there's no need for validation

- Synchronous validation doesn't need to pass just an ID, full dict is fine

ThrawnCA added 25 commits July 22, 2022 10:34

[QOL-9122] general cleanup

71f6787

- standardise whitespace - sort imports alphabetically - refactor code to extract helper functions - add more doc comments - put binary operators at start of line instead of end - use parameterised logging

[QOL-9122] replace six.BytesIO and six.StringIO with io.BytesIO

a8ebeb4

- this is more consistent, when we're already making our strings binary

[QOL-9122] introduce 'ckan_cli' to handle both Paster and Click

e2883d8

[QOL-9122] pin version of ckanext-scheming

3ef8a33

- We won't necessarily be instantly compatible with all future versions, we need version control

[QOL-9122] add Flake8 rules and make them pass

870a96f

[QOL-9122] use Flake8 config in CI instead of manually specifying

0844e28

[QOL-9122] move plugin module to higher level

5cf0557

- It's better to put the shared code into plugin.py than into an __init__.py file, and putting it at the higher level lets us skip a bunch of '../' paths

[QOL-9122] use cgi-based mock file storage for testing on CKAN < 2.9

43b1fa7

[QOL-9122] ensure URL package IDs match resource IDs

dfc5683

- extract both the package ID and resource ID from the URL and ensure it's the right package for the resource

[QOL-9122] mock core enqueue function instead of ours

2af8bc9

- This will make it easier to adjust and test our exact enqueueing behaviour since we can examine what is actually sent to the queue

[QOL-9122] allow job consumer to accept resource IDs

08e07fe

- Passing IDs consumes far less memory on the job queue than entire resource dictionaries

[QOL-9122] update README

c556ffd

- fix typos - fix repository URLs

[QOL-9122] skip validating untouched resources in a package

cfa3d15

- When using a resource API to update a resource, don't treat it as a package update

[QOL-9122] retrieve package ID if not provided on resource update

e353f9e

[QOL-9122] shrink validation jobs

661a371

- Record only the resource ID, instead of the full resource dict. This greatly reduces the memory usage on the job queue server.

[QOL-9122] add title to the validation link

1bb3617

- Just use the validation timestamp, it's useful information

[QOL-9122] use variable to manage ckan_cli path in one place

1f61260

[QOL-9122] use ValidationStatusHelper to simplify database access

6f1709c

[QOL-9122] mark broken tests to be skipped instead of commenting them…

4c977db

… out

[QOL-9122] add unit test for attempting to run validation without suf…

aa188bc

…ficient privileges

[QOL-9122] add template helper to reliably detect CKAN 2.9+

b85d8dd

[QOL-9122] add test for avoiding unnecessary validation

79a1668

- Updating a dataset via the web UI should not trigger a validation job

[QOL-9122] make custom actions more robust

6bdf4ab

- check for asynchronicity at runtime so we can always hook our action in - use chained actions so we don't block other customisation

[QOL-9122] skip validation on package_patch API

013c17c

- Unless package_patch provides an update to the resources, there's no need for validation

[QOL-9122] drop TODO that we don't need to implement

fcf353f

- Synchronous validation doesn't need to pass just an ID, full dict is fine

ThrawnCA mentioned this pull request Jul 26, 2022

CKAN 2.9 / Python 3 ckan/ckanext-validation#55

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QGOV changes, #4 #11

QGOV changes, #4 #11

ThrawnCA commented Jul 25, 2022

QGOV changes, #4 #11

Are you sure you want to change the base?

QGOV changes, #4 #11

Conversation

ThrawnCA commented Jul 25, 2022

Overview