
chore: [proposal] de-matrix python-version in GHAs #27906

Merged · 9 commits · Apr 10, 2024
Conversation

@mistercrunch (Member) commented Apr 4, 2024

SUMMARY

In this PR:

  • introducing the notion of python-version names 'current' and 'next' for the setup-backend GHA, and using those semantics in the matrix + required checks. setup-backend is the only place where we install python now, so it's DRY and allows us to bump the version in one place in the future, with required-check names staying consistent
  • running pytest against the next version of python as @villebro suggested, same as we've been doing for integration tests
  • simplifying the single-item matrix (python-version) to NOT use a matrix. I'm guessing the single-item matrix is an artifact of supporting multiple versions in the past, and/or of making it easy to go multi-python-version in the future, but there's a burden associated with it, especially around how it relates to the "required checks" specified in .asf.yml
  • fixing/simplifying the related no-op workflows. We'll need new ones, but we'll be able to deprecate a bunch and simplify things. For instance, when we migrate to 3.11 in the future, we won't have to manage a bunch of python-version-specific no-ops

About supporting multiple/future versions of python, I'd argue that we should focus on a single one for a given CI run, and that if/when we need to CI against multiple versions, we run a FULL test suite punctually in a dedicated PR/branch/ref. Point being, it's expensive for every commit to validate multiple versions of python, and in many ways it's not necessary.

Currently our multi-python-version support is dubious at best, with only a few checks that run against multiple versions. I really think we should pick a single version and support it very well. If/when we want to upgrade the python version, we'd cut a PR and run CI for that purpose.

If we want to continuously, actively support multiple python versions (and I don't think we should!), I'd suggest either a release-specific procedure (the release manager using a release branch and running full CI for that version/release) or a nightly job that keeps an eye on that version of python.
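To illustrate the de-matrixing, here's a hedged before/after sketch; job and step layout are illustrative, based on the PR description, not the actual diff:

```yaml
# Before: a single-item matrix bakes the version into the required-check name
jobs:
  test-postgres:
    strategy:
      matrix:
        python-version: ["3.10"]
    steps:
      - uses: actions/setup-python@v5
        with:
          python-version: ${{ matrix.python-version }}

# After (sketch): no matrix; setup-backend resolves 'current' to a concrete
# version in one place, so check names stay stable across python bumps
jobs:
  test-postgres:
    steps:
      - uses: ./.github/actions/setup-backend/
        with:
          python-version: "current"
```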

```diff
 pre-commit:
   strategy:
     matrix:
-      python-version: ["3.9"]
+      python-version: ["3.9", "3.10", ""]
```
A Member commented:

Do we need to include 3.9? Let's just do 3.10

@craig-rueda (Member)

I'm all for just "officially" supporting a single PY version going forward. Most other projects do this AFAIK, so what's stopping us? :)

@raphaelauv

the only benefit of supporting multiple python versions is to increase the chance of offering more compatible client/driver libraries for source data to end users

example: I use database XXX, the python client is not compatible with python 3.11, and I want to use the latest version of Apache Superset X.Y.Z that only supports 3.11

a policy like supporting the last 2 python versions with active support could be a first step, wdyt?

@mistercrunch (Member, Author)

Right, though we need to define what "support" means. Some criteria:

  • providing an official Docker release for that version of python - probably running the CI/test suite at release time
  • running all CI unit/integration tests on that version of python at all times, ensuring that, say, master is compatible with that version most of the time
  • not doing proactive tests, but also not restricting the python package, letting people "run at their own risk" (this never seems desirable, and is kind of what we're doing currently)

What we're doing now:

  • the python package lets users install on 3.9, 3.10 and 3.11 I believe, based on the pyproject.toml
  • we run the full test suite on 3.10
  • we run one or two test suites on 3.11
  • official docker images are all 3.10

@pull-request-size pull-request-size bot added size/L and removed size/M labels Apr 9, 2024
@mistercrunch (Member, Author)

Capturing the conversation from the Release Strategy Group this morning:

  • let's support a single python version officially per release, and run CI against that one mainly
  • let's keep preventing "future regressions" on future python version by running some minimal CI against the future version of python (currently test-postgres against 3.11). For now this is the only place where we need a proper matrix. I might try to rename to "current" and "future" to address the required check issues
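A sketch of the one remaining matrix described above, assuming the 'current'/'next' naming lands as proposed (the exact job layout is illustrative):

```yaml
jobs:
  test-postgres:
    strategy:
      matrix:
        # 'current' resolves to 3.10 and 'next' to 3.11 inside setup-backend,
        # so required-check names survive future python bumps
        python-version: ["current", "next"]
    steps:
      - uses: ./.github/actions/setup-backend/
        with:
          python-version: ${{ matrix.python-version }}
```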

```yaml
shell: bash
run: |
  if [ "${{ inputs.python-version }}" = "current" ]; then
    echo "PYTHON_VERSION=3.10" >> $GITHUB_ENV
  # the 'next' branch is inferred from the thread (next = 3.11); the quoted
  # snippet was truncated here
  elif [ "${{ inputs.python-version }}" = "next" ]; then
    echo "PYTHON_VERSION=3.11" >> $GITHUB_ENV
  fi
```
A Member commented:

No problem for this PR, but I'm thinking at some point we might want to keep our npm/node/python/whatever version numbers in a JSON file at the repo's root, so they can be pulled into scripts, documentation, etc. as needed.
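One way that could look (hypothetical file name and layout; a workflow can't read repo files directly, so a first job would have to surface the values as outputs):

```yaml
# Hypothetical versions.json at the repo root: {"python": "3.10", "node": "18"}
jobs:
  read-versions:
    runs-on: ubuntu-latest
    outputs:
      python: ${{ steps.v.outputs.python }}
    steps:
      - uses: actions/checkout@v4
      - id: v
        run: echo "python=$(jq -r .python versions.json)" >> "$GITHUB_OUTPUT"
  tests:
    needs: read-versions
    runs-on: ubuntu-latest
    steps:
      - uses: actions/setup-python@v5
        with:
          python-version: ${{ needs.read-versions.outputs.python }}
```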

@mistercrunch (Member, Author) replied:

OMG YES! But GHA doesn't play nicely with that kind of parameter injection. Composition through the "reusable action" approach we have here is the best we can do with basic actions (calling a reusable action that hard-codes the values).

If we wanted more flexibility I'd suggest a Jinja2-based approach, where we'd keep source templates in, say, a .github/jinja folder and render them into .github/ via a commit hook or similar. It seemed like overkill for now, but I'd be supportive of moving in that direction.

Compared to where we were a few months back, we're in a much better place with this setup-backend and setup-supersetbot approach.
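If the Jinja2 route were ever taken, a pre-commit hook could drive the rendering; this is only a sketch (the script name and paths are made up):

```yaml
# Hypothetical .pre-commit-config.yaml entry
repos:
  - repo: local
    hooks:
      - id: render-workflows
        name: Render .github/ workflows from .github/jinja/ templates
        entry: python scripts/render_workflows.py  # hypothetical script
        language: system
        files: ^\.github/jinja/
        pass_filenames: false
```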

```yaml
steps:
  - name: No-op for frontend-build
    run: exit 0
```

A Member commented:

There are some inconsistent breaks between blocks if we want to get all OCD here.

I wonder: would it be a problem to add ALL required checks here? If so, I wonder if we could ingest and iterate through the list from asf.yaml rather than having to maintain this.
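Sketching what ingesting that list could look like, assuming .asf.yaml follows the usual ASF Infra layout for branch protection (the exact key path in this repo may differ):

```yaml
steps:
  - uses: actions/checkout@v4
  - name: List required checks declared in .asf.yaml
    run: |
      # yq v4 syntax; the key path is an assumption about this repo's .asf.yaml
      yq '.github.protected_branches.master.required_status_checks.contexts[]' .asf.yaml
```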

@rusackas (Member) left a review:

Left a couple nits/suggestions, but seems fine to me in general.

@mistercrunch mistercrunch merged commit dea4306 into master Apr 10, 2024
38 checks passed
@mistercrunch mistercrunch deleted the no-op branch April 10, 2024 21:32
mistercrunch added a commit that referenced this pull request Apr 11, 2024
After merging #27906, we shouldn't need this no-op file that was a bit of a hack in the first place. For more information as to why we needed this originally, view the comments/docs in the file I'm deleting in this PR
EnxDev pushed a commit to EnxDev/superset that referenced this pull request Apr 15, 2024
qleroy pushed a commit to qleroy/superset that referenced this pull request Apr 28, 2024
jzhao62 pushed a commit to jzhao62/superset that referenced this pull request May 16, 2024
vinothkumar66 pushed a commit to vinothkumar66/superset that referenced this pull request Nov 11, 2024
@mistercrunch added the 🏷️ bot and 🚢 4.1.0 labels Nov 27, 2024