Executor Import/Load time optimization #30361

o-nikolas · 2023-03-29T19:42:38Z

Overview

This PR aims to improve the time it takes to load/import the various executor modules we have in Airflow.

Motivation

The executors are imported in more places now that various compatibility checks are in core Airflow code (re: AIP-51). Also, decreasing import times is very important for the work of executors vending CLI commands (see #29055), since the CLI code in Airflow is particularly sensitive to slow imports (because all code is loaded fresh each time you run an individual Airflow CLI command).

The changes

This PR mostly includes changes to move some expensive imports that are only used for type checking under the TYPE_CHECKING flag so that they are not run at runtime. As well as moving a select few expensive imports closer to the code which uses them.
The most important changes are in the BaseExecutor module since all other executors load this module, and so benefits made here propagate outward.

Testing

I benchmarked these changes by writing a script to import the various executor modules in a fresh python runtime and timing how long that takes (you can test this yourself quickly from a bash shell by doing something like time python -c 'from airflow.executors.local_executor import LocalExecutor'). Then doing that in a loop for several samples (with some randomness in the order for fairness) both on main and on my development branch.

Results

Most executors saw a ~50% speed increase. Kubernetes, and to a lesser extent Celery, are still quite slow and will need more changes specifically targeted to those modules (in a future PR).
The combined executors (e.g. LocalKubernetesExecutor) saw less gains since they import two executors each, so they're paying double the cost (so they saw half the gains, 25%)

^ Add meaningful description above

Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.

airflow/executors/base_executor.py

o-nikolas · 2023-04-04T03:43:53Z

Fresh rebase from main and this is still green. Anyone have time to review/approve? (CC @uranusjr @potiuk @eladkal)

uranusjr

I still suspect some of the local imports are not necessary, but those are simple enough as mentioned and therefore introduce no downsides.

Move some expensive typing related imports to be under TYPE_CHECKING

o-nikolas requested review from dstandish, jedcunningham, kaxil, XD-DENG and ashb as code owners March 29, 2023 19:42

boring-cyborg bot added provider:cncf-kubernetes Kubernetes provider related issues area:Scheduler including HA (high availability) scheduler labels Mar 29, 2023

o-nikolas requested review from potiuk and uranusjr March 29, 2023 19:43

o-nikolas changed the title ~~Type-related import optimization for Executors~~ Executor Import/Load time optimization Mar 29, 2023

uranusjr reviewed Mar 30, 2023

View reviewed changes

airflow/executors/base_executor.py Outdated Show resolved Hide resolved

uranusjr approved these changes Apr 4, 2023

View reviewed changes

Type related import optimization for Executors

1eac90f

Move some expensive typing related imports to be under TYPE_CHECKING

o-nikolas force-pushed the onikolas/executors_type_import_optimization branch from aeeae43 to 1eac90f Compare April 6, 2023 21:40

potiuk approved these changes Apr 7, 2023

View reviewed changes

potiuk merged commit 840dd25 into apache:main Apr 7, 2023

ephraimbuddy added the type:improvement Changelog: Improvements label Apr 11, 2023

o-nikolas mentioned this pull request Apr 24, 2023

Kubernetes Executor Load Time Optimizations #30727

Merged

o-nikolas mentioned this pull request Aug 21, 2023

Make auth managers provide their own airflow CLI commands #33481

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Executor Import/Load time optimization #30361

Executor Import/Load time optimization #30361

o-nikolas commented Mar 29, 2023 •

edited

Loading

o-nikolas commented Apr 4, 2023

uranusjr left a comment

Executor Import/Load time optimization #30361

Executor Import/Load time optimization #30361

Conversation

o-nikolas commented Mar 29, 2023 • edited Loading

Overview

Motivation

The changes

Testing

Results

o-nikolas commented Apr 4, 2023

uranusjr left a comment

Choose a reason for hiding this comment

o-nikolas commented Mar 29, 2023 •

edited

Loading