Freeze initialized runtime state for use in subsequent executions. #78

ericsnowcurrently · 2021-07-29T19:27:31Z

This is based on a discusssion @markshannon and I had the other day, but it also relates to discussions I've had with other core devs periodically for several years.

The idea is to start up the runtime, finish initialization, and then take a snapshot of the process memory (or a subset). That snapshot is then rendered as a header file (a la frozen modules) which the runtime can use on subsequent executions to get to that initialized state instead of executing all the usual runtime code. (This is reminiscent of a technique xemacs uses.)

Benefits

possibly skip most of runtime init, getting us to running user code much faster
allow us to do one allocation (for the whole snapshot) instead of the many we normally do
(we may be able to get that snapshot into the DATA section to avoid allocation altogether, though likely not worth the trouble)
the snapshot could be re-used to speed up creating subinterpreters
if we make the snapshot dump human-readable, it could be a useful diagnostic tool

Caveats and Challenges

? other than relatively short-lived ones, most Python processes won't benefit all that much
must be part of the build process (probably not realistic to do at runtime)
taking the snapshot might not be so easy
turning the snapshot back into a fully initialized runtime might not be so easy
there are lots of things to fix up (e.g. offsets, pointers, object hashes, maybe refcounts), which may make it too complex or otherwise neutralize any performance gains
command-line options and env vars can invalidate the snapshot

Open Questions

is it worth it?
is it worth the time to figure out if it's worth it?
would it make sense to do this with a subset of runtime initialization?
what should be in the snapshot?
what should the format be for the snapshot dump?
make it human-readable?
how to turn the initialized runtime into a snapshot?
- in-proc vs. external
- stdout vs. outfile
what should the format be for the data we will use to initialize the runtime? (e.g. in a header file)
how to render the snapshot dump as that data?
how to go from that data to a fully initialized runtime?

ericsnowcurrently · 2021-07-29T19:27:55Z

One thing @markshannon suggested is that we start off with the snapshot as just the initial graph of PyObject *, rather than the full runtime state.

iritkatriel · 2021-07-29T19:38:50Z

must be part of the build process (probably not realistic to do at runtime)

Could it not be impacted by the runtime environment, like ENV variables?

ericsnowcurrently · 2021-07-29T19:43:03Z

They would definitely impact the solution. We'd have to figure out how to deal with that.

ericsnowcurrently mentioned this issue Jul 29, 2021

Use member non-pointer declarations in Py*State structs instead of heap-allocated pointers. #79

Closed

ericsnowcurrently added the investigate label Aug 2, 2021

ericsnowcurrently assigned ericsnowcurrently and unassigned ericsnowcurrently Aug 17, 2021

faster-cpython locked and limited conversation to collaborators Dec 2, 2021

markshannon closed this as completed Dec 2, 2021

gramster added this to Fancy CPython Board Jan 10, 2022

gramster moved this to Todo in Fancy CPython Board Jan 10, 2022

gramster moved this from Todo to Done in Fancy CPython Board Jan 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

Freeze initialized runtime state for use in subsequent executions. #78

Freeze initialized runtime state for use in subsequent executions. #78

ericsnowcurrently commented Jul 29, 2021

ericsnowcurrently commented Jul 29, 2021

iritkatriel commented Jul 29, 2021

ericsnowcurrently commented Jul 29, 2021

This issue was moved to a discussion.

This issue was moved to a discussion.

Freeze initialized runtime state for use in subsequent executions. #78

Freeze initialized runtime state for use in subsequent executions. #78

Comments

ericsnowcurrently commented Jul 29, 2021

Benefits

Caveats and Challenges

Open Questions

ericsnowcurrently commented Jul 29, 2021

iritkatriel commented Jul 29, 2021

ericsnowcurrently commented Jul 29, 2021

This issue was moved to a discussion.