esm, loader: move to own thread #43658

JakobJingleheimer · 2022-07-02T19:38:09Z

What is the problem this feature will solve?

- limit contamination
- facilitate synchronous import.meta.resolve()

For the initial implementation, the same loaders thread will be used for all user-land threads. A subsequent enhancement may add a configuration option to the Worker constructor to spawn its own dedicated loaders thread.

What is the feature you are proposing to solve the problem?

Move loaders off-thread

What alternatives have you considered?

No response

Per direction from TSC

Following in the vein of babel: babel/babel#14025
And https://github.com/bmeck/using-node-workshop/tree/main/steps/6_async_and_blocking

notify package authors of change (please comment to be added if not already included—sorry I know only so many)
- esmock
- jest?
- mocha (Gil Tayar)
- ts-node (Andrew Bradley)
- yarn (Maël Nison)

JakobJingleheimer · 2022-07-02T19:49:08Z

@bmeck @MylesBorins could either of you speak to the impetus for this?

aduh95 · 2022-07-05T09:29:19Z

Related: #31229

GeoffreyBooth · 2022-07-20T05:57:04Z

> @bmeck @MylesBorins could either of you speak to the impetus for this?

Pros and cons here: nodejs/modules#351 (comment)

cspotcode · 2022-07-20T11:11:06Z

Can add to the list of package authors:
esmock

I wonder, does it make sense for the loaders team to have a thread somewhere which can be subscribed to for notifications of breaking changes? All relevant discussion can happen elsewhere, the thread would be like an RSS feed. Package maintainers can opt-in to notifications of breaking or potentially exciting/disruptive changes by subscribing to that thread.

Might scale better than us hoping we know a comprehensive list of all loaders.

JakobJingleheimer · 2022-07-20T21:40:20Z

Since Node.js doesn't maintain anything of the sort, that sounds like a better alternative to "surprise!". It's not foolproof, though.

JakobJingleheimer · 2022-09-17T13:21:42Z

We have a working Proof of Concept: https://github.com/JakobJingleheimer/worker-atomics
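
For readers unfamiliar with the approach, here is a minimal sketch of how one thread can block on work that another thread performs asynchronously, using `Atomics.wait` over a `SharedArrayBuffer`. It is not taken from the linked repository or from Node.js internals; `resolveSync`, the inline worker source, and the `file:///resolved/` prefix are hypothetical stand-ins.

```javascript
const {
  Worker,
  MessageChannel,
  receiveMessageOnPort,
} = require('node:worker_threads');

// Hypothetical worker body: answers each request asynchronously, then wakes the caller.
const workerSource = `
  const { workerData } = require('node:worker_threads');
  const { lock, port } = workerData;
  port.on('message', async (specifier) => {
    // Stand-in for an asynchronous resolve hook.
    const url = await Promise.resolve('file:///resolved/' + specifier);
    port.postMessage(url);      // queue the answer first
    Atomics.store(lock, 0, 1);  // then flip the flag
    Atomics.notify(lock, 0);    // and wake the waiting thread
  });
`;

const lock = new Int32Array(new SharedArrayBuffer(4));
const { port1, port2 } = new MessageChannel();
const worker = new Worker(workerSource, {
  eval: true,
  workerData: { lock, port: port2 },
  transferList: [port2],
});
worker.unref(); // the worker alone should not keep the process alive

// Hypothetical helper: a synchronous call backed by asynchronous work off-thread.
function resolveSync(specifier) {
  Atomics.store(lock, 0, 0);
  port1.postMessage(specifier);
  // Blocks this thread until the flag changes (or 5 seconds pass).
  if (Atomics.wait(lock, 0, 0, 5000) === 'timed-out') {
    throw new Error('worker did not respond');
  }
  // The answer was queued before the flag was set, so it can be read synchronously.
  return receiveMessageOnPort(port1).message;
}

console.log(resolveSync('foo.mjs'));
```

The caller never yields to its event loop while waiting, which is what would let an API such as `import.meta.resolve()` return a value rather than a promise.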

JakobJingleheimer · 2022-10-19T21:21:34Z

> Hi! Is there an estimated date for completing this thread?
>
> This thread is blocking #43772, which is needed for
>
> https://github.com/yarnpkg/berry/discussions/4044#discussioncomment-2740697
>
> Asking because it looks like a long-running thread (currently 4 months).

Hiya! Please look right above your post to find the active WIP PR link (in the fancy GitHub callout), which was updated recently.

JakobJingleheimer · 2022-10-21T20:53:37Z

I was chatting with Anna earlier today, and she mentioned the CPU and memory cost of the 2nd thread is non-trivial—effectively doubling node's basic footprint.

Is that something we've already considered? Do we know of any way to mitigate that?

bmeck · 2022-10-21T21:07:19Z

If you use --loader on a single-threaded application, that is somewhat true, but it shouldn't be 2x. Plenty of stuff like V8 intrinsics, C++ code, etc. is not recreated. If the loader is shared amongst worker threads, it can actually be a net gain. Overall, the ability to intercept CJS and being opt-in seems fine. If we need to save more, there are ways to do so that haven't been done, since currently the worker thread impl does spin up a full node env.

mcollina · 2022-10-23T18:23:26Z

The way this issue is framed implies that import loaders are moved off thread. In other words, only users who deliberately opt in would use a different thread. However, this is not the direction the work went, and all ESM loading logic is being moved. I spoke to quite a few people over time about this, and this was not brought up before today.

This is a significant blocker for me as Node.js is often used in environments that are constrained by memory. At the bare minimum we should investigate:

- how much more memory is needed?
- how much more latency will this add?
- can we avoid it, i.e. only move user-provided code off thread?
- will this memory cost disappear or will the thread be kept around?

GeoffreyBooth · 2022-10-23T20:33:38Z

I agree that we should find answers to those questions. I would hold off on adding it to the TSC agenda until we have those answers.

JakobJingleheimer · 2022-10-23T20:36:03Z

Edit: I believe it was always our intention to get those numbers before landing.

> how much more memory is needed?

This does not seem to have a straightforward answer. I can't truly measure it until the implementation is runnable (currently blocked by a V8 error). Preliminary results from our experiment via /usr/bin/time -l are:

| metric | baseline | off-threaded |
| --- | --- | --- |
| maximum resident set size | 39174144 | 54509568 |
| average shared memory size | 0 | 0 |
| average unshared data size | 0 | 0 |
| average unshared stack size | 0 | 0 |
| page reclaims | 2581 | 3517 |
| page faults | 0 | 0 |
| swaps | 0 | 0 |
| block input operations | 0 | 0 |
| block output operations | 0 | 0 |
| messages sent | 0 | 0 |
| messages received | 0 | 0 |
| signals received | 0 | 0 |
| voluntary context switches | 1 | |
| involuntary context switches | 67 | 134 |
| instructions retired | 280244491 | 550971942 |
| cycles elapsed | 101541186 | 203115216 |
| peak memory footprint | 15487232 | 28354496 |

This very rough comparison suggests an additional peak memory footprint of 12,867,264 bytes (~83% increase).
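
As a quick check of that arithmetic, the sketch below recomputes the increase from the figures reported above. The `increase` helper and the `samples` object are hypothetical names introduced only for this illustration; the numbers are copied from the measurements in this comment.

```javascript
// Figures copied from the /usr/bin/time -l results above: [baseline, off-threaded].
const samples = {
  'maximum resident set size': [39174144, 54509568],
  'peak memory footprint': [15487232, 28354496],
};

// Hypothetical helper: absolute and relative growth between two measurements.
function increase(baseline, offThreaded) {
  const delta = offThreaded - baseline;
  return { delta, percent: (delta / baseline) * 100 };
}

for (const [metric, [baseline, offThreaded]] of Object.entries(samples)) {
  const { delta, percent } = increase(baseline, offThreaded);
  console.log(`${metric}: +${delta} (${percent.toFixed(1)}%)`);
}
```

By the same calculation, the maximum resident set size grows by roughly 39%, so the size of the relative increase depends heavily on which metric is used.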

> how much more latency will this add?

Preliminary results suggest there's a small initial cost when the worker initialises, but in terms of latency incurred for the work off-loaded, there's effectively none (nanoseconds).

> can we avoid it, i.e. only move user-provided code off thread?

I think this is contrary to the goals we're attempting to achieve with a separate thread (but maybe we only care about custom loaders).

> will this memory cost disappear or will the thread be kept around?

The thread gracefully terminates after the last call to "public" ESMLoader. However, in the tests done so far, that coincides with node itself gracefully terminating. We could specifically test keeping node around for a while after all the ESM / loader stuff to see if the thread terminates. I'm not sure how best to go about that: worker.once('exit') does not trigger (and also, I fear if it did work, it would prevent/undo the worker.unref(), leading to false results).

GeoffreyBooth · 2022-10-23T21:53:45Z

There's also a potential benefit in moving non-custom loading off-thread as it would protect internals from prototype pollution (I think). That would argue that we should make this same refactor for CommonJS too.
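
To illustrate the isolation being referred to, here is a small sketch showing that a prototype modified inside a worker thread is not visible in the spawning thread, because each worker has its own set of globals. It does not exercise Node's actual loader internals; `demoIsolation` and the `polluted` property are hypothetical names.

```javascript
const { Worker } = require('node:worker_threads');
const { once } = require('node:events');

// Hypothetical demo: pollute Object.prototype in a worker and compare both sides.
async function demoIsolation() {
  const worker = new Worker(`
    // Simulates code in the loaders thread tampering with a shared prototype.
    Object.prototype.polluted = true;
    require('node:worker_threads').parentPort.postMessage(({}).polluted);
  `, { eval: true });

  const [seenInWorker] = await once(worker, 'message');
  // The spawning thread has its own Object.prototype, which was not touched.
  return { seenInWorker, seenInMain: ({}).polluted };
}

demoIsolation().then(({ seenInWorker, seenInMain }) => {
  console.log({ seenInWorker, seenInMain });
});
```

The same boundary applies in the other direction: pollution introduced by application code on the main thread would not reach code running in the separate thread.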

aduh95 · 2022-10-23T23:31:51Z

> This is a significant blocker for me as Node.js is often used in environments that are constrained by memory. At the bare minimum we should investigate:
>
> - how much more memory is needed?
> - how much more latency will this add?
> - can we avoid it, i.e. only move user-provided code off thread?
> - will this memory cost disappear or will the thread be kept around?

@mcollina I'm not convinced we can answer those questions before we have a fully working implementation; without data, we can only make assumptions, and I wouldn't want us to draw a conclusion over possibly baseless assumptions.

mcollina · 2022-10-24T07:09:11Z

Absolutely! I'm concerned about the addition to our startup memory footprint, as this matters for some of our use cases.

I'm flagging that this might be problematic and I was surprised because it was not mentioned in the main text of the issue.

A few more questions:

- if a new Worker thread is spawned in the lifetime of the application, will this create another thread to load ESM?
- what about dynamic import? Will it need re-spawning the thread?

JakobJingleheimer · 2022-10-24T07:59:18Z

> A few more questions:
>
> - if a new Worker thread is spawned in the lifetime of the application, will this create another thread to load ESM?
> - what about dynamic import? Will it need re-spawning the thread?

In both cases, (in the current design/implementation) only if the "loaders" worker is not around (otherwise it will be re-used).

mcollina · 2022-10-24T08:27:35Z

> In both cases, (in the current design/implementation) only if the "loaders" worker is not around (otherwise it will be re-used).

So there is only one loaders worker for all worker threads created by Node.js?

targos · 2022-10-24T08:32:27Z

No, there's a separate loaders worker for each worker thread.

Flarna · 2022-10-24T08:33:15Z

I assume it should be possible to configure loader hooks per worker thread, so this may complicate the design.

I'm also a bit skeptical of having a single loader thread for all workers, as this seems to allow "leaking" data between workers which should be isolated. This single loader worker would also be a parallelism blocker.

mhdawson · 2022-11-01T16:02:45Z

The comments from @targos and @JakobJingleheimer seem contradictory?

cspotcode · 2022-11-01T19:03:23Z

There are 3 different threading models being considered, that I'm aware of.

JakobJingleheimer · 2022-11-01T22:24:33Z

Sorry, I think #43658 (comment) is the intended behaviour (the current PR may not achieve that yet). The rationale being that workers are intended to be isolated, so their loaders' state should also be isolated/fresh.

JakobJingleheimer · 2022-11-13T22:00:06Z

Quick update from our recent team meeting: We invited Gil Tayar (author/maintainer of several pertinent libraries) to discuss spawning a dedicated loaders thread per user-land thread, and he noted that it would add enormous complexity for library authors (on top of the extra complexity on node's side). We decided that, for an initial implementation, it would be better to use a single loaders thread shared by all user-land threads and add caveats to the relevant sections of the docs. If there is sufficient appetite, we can subsequently add a configuration option (perhaps to the Worker constructor) to spawn dedicated loaders threads.

mcollina · 2022-11-14T10:42:13Z

Was there any progress on not spawning any thread if no custom loaders are defined?

JakobJingleheimer · 2022-11-15T07:21:02Z

Yep! nodejs/loaders#118 (comment) and I believe this should be logistically/technically possible.
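
One way to picture the behaviour discussed here (no thread without custom loaders, and re-use of an existing thread otherwise) is a lazily created singleton. The sketch below is only an illustration under that assumption, not the implementation in the PR; `createLoadersHost`, `idleSource`, and `./my-loader.mjs` are hypothetical.

```javascript
const { Worker } = require('node:worker_threads');

// Placeholder worker body: stays alive waiting for messages and does nothing else.
const idleSource =
  "require('node:worker_threads').parentPort.on('message', () => {});";

// Hypothetical factory: `customLoaders` is the list of user-supplied loader specifiers.
function createLoadersHost(customLoaders) {
  let worker = null;
  return {
    get spawned() {
      return worker !== null;
    },
    getWorker() {
      // No custom loaders: nothing needs to run off-thread, so spawn nothing.
      if (customLoaders.length === 0) return null;
      if (worker === null) {
        worker = new Worker(idleSource, { eval: true });
        worker.unref(); // do not keep the process alive just for this thread
      }
      return worker; // later callers re-use the same instance
    },
  };
}

const withoutLoaders = createLoadersHost([]);
const withLoaders = createLoadersHost(['./my-loader.mjs']);
console.log(withoutLoaders.getWorker() === null);
console.log(withLoaders.getWorker() === withLoaders.getWorker());
```

With this shape, the memory cost of the extra thread is only paid by processes that actually register custom loaders.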

JakobJingleheimer added the feature request Issues that request new features to be added to Node.js. label Jul 2, 2022

JakobJingleheimer self-assigned this Jul 2, 2022

JakobJingleheimer added esm Issues and PRs related to the ECMAScript Modules implementation. loaders Issues and PRs related to ES module loaders labels Jul 2, 2022

JakobJingleheimer removed the feature request Issues that request new features to be added to Node.js. label Jul 2, 2022

JakobJingleheimer added this to the ESM Loaders: CJS parity milestone Jul 24, 2022

GeoffreyBooth mentioned this issue Jul 30, 2022

esm: add --import flag #43942

Merged

1 task

JakobJingleheimer mentioned this issue Sep 18, 2022

Move ESM loaders off-thread #44710

Merged

7 tasks

JakobJingleheimer changed the title ~~esm, loaders: move to own thread~~ esm, loader: move to own thread Sep 18, 2022

GeoffreyBooth mentioned this issue Sep 27, 2022

feat(esm): leverage loaders when resolving subsequent loaders #43772

Merged

mcollina added the tsc-agenda Issues and PRs to discuss during the meetings of the TSC. label Oct 23, 2022

mhdawson mentioned this issue Oct 24, 2022

Node.js Technical Steering Committee (TSC) Meeting 2022-10-26 nodejs/TSC#1298

Closed

mhdawson mentioned this issue Oct 31, 2022

Node.js Technical Steering Committee (TSC) Meeting 2022-11-02 nodejs/TSC#1302

Closed

mcollina removed the tsc-agenda Issues and PRs to discuss during the meetings of the TSC. label Nov 1, 2022

lemanschik mentioned this issue Nov 29, 2022

Verify NJS 19,1 import require flags lemanschik/modules#2

Closed

38 tasks

aduh95 closed this as completed in #44710 Apr 13, 2023

Flarna mentioned this issue May 29, 2024

Module Hooks cannot be registered from worker thread in 22.2.0+ #53182

Closed

GeoffreyBooth mentioned this issue May 30, 2024

Module customization hooks and worker threads nodejs/TSC#1566

Open
