Remove IdMap from source file. #9257

farmaazon · 2024-03-04T10:09:33Z

The id-map grows with the code rapidly, and even reducing its size won't help much; still, the id-map tends to take 100x more space than the code it describes. But perhaps we don't need to store them in file at all.

Main idea

Instead of storing id-map in source file, we could exchange it using Language Server API.

The id-map purpose is to make an identification of particular AST nodes, what in turn is needed to:

Assign node metadata (like position, color, etc.) and - in the future - widget metadata (like which widget was directly picked by the user).
Remove expression UUIDs from metadata section of a source file #10182 identify expressions in messages between GUI and Engine (where visualization is attached, what is the type of expression, etc.) Because of execution contexts and visualizations, these identifications should be persistent.

The second point do not require AST IDs persistence, while the first point may be handled by storing direct spans instead of IDs in the source file.

Further ideas and optimizations

To avoid initial exchange and reduce id-map size, we could assume some deterministic way of assigning IDs to nodes without id-map entry, for example: for every node with assigned ID from map, we assign increments of this ID to its children in DFS order (of course, if we meet another node with already assigned ID, we skip the entire subtree).
or, we could resign from exchanging IDs, and just add list of span pairs (A, B) meaning, that old code's span A's identity should be passed to new code's span B. Both parties then should manage updating their information about visualizations, execution stacks etc.

Gui Tasks

Give feedback

Add a draft title or issue reference here
Options

Engine Tasks

Give feedback

Remove expression UUIDs from metadata section of a source file #10182

0 of 5

-gui -language-server
Options

JaroslavTulach · 2024-03-05T10:47:55Z

The Truffle instrumentation is built around SourceSectionFilter, not UUIDs. The filter uses ranges (offset, line, column) and tags. It would be great if we adhered more closely to the capabilities offered by Truffle when eliminating the IdMap usage.

Let's assume the following is implemented: Investigate hosting yjs server within GraalJS #7954

Then the AST is hosted in the same process as the engine and its instrumentation. Assuming y.js is efficient in synchronizing AST modifications, we don't need UUIDs to keep element identities. The AST elements themselves represent identity. Visualizations shall become part of the AST - attaching/removing them modifies some attribute of clearly identified AST element. When there is an AST change that modifies the source code we need to traverse the AST and recompute ranges (offset, line, column) of elements that require instrumentation. A change in the source code always comes with new ranges. All of that is done inside of a single process without any need for network communication via a special protocol.

Opportunities with visualizations: Y.js system offers concept of subdocuments - lazily loaded entities used for on demand synchronization. We could use them for distributing visualizations across the peers. IDEs would annotate an AST element as eligible for obtaining visualization. Engine would compute the visualization and attach it as a subdocument. IDEs interested in that visualization would load it and observe its changes.

Longer term future vision: with the AST being in the same process as Truffle we should make a connection between the AST elements and Truffle nodes and avoid reparsing the whole .enso file when only a subtree is changed.

JaroslavTulach · 2024-03-28T14:00:08Z

I have rephrased the strategic vision in a separate document. An essential part of it still remains the sharing of ASTs among peers via y.js server.

4e6 · 2024-06-11T11:18:19Z

#10182 will add an optional IdMap parameter in the text/applyEdit request.

I propose to simplify the serialized IdMap elements format from the current

[[{"index":{"value":0},"size":{"value":4}},"bfae92f7-07d0-4886-b37b-a26e17ee943f"],...]

to an array of 3 elements: index, size and uuid

[[0,4,"bfae92f7-07d0-4886-b37b-a26e17ee943f"],...]

farmaazon · 2024-06-11T11:24:11Z

#10182 will add an optional IdMap parameter in the text/applyEdit request.

I propose to simplify the serialized IdMap elements format from the current
[[{"index":{"value":0},"size":{"value":4}},"bfae92f7-07d0-4886-b37b-a26e17ee943f"],...]
to an array of 3 elements: index, size and uuid
[[0,4,"bfae92f7-07d0-4886-b37b-a26e17ee943f"],...]

I'm ok, but perhaps we should support reading both formats for some time.

JaroslavTulach · 2024-06-11T13:24:08Z

I propose to simplify the serialized IdMap elements format from the current
to an array of 3 elements: index, size and uuid
[[0,4,"bfae92f7-07d0-4886-b37b-a26e17ee943f"],...]

the format change has been tracked as Compress enormous metadata section by 50% while making it friendlier #7989

I'm ok, but perhaps we should support reading both formats for some time.

Yes, being able to read previous node positions in new IDE is essential.

4e6 · 2024-06-11T15:30:07Z

I'm not going to change the parser. I was talking about the JSON-RPC text/applyEdit parameter

4e6 · 2024-06-17T09:24:32Z

#10283 adds an optional idMap parameter to text/applyEdits that overrides/complements the IdMap in the file

interface TextApplyEditParameters {
  /** The file edit. */
  edit: FileEdit;

  /**
   * A flag indicating whether we should re-execute the program after applying
   * the edit. Default value is `true`, indicating the program should be
   * re-executed.
   */
  execute?: boolean;

  /**
   * An identifiers map associated with this file as an array of
   * index, length, uuid triples. The old id map format that was used in the
   * source file is also supported.
   */
  idMap?: [number, number, UUID][];
}

@farmaazon the next step could be to keep in the file only the IDs that are used in the second metadata line (with node positions, etc.) and send the rest of the IDs with the text/applyEdit request. This should reduce the size of the IdMap section in the file metadata. If you're ok with the idea, I'll implement it as a followup PR.

farmaazon · 2024-06-17T09:58:03Z

@farmaazon the next step could be to keep in the file only the IDs that are used in the second metadata line (with node positions, etc.) and send the rest of the IDs with the text/applyEdit request. This should reduce the size of the IdMap section in the file metadata. If you're ok with the idea, I'll implement it as a followup PR.

Looks good, I'm ready to help with ydoc-server code if needed.

enso-bot · 2024-06-24T17:06:36Z

Dmitry Bushev reports a new STANDUP for today (2024-06-24):

Progress: Started working on storing the reduced IdMap in the file. Implemented the IdMap diff. Implemented sending IdMap in the applyEdit request. It should be finished by 2024-06-29.

Next Day: Next day I will be working on the #9257 task. Continue working on the task

enso-bot · 2024-06-25T16:40:13Z

Dmitry Bushev reports a new STANDUP for today (2024-06-25):

Progress: Playing with different usage cases. Debugged and fixed the issue with the metadata recovery. Fixed the issue with the broken IdMap between the project restarts. Undrafted the PR It should be finished by 2024-06-29.

Next Day: Next day I will be working on the #9257 task. Continue working on the task

farmaazon added d-hard Difficulty: significant prior knowledge required p-medium Should be completed in the next few sprints x-design -language-server -gui labels Mar 4, 2024

github-project-automation bot added this to Issues Board Mar 4, 2024

github-project-automation bot moved this to ❓New in Issues Board Mar 4, 2024

farmaazon mentioned this issue Mar 4, 2024

Attached workflow breaks in GUI2 #9198

Closed

enso-bot bot mentioned this issue Mar 6, 2024

Prefix autoscoped constructors with .. #9275

Closed

This was referenced Mar 12, 2024

Compress enormous metadata section by 50% while making it friendlier #7989

Closed

Language Server 2.0 #5419

Open

Investigate hosting yjs server within GraalJS #7954

Closed

JaroslavTulach mentioned this issue Apr 30, 2024

Discrepancy in parser metadata handling between LS and GUI #6718

Closed

JaroslavTulach added x-on-hold and removed x-on-hold labels May 14, 2024

JaroslavTulach assigned 4e6 May 14, 2024

JaroslavTulach moved this from ❓New to 📤 Backlog in Issues Board May 14, 2024

4e6 mentioned this issue Jun 5, 2024

Remove expression UUIDs from metadata section of a source file #10182

Closed

hubertp moved this from 📤 Backlog to 🔧 Implementation in Issues Board Jun 11, 2024

4e6 moved this from 🔧 Implementation to 👁️ Code review in Issues Board Jun 14, 2024

4e6 moved this from 👁️ Code review to ⚙️ Design in Issues Board Jun 15, 2024

4e6 mentioned this issue Jun 23, 2024

Persist a subset of IdMap #10347

Merged

3 tasks

4e6 moved this from ⚙️ Design to 🔧 Implementation in Issues Board Jun 24, 2024

4e6 moved this from 🔧 Implementation to 👁️ Code review in Issues Board Jun 26, 2024

mergify bot closed this as completed in #10347 Jul 8, 2024

mergify bot closed this as completed in b2c4559 Jul 8, 2024

github-project-automation bot moved this from 👁️ Code review to 🟢 Accepted in Issues Board Jul 8, 2024

JaroslavTulach mentioned this issue Jul 9, 2024

Initializing a full Language Server JSON protocol is slow #10452

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove IdMap from source file. #9257

Remove IdMap from source file. #9257

farmaazon commented Mar 4, 2024 •

edited by 4e6

Loading

Gui Tasks

Engine Tasks

JaroslavTulach commented Mar 5, 2024 •

edited

Loading

JaroslavTulach commented Mar 28, 2024

4e6 commented Jun 11, 2024

farmaazon commented Jun 11, 2024 •

edited

Loading

JaroslavTulach commented Jun 11, 2024

4e6 commented Jun 11, 2024

4e6 commented Jun 17, 2024

farmaazon commented Jun 17, 2024

enso-bot bot commented Jun 24, 2024

enso-bot bot commented Jun 25, 2024

Remove IdMap from source file. #9257

Remove IdMap from source file. #9257

Comments

farmaazon commented Mar 4, 2024 • edited by 4e6 Loading

Main idea

Further ideas and optimizations

Gui Tasks

Engine Tasks

JaroslavTulach commented Mar 5, 2024 • edited Loading

JaroslavTulach commented Mar 28, 2024

4e6 commented Jun 11, 2024

farmaazon commented Jun 11, 2024 • edited Loading

JaroslavTulach commented Jun 11, 2024

4e6 commented Jun 11, 2024

4e6 commented Jun 17, 2024

farmaazon commented Jun 17, 2024

enso-bot bot commented Jun 24, 2024

enso-bot bot commented Jun 25, 2024

farmaazon commented Mar 4, 2024 •

edited by 4e6

Loading

JaroslavTulach commented Mar 5, 2024 •

edited

Loading

farmaazon commented Jun 11, 2024 •

edited

Loading