Create IDE plugins for major editors #2821

astrojuanlu · 2023-07-18T16:12:12Z

We have evidence that users struggle when assigning catalog entries to node functions. For example, #2726:

when dataset names are strings, IDEs can't check that they actually match. I ran into this problem before when some code changes lead to the wrong dataset being used. When dataset names are variable names, this kind of issues will be caught sooner.

This is something that an IDE extension could help with. There is already a kedro-lsp extension (see #712 (comment)), but doesn't appear to be maintained anymore.

Such extension could help with other things, to be defined.

Evidence markers

Helps with retention of existing users; most users of this are IDE users and they're already Kedro users
Might help with visibility if it's featured on the marketplace (not sure)

astrojuanlu · 2023-08-01T22:36:17Z

PyCharm now supports LSP https://blog.jetbrains.com/platform/2023/07/lsp-for-plugin-developers/ via @datajoely

datajoely · 2023-08-15T18:31:22Z

@astrojuanlu excited to see this move ever so slightly forward.

The number one feature for me is the link between our "magic string" input/output/parameter references in node definitions and their YAML counterpart in the catalog. IDE users are used to ⌘ Command + Click symbols and jumping to their definition.

The pre-requisite for building this is to store the YAML line no/cursor position of the catalog entry at load-time.
I've held off building this myself (for a couple of years at least) since we the config loader was in flux and only now feels like it's reaching a stable future design.
- I think some of this metadate we'd need is hidden by the OmeagaConf.load method
- This also gets complicated when resolving the 'winning' key after environment resolution, potentially even more so once we take factories into account.
- If you look at the implementation for @limdauto's kedro-lsp the majority of the work is spent building a YAML scanner.

If we were to store the file/line number reference in the live catalog object we could do exciting things.

astrojuanlu · 2023-09-09T12:59:44Z

I had another user complain about hard navigation/autocomplete/typo detection for dataset names. Creating an IDE plugin as stated in this issue is of course one of the possible solutions, maybe others could be explored.

noklam · 2023-09-09T13:58:32Z

Maybe worth explore the kedro-lsp in 0.19

…

On Sat, 9 Sept 2023, 13:59 Juan Luis Cano Rodríguez, < ***@***.***> wrote: I had another user complain about hard navigation/autocomplete/typo detection for dataset names. Creating an IDE plugin as stated in this issue is of course one of the possible solutions, maybe others could be explored. — Reply to this email directly, view it on GitHub <#2821 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AELAWLZVWMWY3WNUUN2K2J3XZRR4VANCNFSM6AAAAAA2OUGNHI> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

astrojuanlu · 2023-12-07T11:32:11Z

Nice work from the DVC folks on their VSCode extension

astrojuanlu · 2023-12-07T11:46:02Z

The pre-requisite for building this is to store the YAML line no/cursor position of the catalog entry at load-time.

For reference, ruamel.yaml provides the cursor information:

In [11]: from ruamel.yaml import YAML

In [12]: yaml = YAML()

In [13]: data = yaml.load("""
    ...:      # testing line and column based on SO
    ...:      # http://stackoverflow.com/questions/13319067/
    ...:      - key1: item 1
    ...:        key2: item 2
    ...:      - key3: another item 1
    ...:        key4: another item 2
    ...:         """)

In [14]: data
Out[14]: [{'key1': 'item 1', 'key2': 'item 2'}, {'key3': 'another item 1', 'key4': 'another item 2'}]

In [15]: type(data)
Out[15]: ruamel.yaml.comments.CommentedSeq

In [16]: data[0].lc
Out[16]: LineCol(3, 7)

In [17]: type(data[0])
Out[17]: ruamel.yaml.comments.CommentedMap

However:

I think some of this metadate we'd need is hidden by the OmeagaConf.load method

Indeed, OmegaConf.load I/O is coupled with PyYAML unless an already loaded object is directly provided:

https://github.com/omry/omegaconf/blob/fd730509ef10a074f97b1738c630720157ceeeab/omegaconf/omegaconf.py#L183-L193

I think this supports the idea of separating the loading from the resolving part as proposed in #2481

astrojuanlu · 2024-01-12T18:58:46Z

Similar case of trying to parse YAML and give good error messages:

The initial version of rattler-build used serde & serde_yaml to parse the recipe. That worked OK, but was limited because we could not get great error messages out of serde_yaml, since it doesn't report the locations (line & column) of the encountered issues.

https://prefix.dev/blog/rattler_build_a_new_parser

astrojuanlu · 2024-01-15T11:57:23Z

However, the primary downside for me was the requirement to set up configurations using YAML.I would prefer it to be closed within a Python script because editor completion.

https://www.reddit.com/r/kaggle/comments/18j4c0e/what_pipeline_libraries_do_you_recommend_for/?utm_source=share&utm_medium=web2x&context=3

astrojuanlu · 2024-02-06T12:19:55Z

A user asking about this https://linen-slack.kedro.org/t/16374428/hey-team-kedro-is-there-any-vscode-extension-through-which-w#4844d74c-e02a-48da-9a7f-db4fea33703a

inigohidalgo · 2024-02-06T12:52:58Z

Just saw this issue linked from the slack conversation.

This would solve my number one problem with kedro as a user. I raised this in a user interview some months back, and also in various conversations: the disconnect between string objects in python and the objects which those strings represent within kedro. Refactoring is a huge hassle whenever it involves changing the shape of pipelines as there always end up being orphan datasets.

So this issue has a huge upvote from me.

noklam · 2024-03-08T09:49:45Z

There are many technical complexity for this, i.e. dynamic generated config/catalog

dataset patterns
variable interpolation
resolver

Nonetheless, I believe it's a huge improvement and we don't necessary wait until we have a full solution. I spent quite a bit of time to look at the original kedro-lsp and finally make something that runs in 0.19.x. I'm pretty excited about this.

datajoely · 2024-03-08T10:00:49Z

The tool tip is awesome here! I guess dataset factories would work the same, I still think it would be nice to have an equivalent of dbt compile where you could jump to the resolved config.

noklam · 2024-03-08T13:38:41Z

IDE Plugins are very broad, I have seen a few things mentions here and recalled some discussion in the past:

parameters editor(?) - example: DVC plugin
Autocompletion, type hints, go-to definition ("click on YAML")
- Complexity: uncertainty about OmegaConf.load and PyYAML
TBD

IDE support:

VSCode (seems to be the easiest)
PyCharm suport LSP
Notebook - There are jupyter-lsp, is it possible to make some features work on notebook too?
terminal (niche but possible)

Backward compatibility: I am not sure yet how to make VSCode plugin that backward compatible, or maybe we shouldn't care this so soon because this can be a good reason to drive more 0.19 adoption.

User Research: (??) - there are many possibilities and unknown here

datajoely · 2024-03-08T13:56:28Z

Honestly I think VS Code is the right call for any initial MVP - you can just point to DVC and Databricks making the same call

astrojuanlu · 2024-03-08T14:38:04Z

Yeah the title of this issue is too broad. Let's start with an LSP for Kedro, basically bringing kedro-lsp back to life and working with Kedro 0.19 + docs for how to set it up on VSCode. That's already a massive usability boost for folks.

@noklam shall we open a separate issue for it?

noklam · 2024-03-08T16:41:06Z

#3691 - let's move the LSP specific discussion here

noklam · 2024-04-22T17:51:30Z

https://blog.jetbrains.com/platform/2023/07/lsp-for-plugin-developers/

I did some research today and found that in theory Pycharm support LSPs, but it's only limited to paid users which is disappointing.

datajoely · 2024-04-23T07:55:47Z

Disappointing but I think it's still a great step especially if we target an enterprise user persona

astrojuanlu · 2024-05-13T08:08:48Z

VSCode plugin exists already, see #3691

astrojuanlu · 2024-07-21T21:51:33Z

Considering this done for now, if we ever decide to target other editors we can model those after the existing VS Code extension.

astrojuanlu added the Issue: Feature Request New feature or improvement to existing feature label Jul 18, 2023

astrojuanlu added this to Kedro Framework Jul 18, 2023

github-actions bot mentioned this issue Aug 18, 2023

Monthly issue metrics report #2950

Closed

This was referenced Aug 31, 2023

Improve DataCatalog and ConfigLoader with autocompletion and meaningful representation when it get printed #1721

Closed

[DRAFT] - Refactor the OmegaConfigLoader load_and_merge_dir_config method #2591

Closed

astrojuanlu mentioned this issue Dec 7, 2023

[Proposal] - Replace ConfigLoader with ConfigLoader and ConfigResolver #2481

Closed

merelcht added this to the Create IDE plugins for major editors milestone Jan 12, 2024

noklam mentioned this issue Mar 8, 2024

Kedro Language Server for VSCode to make navigation easier #3691

Closed

astrojuanlu added the Type: Parent Issue label Apr 8, 2024

noklam mentioned this issue May 21, 2024

Add navigation support for OmegaConfig Syntax kedro-org/vscode-kedro#8

Open

astrojuanlu closed this as completed Jul 21, 2024

github-project-automation bot moved this to Done in Kedro Framework Jul 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create IDE plugins for major editors #2821

Create IDE plugins for major editors #2821

astrojuanlu commented Jul 18, 2023 •

edited by yetudada

Loading

astrojuanlu commented Aug 1, 2023

datajoely commented Aug 15, 2023

astrojuanlu commented Sep 9, 2023

noklam commented Sep 9, 2023 via email •

edited

Loading

astrojuanlu commented Dec 7, 2023

astrojuanlu commented Dec 7, 2023

astrojuanlu commented Jan 12, 2024

astrojuanlu commented Jan 15, 2024

astrojuanlu commented Feb 6, 2024

inigohidalgo commented Feb 6, 2024

noklam commented Mar 8, 2024 •

edited

Loading

datajoely commented Mar 8, 2024

noklam commented Mar 8, 2024

datajoely commented Mar 8, 2024

astrojuanlu commented Mar 8, 2024

noklam commented Mar 8, 2024

noklam commented Apr 22, 2024

datajoely commented Apr 23, 2024

astrojuanlu commented May 13, 2024

astrojuanlu commented Jul 21, 2024

Create IDE plugins for major editors #2821

Create IDE plugins for major editors #2821

Comments

astrojuanlu commented Jul 18, 2023 • edited by yetudada Loading

Evidence markers

astrojuanlu commented Aug 1, 2023

datajoely commented Aug 15, 2023

astrojuanlu commented Sep 9, 2023

noklam commented Sep 9, 2023 via email • edited Loading

astrojuanlu commented Dec 7, 2023

astrojuanlu commented Dec 7, 2023

astrojuanlu commented Jan 12, 2024

astrojuanlu commented Jan 15, 2024

astrojuanlu commented Feb 6, 2024

inigohidalgo commented Feb 6, 2024

noklam commented Mar 8, 2024 • edited Loading

datajoely commented Mar 8, 2024

noklam commented Mar 8, 2024

datajoely commented Mar 8, 2024

astrojuanlu commented Mar 8, 2024

noklam commented Mar 8, 2024

noklam commented Apr 22, 2024

datajoely commented Apr 23, 2024

astrojuanlu commented May 13, 2024

astrojuanlu commented Jul 21, 2024

astrojuanlu commented Jul 18, 2023 •

edited by yetudada

Loading

noklam commented Sep 9, 2023 via email •

edited

Loading

noklam commented Mar 8, 2024 •

edited

Loading