Add ability to handle magic commands in Jupyter notebook #5030

dhruvmanila · 2023-06-12T16:17:25Z

Currently, if a cell contains any magic command, the whole cell is ignored even if it also contains valid Python code. This might lead to false positives (e.g., undefined symbol), as some code is intentionally ignored for linting.

At a high level, what black does is the following:

1. Using IPython's transformer, convert the magic commands into the respective function calls (%% ... will be converted to get_ipython().run_cell_magic)
2. Replace cell magics with tokens (string values that are valid Python code) and keep track of such replacements
3. Replace line magics in the same way as mentioned in (2)
4. Return the transformed code along with the replacements
5. At the end, use the replacements to get back the original content
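
The mask-and-restore idea in the steps above can be sketched roughly as follows. This is a minimal illustration, not black's code: it skips the IPython transformer step and detects magics with a simple regex, and the names `MAGIC_LINE`, `mask_magics`, and `unmask_magics` are hypothetical.

```python
import re

# Assumption: a magic is any line whose first non-blank character is % or !.
# This also matches the %%name header of a cell magic. It would misfire on a
# continuation line that starts with a modulo operator, and a real tool must
# also skip cells whose cell magic changes the language (e.g. %%bash).
MAGIC_LINE = re.compile(r"^(\s*)([%!].*)$")


def mask_magics(source):
    """Replace each magic line with a string literal and record the mapping."""
    replacements = {}
    masked_lines = []
    for line in source.split("\n"):
        match = MAGIC_LINE.match(line)
        if match is None:
            masked_lines.append(line)
            continue
        indent, magic = match.groups()
        # A quoted string is a valid Python expression statement, so the
        # masked cell can be parsed by ordinary Python tooling.
        token = f'"__magic_{len(replacements)}__"'
        replacements[token] = magic
        masked_lines.append(indent + token)
    return "\n".join(masked_lines), replacements


def unmask_magics(masked, replacements):
    """Put the original magic commands back in place of their tokens."""
    for token, magic in replacements.items():
        masked = masked.replace(token, magic)
    return masked
```

A tool would run its formatter or linter on the masked text and call `unmask_magics` on the result. The sketch assumes the tokens do not already occur in the user's code; a real implementation would need to guarantee that.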

Reference implementation in black: https://github.com/psf/black/blob/main/src/black/handle_ipynb_magics.py


dhruvmanila · 2023-06-14T16:33:17Z

Jupytext has a simpler implementation, although it relies on a lot of regexes: https://github.com/mwouts/jupytext/blob/main/jupytext/magics.py

henryiii · 2023-07-18T03:30:19Z

There might be an easier but dumber way to do this without invoking IPython, which I would assume would be slow. You could add a configurable list of "ignored" magics. The default could contain the built-in magics that don't have much effect: %%time, %%timeit, etc. But a user could override the list via configuration. Detecting them, I'd assume, is already mostly done, since it can ignore them?

This wouldn't cover magics that modify stuff or change the language, but it would cover a lot of common cases and is the situation where it is easiest to report violations and autofix.
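
As a rough illustration of this suggestion, here is a minimal sketch under stated assumptions. `DEFAULT_IGNORED_MAGICS` and `lintable_source` are hypothetical names, the default set contains only the two magics named above, and only cell magics on the first line of a cell are considered.

```python
# Hypothetical default list; a user could pass their own set instead.
DEFAULT_IGNORED_MAGICS = frozenset({"time", "timeit"})


def lintable_source(cell, ignored=DEFAULT_IGNORED_MAGICS):
    """Return the Python code to lint for a cell, or None to skip the cell."""
    first, _, rest = cell.partition("\n")
    if not first.startswith("%%"):
        # No cell magic: lint the cell as it is.
        return cell
    parts = first[2:].split(None, 1)
    name = parts[0] if parts else ""
    if name in ignored:
        # Keep an empty line where the magic was so that reported line
        # numbers still match the original cell.
        return "\n" + rest
    # Unknown magic: it may change the language, so skip the whole cell.
    return None
```

For example, `lintable_source("%%time\nx = 1\n")` returns `"\nx = 1\n"`, while a `%%bash` cell returns `None` and is skipped.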

dhruvmanila · 2023-07-20T04:08:09Z

Hey, thanks for the suggestions. Yes, invoking IPython would be slow and we're not going that route. Instead, we'll use a new parser mode in which the parser will parse the magic commands at possible locations. New tokens and AST nodes are introduced which will be available only in that mode. Although the node isn't finalized, you can refer to astral-sh/RustPython-Parser#31.

dhruvmanila · 2023-07-26T02:11:49Z

The solution which we've implemented is to integrate it in the parser directly. The following changes were made to the parser to accommodate the requirement:

1. A new Jupyter mode was added to allow lexing/parsing the magic commands only for Jupyter Notebooks and give an error otherwise.
2. The lexer will recognize the magic commands and emit the MagicCommand token wherever it's allowed¹, i.e., as a standalone statement (%matplotlib inline) or in an assignment statement (dir = !pwd).
   - Lex Jupyter line magic with Mode::Jupyter RustPython-Parser#23
   - Lex Jupyter Magic in assignment value position RustPython-Parser#30
3. Two new nodes StmtLineMagic and ExprLineMagic are added for the parser.
   - Add line magic stmt and expr AST nodes RustPython-Parser#31
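
The two allowed positions described above can be illustrated with a small line-based sketch. This is a toy model in Python, not the Rust lexer: the `MagicCommand` tuple, `find_magic`, and the `jupyter_mode` flag are hypothetical names, and only the `%` and `!` escapes from the examples are handled.

```python
import re
from typing import NamedTuple, Optional


class MagicCommand(NamedTuple):
    kind: str               # "%" for a magic, "!" for a shell command
    value: str              # text after the escape character
    target: Optional[str]   # assignment target, or None for a statement


# Standalone statement position, e.g. "%matplotlib inline".
STATEMENT = re.compile(r"^\s*(?P<kind>[%!])(?P<value>.*)$")
# Assignment value position, e.g. "dir = !pwd".
ASSIGNMENT = re.compile(
    r"^\s*(?P<target>[A-Za-z_]\w*)\s*=\s*(?P<kind>[%!])(?P<value>.*)$"
)


def find_magic(line, jupyter_mode):
    """Return the magic command on this line, if it sits in an allowed position."""
    match = ASSIGNMENT.match(line) or STATEMENT.match(line)
    if match is None:
        return None
    if not jupyter_mode:
        # Outside the Jupyter mode the same text is reported as an error.
        raise SyntaxError(f"magic command outside Jupyter mode: {line!r}")
    groups = match.groupdict()
    return MagicCommand(groups["kind"], groups["value"].strip(), groups.get("target"))
```

Note that `x = 5 % 3` is not treated as a magic, because the `%` does not directly follow the `=`.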

References

All possible magic command related documentation is present on this page: https://ipython.readthedocs.io/en/stable/interactive/reference.html#interactive-use
Original implementation in IPython codebase: https://github.com/ipython/ipython/blob/main/IPython/core/inputtransformer2.py
sonar-python implementation: https://github.com/SonarSource/sonar-python/blob/master/python-frontend/src/main/java/org/sonar/python/api/IPythonGrammarBuilder.java

¹ The allowed magic token positions are determined directly from the IPython implementation of the same.

## Summary

This PR adds support for a stricter version of help end escape commands[^1] in the parser. By stricter, I mean that the escape tokens are only at the end of the command and there are no tokens at the start. This makes it difficult to implement in the lexer without a lot of lookahead or keeping track of previous tokens.

Now, as we're adding this in the parser, the lexer needs to recognize and emit a new token for `?`. So, a `Question` token is added which will be recognized only in `Jupyter` mode.

The conditions applied are the same as the ones in the original implementation in the IPython codebase (which is a regex):

* There can only be either 1 or 2 question mark(s) at the end
* The node before the question mark can be a `Name`, `Attribute`, `Subscript` (only with integer constants in slice position), or any combination of the 3 nodes.

## Test Plan

Added test cases for various combinations of the possible nodes in the command value position and updated the snapshots.

fixes: #6359
fixes: #5030 (This is the final piece)

[^1]: #6272 (comment)
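
The two conditions listed in the summary can be approximated in Python with the standard `ast` module. This is only a sketch of the rule, not the parser change from the PR: `parse_help_end` and `_is_allowed` are hypothetical names, and it assumes Python 3.9+, where `Subscript.slice` holds the index expression directly.

```python
import ast


def _is_allowed(node):
    """Accept Name, Attribute, and Subscript (integer constant index) chains."""
    if isinstance(node, ast.Name):
        return True
    if isinstance(node, ast.Attribute):
        return _is_allowed(node.value)
    if isinstance(node, ast.Subscript):
        index = node.slice
        # type(...) is int excludes booleans, which are a subclass of int.
        is_int = isinstance(index, ast.Constant) and type(index.value) is int
        return is_int and _is_allowed(node.value)
    return False


def parse_help_end(line):
    """Return (expression, number_of_question_marks), or None if not allowed."""
    stripped = line.strip()
    body = stripped.rstrip("?")
    marks = len(stripped) - len(body)
    body = body.strip()
    # Only one or two question marks are allowed, and only at the end.
    if marks not in (1, 2) or not body:
        return None
    try:
        tree = ast.parse(body, mode="eval")
    except SyntaxError:
        return None
    return (body, marks) if _is_allowed(tree.body) else None
```

For example, `parse_help_end("a.b[0].c??")` returns `("a.b[0].c", 2)`, while `"foo???"`, `"?foo"`, `"f(x)?"`, and `"a['k']?"` are all rejected.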


dhruvmanila added the core (Related to core functionality) label Jun 12, 2023

dhruvmanila mentioned this issue Jun 19, 2023: Complete Jupyter notebook integration #5188 (Closed, 26 tasks)

dhruvmanila self-assigned this Jun 26, 2023

dhruvmanila mentioned this issue Jun 27, 2023: Feature request: support Cython and Jupyter magic? #1079 (Closed)

dhruvmanila mentioned this issue Jul 6, 2023: Use Jupyter mode while parsing Notebook files #5552 (Merged)

This was referenced Aug 2, 2023:
- Support help end escape command with priority #6272 (Merged)
- Add support for help end IPython escape commands #6358 (Merged)

dhruvmanila closed this as completed in #6358 Aug 9, 2023
