stack_data, executing, and pure_eval #748

alexmojaki · 2020-06-27T18:14:53Z

Hi! I've recently created a few libraries that are helpful for extracting information from frames and tracebacks, and I think they could be very useful to sentry.

executing can identify the exact AST node being executed by a frame in many cases. It's always disappointed me that Python only points to a line in each frame in tracebacks, especially when the expression/statement at fault spans many lines. Sometimes it can be really hard to interpret. Currently executing is used in IPython (in master, unreleased) to highlight the node, here's what it looks like:

executing can also infer a function __qualname__ from a code object, meaning your traceback can say lorem/ipsum.py in MyClass.__init__ at line 123 instead of just __init__, which is much more informative.
pure_eval can safely evaluate certain AST nodes without side effects, so that if the source for a frame contains expressions like self.foo and bar[key] their values can often be shown alongside plain variables.
stack_data collects data from tracebacks for the purpose of formatting and displaying them. It uses executing and pure_eval and also provides a lot of functionality of its own. For example, in this frame:

there are lines from complex_filter which the user doesn't need to see, and stack_data knows how to filter those out by only collecting lines which belong in the current scope. I integratedstack_data into IPython which allowed removing a lot of flaky custom introspection code, fixing several bugs and making the code more maintainable.

stack_data identifies the locations of variables and evaluated expressions in the source lines. An intelligent UI could use this, for example, to allow the user to hover over a variable in source code to see its value rather than scroll through a list to find it.

Overall there are lots of possibilities. What would interest you? I'm willing to make a PR and make any necessary changes to the libraries.

The text was updated successfully, but these errors were encountered:

untitaker · 2020-06-28T15:29:04Z

Hey @alexmojaki, this seems like a nice library!

We want to avoid adding more dependencies than absolutely necessary, which is why you'll find we have no external dependencies beyond urllib3 and certifi.
I have strong concerns about the runtime overhead of this library, not only for the happy path when nothing is crashing but also when reporting a 500 to Sentry. Particularly concerning your idea to transfer the values of sub-expressions to sentry.

I'm only going to comment on the ideas you presented.

there are lines from complex_filter which the user doesn't need to see, and stack_data knows how to filter those out by only collecting lines which belong in the current scope.

Please keep in mind that you have no guarantee of the filename containing valid sourcecode, for example in Jinja templates that isn't the case (it's also one of the reasons we have refrained from adding syntax highlighting to Sentry, the other reason being laziness).

An intelligent UI could use this, for example, to allow the user to hover over a variable in source code to see its value rather than scroll through a list to find it.

We would have to establish how to send this kind of information to the server in a way that makes sense for other languages than Python.

I believe an initial PoC, of whichever feature you're thinking of implementing, should pull those libraries in as optional dependency only. before_send seems like a very cheap way to hook into the SDK since its hint object will contain exc_info and you have access to the entire payload that is sent to sentry.

Hope that helps,
Markus

alexmojaki · 2020-06-28T20:47:55Z

OK, if these are going to be optional dependencies, I'm going to avoid stack_data, as that would just make the code more complicated.

I've made PRs for a couple of the features mentioned. There's some finicky details I haven't handled - for example the tests currently assume that the optional libraries are installed. But you can see the general idea well enough for a PoC as you said. Before I polish things up, can you confirm if the concept is proven?

The feature that would interest me the most is the first one I listed - highlighting the executing node. This would require changes on the server side in order to render the information. Shall I just ignore that complication for now and make another PoC PR which adds the data to the frame and we can worry about rendering later?

untitaker · 2020-07-09T15:43:39Z

Shall I just ignore that complication for now and make another PoC PR which adds the data to the frame and we can worry about rendering later?

Yes, preferrably behind an integration like mentioned in #749. The server can handle superfluous attributes, sort of.

See #748

untitaker · 2020-07-15T10:30:45Z

@alexmojaki did I get it right if I say that pure_eval is py3-only?

alexmojaki · 2020-07-15T10:32:44Z

Yes.

untitaker · 2020-07-15T10:33:49Z

Ok cool. Just so you know you're in charge of creating the sentry-docs PRs. There should be one per integration so one for executing and one for pure_eval, unless you plan to refactor those further.

alexmojaki · 2020-07-18T11:49:52Z

I'm going to make a docs PR for pure_eval. I don't know how I can add an integration suggestion for it when the integration has three dependencies: pure_eval, executing, and asttokens.

For executing, I don't think there's much point in people using it unless the feature for highlighting the node range goes through. I've made a server side PR which I don't think I can complete on my own.

Related: #762 and #748

untitaker · 2020-07-19T11:55:48Z

Makes sense. I think pure-eval has most value anyway. Still unsure what to do about the server-side changes. We would probably need to review the exact protocol changes with others in the company.

Related: getsentry/sentry-python#748 getsentry/sentry-python#762 getsentry/sentry-python#763

github-actions · 2021-12-23T15:13:31Z

This issue has gone three weeks without activity. In another week, I will close it.

But! If you comment or otherwise update it, I will reset the clock, and if you label it Status: Backlog or Status: In Progress, I will leave it alone ... forever!

"A weed is but an unloved flower." ― Ella Wheeler Wilcox 🥀

This was referenced Jun 28, 2020

Use executing to infer code qualname #749

Merged

Extract additional expression values with pure_eval #750

Closed

untitaker added the Enhancement New feature or request label Jul 13, 2020

untitaker pushed a commit that referenced this issue Jul 13, 2020

Use executing to infer code qualname (#749)

5c34ead

See #748

This was referenced Jul 14, 2020

Add range of executing node to frames #761

Closed

Extract additional expression values with pure_eval #762

Merged

This was referenced Jul 18, 2020

Add setup.py extra for pure_eval #763

Merged

Add page for pure_eval integration getsentry/sentry-docs#1907

Merged

untitaker pushed a commit that referenced this issue Jul 19, 2020

Add setup.py extra for pure_eval (#763)

b117955

Related: #762 and #748

untitaker pushed a commit to getsentry/sentry-docs that referenced this issue Jul 22, 2020

Add page for pure_eval integration (#1907)

6067f53

Related: getsentry/sentry-python#748 getsentry/sentry-python#762 getsentry/sentry-python#763

vmarkovtsev mentioned this issue Aug 31, 2020

pure_eval misses the variable on the crash line #805

Closed

alexmojaki mentioned this issue Sep 1, 2020

stack_data, executing, and pure_eval rollbar/pyrollbar#350

Closed

alexmojaki mentioned this issue Jan 1, 2021

stack_data, executing, and pure_eval bugsnag/bugsnag-python#252

Open

alexmojaki mentioned this issue Jan 27, 2021

Related library: pure_eval lmfit/asteval#83

Closed

github-actions bot added the Status: Stale label Dec 23, 2021

github-actions bot closed this as completed Dec 30, 2021

sl0thentr0py reopened this Jan 21, 2022

sl0thentr0py added Status: Backlog and removed Status: Stale labels Jan 21, 2022

sl0thentr0py closed this as completed Jan 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stack_data, executing, and pure_eval #748

stack_data, executing, and pure_eval #748

alexmojaki commented Jun 27, 2020

untitaker commented Jun 28, 2020

alexmojaki commented Jun 28, 2020

untitaker commented Jul 9, 2020

untitaker commented Jul 15, 2020

alexmojaki commented Jul 15, 2020

untitaker commented Jul 15, 2020

alexmojaki commented Jul 18, 2020

untitaker commented Jul 19, 2020

github-actions bot commented Dec 23, 2021

stack_data, executing, and pure_eval #748

stack_data, executing, and pure_eval #748

Comments

alexmojaki commented Jun 27, 2020

untitaker commented Jun 28, 2020

alexmojaki commented Jun 28, 2020

untitaker commented Jul 9, 2020

untitaker commented Jul 15, 2020

alexmojaki commented Jul 15, 2020

untitaker commented Jul 15, 2020

alexmojaki commented Jul 18, 2020

untitaker commented Jul 19, 2020

github-actions bot commented Dec 23, 2021