tracing: Provide a GUI/Console-Based Visualization of Spans and Events #884

davidbarsky · 2020-08-04T17:50:12Z

Feature Request

Motivation

A decently common feature request that comes up in conversations is for tracing to provide some sort of visualization of spans and events in some sort of GUI. This would help tracing users better understand and debug their applications.

Proposal

At a high level, I can think of a few options:

Revive https://github.com/tokio-rs/console and build out a robust, terminal-based UI.
Build something similar to https://github.com/jonathanj/eliottree. In some sense, davidbarsky/tracing-tree is halfway there, but it'd need to operate over something over some intermediate representation of spans.
Direct users to something like Jaeger in a container with https://docs.rs/tracing-opentelemetry/0.5.0/tracing_opentelemetry/, but that might be a bit heavyweight.
Use pprof. This might be kinda easy!

Alternatives

Don't do this.

The text was updated successfully, but these errors were encountered:

hawkw · 2020-08-04T18:22:41Z

I'm strongly in favour of more options for visualization. I think that broadly, we have two categories of options:

Write our own thing. This would take a bunch of work, but it has the advantage that a new tool could be designed specifically for the types of data that tracing outputs.
Implement integrations with existing tools. This is probably much less work, and has the advantage that we don't have to implement any of the UI parts. That, in particular, might be good, since I personally have little to no experience with any kind of UI programming, whether it's GUI or console-based. However, the disadvantage of this is that no other system's data model will line up exactly with tracing's. Potentially, some fidelity is lost when the other system can't represent something that tracing emits, or when tracing doesn't record something the other system expects. Also, there are sometimes semantic differences: if we implement an integration with a system that's based on callstack sampling, it would make sense to represent spans to the other system as "stack frames". However, a tracing span is not exactly a stack frame — it might span multiple stack frames, be entered multiple times in different callstacks, et cetera, and represents a higher-level notion of context in the actual application logic, rather than being a detail of what actual functions are called.

I think we definitely want to provide integrations with as many other existing tools as possible. There are already several — tracing-coz, tracing-tracy, and tracing-flame, as well as the OpenTelemetry integration all come to mind, as well as tracing-wasm's support for browser perf analysis tools. However, one thing about these other tools is that they tend to be tuned for a particular use case. For example, tracy is intended for gamedev, and it has a first-class concept of "frames" (as in video, not as in stack frames), so it may not be suitable for debugging a microservice. In contrast, OpenTelemetry is a distributed tracing tool that makes request-response RPC models first class...which you probably don't want if you want to debug a game. And obviously, the browser performance profiling tools only make sense when you're running in the browser.

So, while integrations with other tools are very valuable, they don't really give us a solution that we can suggest to anyone who wants to be able to visualize trace data. For example, OpenTelemetry is a great option if you are implementing a microservice in a distributed application that already has all the infrastructure to use this data set up — if you're already running a Jaeger collector or something — but it's not something it would make sense to suggest to the rustc maintainers. This suggests that we might want to think about doing our own thing that's specific to tracing, in addition to supporting people who are implementing integrations.

I think that perhaps the best use-case to target with such a tool is interactive debugging. There are already several integrations that are more intended for performance profling use cases, like tracing-flame; these tend to generate a static representation that records a single run of the software (e.g. a flamegraph SVG). We might want to think about a tool for interactive, on-line debugging of a running system, and/or for interactively exploring a saved capture from a running system. In particular, I might want to prioritize a syntax for interactively filtering/querying the data, and controlling how it is formatted.

In re: your specific suggestions:

Use pprof. This might be kinda easy!

I think pprof is quite nice, and we should definitely have a layer for outputting the pprof format. However, it's very strongly geared towards profiling in particular, and its data model seems to emphasize performance data. I'm not sure if pprof output would be the ideal solution for an interactive debugging tool.

najamelan · 2020-08-10T19:12:51Z

I have started working on a simple web gui that can separate out the log from different async components side by side. This is what I have so far. It's far from finished, needs a lot of polish and features.

What you see is a log file from a program which has 3 components. A server and a client that send messages back and forth through a relay. It uses the tracing instrument method to give instrumented executors to each of these tasks so their log statements are annotated.

The first image shows the filter boxes with the names of the instrumented executors filled in. The second image shows a scroll down, where you can see how the program flow goes through the three components.

Some next steps I am planning:

use json input, which will allow putting the time stamps on the left, avoiding repeating them in each column as well as selecting log levels to show.
styling
make columns collapsable
have a column on the left that shows statements that are hidden on all other columns
get input from a cli app that you can just pipe into rather than loading a log file into the browser
...

najamelan · 2020-09-02T13:48:37Z

This is still a work in progress, but I'm traveling. Next week I will polish some more. What's missing right now:

automatic integration that doesn't require opening the log file with a browse button in the navigator. Maybe nicer with a native gui like gtk? For a browser frontend, after looking into it I found it wasn't worth the hassle as it requires running a separate server in the background or integrating a server in the process that does the logging.

It does have collapsible columns, detects log levels and colors them as well as letting you filter on them.

So I quickly published it on github pages so others can have a look at it: https://najamelan.github.io/tracing_prism/
You can find the code for now on my github profile.

It should be great if you have a wide screen, but I'm on a laptop right now so i haven't tested that yet myself. Give it a try.
If you want to run it offline, you can download the repository and checkout the gh-pages branch. Be sure to configure firefox to allow loading scripts on a "file://" link.

If you want to compile it yourself, change the dependencies of the thespis crates to git links pointing to the dev branches.

denismaxim0v · 2021-02-21T09:01:40Z

Should this still be open? I'd love to help working on this, seems like an interesting issue

najamelan · 2021-02-21T14:47:12Z

This reminded me that I forgot to update the post above. tracing_prism now does support JSON. I'm pretty happy with it personally. I looked into a more streamlined workflow than loading the log in the page with the browse button, but I felt that in the end it wasn't worth it, so I have put that on halt unless there is demand.

Of course people might prefer other ways of visualizing the logs, so I suppose @hawkw will get back to you explaining why she added the "help wanted" tag...

patrickelectric · 2023-11-17T20:07:49Z

It would be nice to have support to one of the formats that profiler.firefox support, like: https://docs.google.com/document/d/1CvAClvFfyA5R-PhYUmn5OOQtYMH4h6I0nSsKchNAySU/preview#heading=h.yr4qxyxotyw

davidbarsky added kind/feature New feature or request needs/design Additional design discussion is required. labels Aug 4, 2020

davidbarsky mentioned this issue Aug 4, 2020

Use tracing spans to trace the entire MIR interp stack rust-lang/rust#75143

Merged

hawkw changed the title ~~tracing: Provide of GUI/Console-Based Visualization of Spans and Events~~ tracing: Provide a GUI/Console-Based Visualization of Spans and Events Aug 4, 2020

hawkw pinned this issue Oct 27, 2020

hawkw added the help wanted Extra attention is needed label Oct 27, 2020

eddyb mentioned this issue Oct 27, 2020

Support a TUI "live view" version of -Z self-profile. rust-lang/rust#53630

Open

BohuTANG mentioned this issue Jun 10, 2021

[observability] http tracing databendlabs/databend#805

Closed

hawkw unpinned this issue Sep 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tracing: Provide a GUI/Console-Based Visualization of Spans and Events #884

tracing: Provide a GUI/Console-Based Visualization of Spans and Events #884

davidbarsky commented Aug 4, 2020

hawkw commented Aug 4, 2020 •

edited

Loading

najamelan commented Aug 10, 2020 •

edited

Loading

najamelan commented Sep 2, 2020 •

edited

Loading

denismaxim0v commented Feb 21, 2021

najamelan commented Feb 21, 2021

patrickelectric commented Nov 17, 2023

tracing: Provide a GUI/Console-Based Visualization of Spans and Events #884

tracing: Provide a GUI/Console-Based Visualization of Spans and Events #884

Comments

davidbarsky commented Aug 4, 2020

Feature Request

Motivation

Proposal

Alternatives

hawkw commented Aug 4, 2020 • edited Loading

najamelan commented Aug 10, 2020 • edited Loading

najamelan commented Sep 2, 2020 • edited Loading

denismaxim0v commented Feb 21, 2021

najamelan commented Feb 21, 2021

patrickelectric commented Nov 17, 2023

hawkw commented Aug 4, 2020 •

edited

Loading

najamelan commented Aug 10, 2020 •

edited

Loading

najamelan commented Sep 2, 2020 •

edited

Loading