You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently jaeger has problems with traces which exceed few hundred spans.
I noticed following:
searching for traces fetches very large payloads - basically /api/traces returns all matching traces along with spans, tags and logs, but the UI shows only a summary like number of spans per service, duration, etc. The search response should contain only the details required to render the screen. Currently the UI becomes very sluggish when the search result contains a large trace or does not render the results at all.
showing trace view for a very large trace has similar performance issues - the response payload is very large, the UI is sluggish or does not render.
In this case it would make sense to only fetch some part of the trace duration - e.g. first 200 spans. Or top 200 spans when counting distance from the root span. Some way to expand additional spans could allow to gradually navigate the trace. Or maybe some span filters would do the job.
another idea that comes to mind would involve dedicating some bits in the span id for "distance from root" to make it possible to search for the top spans efficiently
one more way to improve it would be to fetch spans without tags/log and fetch those only when "tags" or "logs" section on a span is "expanded"
paylods for large traces are pretty large (10MB, 100MB and more) and the browser has problems with processing them
I was able to view traces that had more than 1k spans (it was laggy and took some time), but not ~5k.
The traces I encountered were usually created by long iterative processes - e.g. recalculating something on thousands of records.
Another more pathological reason is bad communication design - an example would be a process which emits thousands of messages instead of using some batching.
I've tried on 1.0. It's much better - the search results appeared even though there was a large trace there - it showed it as having 10k spans (probably truncated?) with around 9MB (uncompressed) json. I also managed to open the trace view. It is slow and hangs sometimes for a fraction of a second, but it can be used to see something if needed.
Previously it wasn't working at all and was causing the browser's tab to hang. So - yes, it's much better now.
Currently jaeger has problems with traces which exceed few hundred spans.
I noticed following:
I was able to view traces that had more than 1k spans (it was laggy and took some time), but not ~5k.
The traces I encountered were usually created by long iterative processes - e.g. recalculating something on thousands of records.
Another more pathological reason is bad communication design - an example would be a process which emits thousands of messages instead of using some batching.
Related post on jaeger-tracing group: here
The text was updated successfully, but these errors were encountered: