
Explore the use of StackWalker in REST Client #42508

Closed
geoand opened this issue Aug 13, 2024 · 29 comments
Labels: area/rest-client, kind/enhancement, triage/wontfix

Comments

@geoand
Contributor

geoand commented Aug 13, 2024

Description

Currently in the REST Client, when quarkus.rest-client.capture-stacktrace is set to true (which is the default), we capture the entire stack trace of the Thread in order to enhance the debugging experience when something goes wrong.

However, @johnaohara has found that in some cases up to 20% of the user application's CPU time goes into capturing the stack trace. This is not only a waste of CPU cycles, but it also forces the JVM to bring the mutating threads to a safepoint on every call to the API.
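For context, a minimal sketch (not the actual REST Client implementation) of what a full stack capture looks like in Java; the class and method names are illustrative only:

```java
// Rough sketch of why full stack capture is costly: materializing a Throwable
// records every frame of the calling thread on each invocation.
public class FullStackCaptureSketch {

    static StackTraceElement[] captureFullStack() {
        // Filling in a Throwable walks the entire call stack of the current
        // thread; the deeper the application's stack, the more work per call.
        return new Throwable().getStackTrace();
    }

    public static void main(String[] args) {
        System.out.println("Captured " + captureFullStack().length + " frames");
    }
}
```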

Implementation ideas

No response

geoand added the kind/enhancement and area/rest-client labels on Aug 13, 2024

quarkus-bot bot commented Aug 13, 2024

/cc @cescoffier (rest-client)

@johnaohara
Member

Flamegraph of a sample application where quarkus.rest-client.capture-stacktrace is left at its default value of true:

[Screenshot: flamegraph with stack-trace capture enabled]

@geoand
Contributor Author

geoand commented Aug 13, 2024

Gotcha, thanks.

@geoand
Contributor Author

geoand commented Aug 13, 2024

We can also look into caching the part of the stacktrace we care about

geoand added a commit to geoand/quarkus that referenced this issue Aug 14, 2024
This is done by utilizing the StackWalker API
and limiting the number of frames captured

Closes: quarkusio#42508
geoand added a commit to geoand/quarkus that referenced this issue Aug 26, 2024
This is done by utilizing the StackWalker API
and limiting the number of frames captured

Closes: quarkusio#42508
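A minimal sketch of the approach described in the commit message above, walking the stack with the StackWalker API and capping the number of frames kept; the limit of 8 frames and the class name are illustrative, not necessarily what the PR uses:

```java
import java.util.List;
import java.util.stream.Collectors;

public class LimitedStackCapture {

    private static final int MAX_FRAMES = 8; // illustrative limit, not the PR's value

    static List<StackWalker.StackFrame> captureLimitedStack() {
        // walk() exposes the frames as a lazy Stream, so only MAX_FRAMES of them
        // are ever materialized, unlike new Throwable().getStackTrace().
        return StackWalker.getInstance()
                .walk(frames -> frames.limit(MAX_FRAMES).collect(Collectors.toList()));
    }

    public static void main(String[] args) {
        captureLimitedStack().forEach(frame ->
                System.out.println(frame.getClassName() + "#" + frame.getMethodName()));
    }
}
```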
@geoand
Contributor Author

geoand commented Aug 26, 2024

@johnaohara would you be able to test #42544?

The reason I ask is that I wasn't able to reproduce the initial results, so I don't have a baseline.

@johnaohara
Member

Hi @geoand, I did not have a chance last week. It is a public holiday here today; I can verify the patch tomorrow.

@geoand
Contributor Author

geoand commented Aug 26, 2024

Thanks!

@geoand
Contributor Author

geoand commented Sep 3, 2024

@johnaohara when you get a chance to look at #42544, I'd be happy to hear about the results :)

@johnaohara
Member

@geoand checking now

@geoand
Contributor Author

geoand commented Sep 3, 2024

🙏🏽

@johnaohara
Member

@geoand quick update on the testing so far. I tried with your PR branch but saw some extra call stacks that I was not expecting, so I backported the change to 3.12.2 (the version that produced the above baseline), and I am still seeing the unexpected stacks; see:

[Screenshot: flamegraph showing the unexpected call stacks]

I am looking at the test now to see why these stacks are showing up in the flamegraph.

@gsmet
Member

gsmet commented Sep 3, 2024

@johnaohara are you talking about the ones from ExtLogRecord?

If so, I reported it here: #42858 and it should be fixed in current main with the update of SmallRye Common to 2.6.0. The PR might need a rebase.

@johnaohara
Member

@gsmet yeah, that is what I was talking about. But I cherry-picked the change onto 3.12.2, so I expected it to disappear.

I just realized what I did wrong; I will try running the test again.

@gsmet
Member

gsmet commented Sep 3, 2024

This particular issue is in all 3.13.x and in 3.14.1. It should be fixed in 3.14.2.

@johnaohara
Member

@geoand sorry about the noise yesterday.
Testing with #42544:

The CPU time for processing the stack frames (in my particular test) dropped by roughly 10% with the new implementation (691 CPU samples -> 623 CPU samples, i.e. a reduction of 68/691 ≈ 9.8%).

Application code went from spending 21.2% of its CPU time processing the stack to 19.2% with the change. Although that is an improvement, there is still considerable overhead on each invocation.

Before:

[Screenshot: flamegraph before the change]

After:

[Screenshot: flamegraph after the change]

@geoand
Contributor Author

geoand commented Sep 4, 2024

Thanks a lot @johnaohara!

Application code went from spending 21.2% of its CPU time processing the stack to 19.2% with the change. Although that is an improvement, there is still considerable overhead on each invocation.

Indeed....

I wonder if we should change the default to not capture the stack... @cescoffier @gsmet WDYT?

@johnaohara
Member

Obviously, setting quarkus.rest-client.capture-stacktrace=false removes all the calls.

I don't know if the stack traces were intended to be used in a production deployment or just for development.
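For reference, a minimal application.properties sketch of the switch mentioned above, assuming the standard Quarkus configuration mechanism:

```properties
# Disables stack-trace capture for REST Client calls (the property discussed above).
quarkus.rest-client.capture-stacktrace=false
```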

@geoand
Contributor Author

geoand commented Sep 4, 2024

They were intended for both

@gsmet
Member

gsmet commented Sep 4, 2024

How confusing is the stack trace without it?

I wonder if we should have the following behavior:

  • Enabled in dev and test mode / disabled in prod mode by default
  • Make sure you have at least the name of the REST Client around even in prod mode - can we at least push this info?

Not sure if it's feasible but that might help?
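As a purely illustrative sketch of this proposal (these are not existing defaults), per-profile values could already be set by users today via Quarkus configuration profiles:

```properties
# Illustrative only: explicit profile-specific settings using the %dev/%test/%prod
# configuration profiles, mirroring the proposed default behaviour.
%dev.quarkus.rest-client.capture-stacktrace=true
%test.quarkus.rest-client.capture-stacktrace=true
%prod.quarkus.rest-client.capture-stacktrace=false
```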

@geoand
Contributor Author

geoand commented Sep 4, 2024

How confusing is the stack trace without it?

It's almost meaningless...

Enabled in dev and test mode / disabled in prod mode by default

Right, I thought of that one as well, but I am still not convinced it's a good idea.

Make sure you have at least the name of the REST Client around even in prod mode - can we at least push this info?

We can do that, yeah. Actually, we already have the class and the method, so although there would be no real stack trace, at least you would know which method is at fault...
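As a hypothetical illustration of that point (the interface, helper, and message below are made up and not the actual Quarkus code), surfacing the class and method without capturing a stack might look like:

```java
import java.lang.reflect.Method;

public class ClientErrorMessageSketch {

    // A made-up client interface standing in for a real REST Client.
    interface GreetingClient {
        String hello();
    }

    // Build an exception message that names the failing client method,
    // without walking or capturing the caller's stack.
    static RuntimeException invocationFailure(Method restClientMethod, Throwable cause) {
        String where = restClientMethod.getDeclaringClass().getName()
                + "#" + restClientMethod.getName();
        return new RuntimeException("REST Client call to " + where + " failed", cause);
    }

    public static void main(String[] args) throws Exception {
        Method m = GreetingClient.class.getMethod("hello");
        System.out.println(
                invocationFailure(m, new java.io.IOException("connection refused")).getMessage());
    }
}
```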

@gsmet
Member

gsmet commented Sep 4, 2024

Yeah, better than nothing and better than slowing down the whole app.

Now, that's not the only place where our stack traces are borderline useless, unfortunately :/

@geoand
Contributor Author

geoand commented Sep 4, 2024

True, so if @cescoffier is also on board, I can make the change

@geoand
Contributor Author

geoand commented Sep 4, 2024

Actually, we already print the method that causes the failure, so nothing needs to be done on that front

@cescoffier
Member

What's the gist of the work that needs to be done?

@geoand
Contributor Author

geoand commented Sep 4, 2024

We would only change the default of the property that controls whether or not we capture the real stack.

@cescoffier
Member

Ah ok, makes sense.

@geoand
Contributor Author

geoand commented Sep 4, 2024

I'll do it tomorrow

@geoand
Contributor Author

geoand commented Sep 5, 2024

#43037 changes the default

@geoand
Contributor Author

geoand commented Sep 5, 2024

I'm going to close this as won't fix, since changing the default pretty much makes this obsolete.

geoand closed this as not planned (won't fix) on Sep 5, 2024
geoand added the triage/wontfix label on Sep 5, 2024